<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"><channel><title>fp4</title><description>The engineering layer of AI. Deep technical coverage of LLM, GPU, and ML systems internals.</description><link>https://fp4.dev/</link><language>en-us</language><item><title>H100 vs H200 vs B200: TCO for Inference Infrastructure</title><link>https://fp4.dev/silicon/h100-h200-b200-tco/</link><guid isPermaLink="true">https://fp4.dev/silicon/h100-h200-b200-tco/</guid><description>Beyond the spec sheet: deriving actual cost per million tokens for each generation, accounting for memory capacity, bandwidth, rack power, and cooling — the numbers that determine your infrastructure decision.</description><pubDate>Mon, 22 Jun 2026 00:00:00 GMT</pubDate><category>silicon</category><category>h100</category><category>h200</category><category>b200</category><category>tco</category><category>inference</category><category>hardware</category><category>cost</category></item><item><title>Intra-node vs Inter-node Interconnects in Distributed Training</title><link>https://fp4.dev/system/distributed-interconnects/</link><guid isPermaLink="true">https://fp4.dev/system/distributed-interconnects/</guid><description>NVLink, NVSwitch, InfiniBand, and RoCE — the bandwidth and latency numbers that determine whether your distributed training job scales or stalls.</description><pubDate>Sat, 20 Jun 2026 00:00:00 GMT</pubDate><category>system</category><category>nvlink</category><category>infiniband</category><category>roce</category><category>distributed-training</category><category>collective-ops</category><category>nccl</category></item><item><title>GPU Memory Hierarchy and Kernel Performance</title><link>https://fp4.dev/silicon/gpu-memory-hierarchy/</link><guid isPermaLink="true">https://fp4.dev/silicon/gpu-memory-hierarchy/</guid><description>Why memory bandwidth — not FLOPs — is the binding constraint for most LLM workloads, and how H100&apos;s five-level hierarchy determines what your kernels can actually achieve.</description><pubDate>Thu, 18 Jun 2026 00:00:00 GMT</pubDate><category>silicon</category><category>gpu</category><category>memory</category><category>hbm</category><category>sram</category><category>bandwidth</category><category>kernels</category></item></channel></rss>