Jun 29, 2026
Distributed CPU AI: MPI, RDMA, NUMA, and C-Kernel-Engine
Systems note This post moves from model math into the machine that actually runs the kernels: cores, caches, NUMA domains, MPI ranks, RDMA-capable fabrics, and the memory-layout discipline n...
Read post →