Latest · June 21, 2026

Everyone's Watching Training Move. Inference Can't Follow.

xAI built a 100,000-GPU cluster in Memphis in 122 days because the city had power and could deliver it fast. That choice marks a permanent split in AI compute into two machines that obey different laws of physics.

9 min read
[Illustration: Why AI compute is cleaving into a power-bound training core and a latency-bound inference edge]
"Because these models run on our edge network, inference happens close to your users, which reduces latency."
Cloudflare · Workers AI

From the archive

Contact

Found a mistake, or have something to share?

Email admin@secondorderlabs.com. We read what comes in and reply when we can.