Discussion about this post

User's avatar
WZ's avatar
Jan 9Edited

I just want to say I enjoyed the reading for all available 6 episodes at this point. Best series that put the two camps of idea/architecture side by side for comparison and more importantly to understand the reasons behind the design choices.

The AI Architect's avatar

Exceptional breakdown. The Rail-Optimized topology insight is underrated because most people still think about AI networking through datacenter lens when it's fundamentally diferent. What struck me is how NVIDIA's roadmap (72 to 572 GPUs per rack) is basically them admitting that the less traffic hits Domain 2, teh better, which kinda validates Google's "make Domain 1 huge" strategy from the opposite direction.

No posts

Ready for more?