• Nvidia and xAI collaborate on Colossus development
  • xAI has markedly cut down ‘flow collisions’ during AI model training
  • Spectrum-X has been crucial in training the Grok AI model family

Nvidia has shed light on how xAI’s ‘Colossus’ supercomputer cluster can keep a handle on 100,000 Hopper GPUs – and it’s all down to using the chipmaker’s Spectrum-X Ethernet networking platform.

Spectrum-X, the company revealed, is designed to provide massive performance capabilities to multi-tenant, hyperscale AI factories using its Remote Directory Memory Access (RDMA) network.



Source link

Leave a Reply

Your email address will not be published. Required fields are marked *