Article 6RV2D xAI picked Ethernet over InfiniBand for its H100 Colossus training cluster

xAI picked Ethernet over InfiniBand for its H100 Colossus training cluster

by
from The Register on (#6RV2D)
Story ImageWork already underway to expand system to 200,000 Nvidia Hopper chips

Unlike most AI training clusters, xAI's Colossus with its 100,000 Nvidia Hopper GPUs doesn't use InfiniBand. Instead, the massive system, which Nvidia bills as the "world's largest AI supercomputer," was built using the GPU giant's Spectrum-X Ethernet fabric....

External Content
Source RSS or Atom Feed
Feed Location http://www.theregister.co.uk/headlines.atom
Feed Title The Register
Feed Link https://www.theregister.com/
Feed Copyright Copyright © 2024, Situation Publishing
Reply 0 comments