Article 70WCZ Alibaba Cloud says it cut Nvidia AI GPU use by 82% with new pooling system— up to 9x increase in output lets 213 GPUs perform like 1,192

Alibaba Cloud says it cut Nvidia AI GPU use by 82% with new pooling system— up to 9x increase in output lets 213 GPUs perform like 1,192

by
from Latest from Tom's Hardware on (#70WCZ)
Story ImageAlibaba Cloud claims its new Aegaeon pooling system reduced the number of Nvidia GPUs required to serve large language models by 82% during a multi-month beta test inside its Model Studio marketplace.
External Content
Source RSS or Atom Feed
Feed Location https://www.tomshardware.com/feeds/all
Feed Title Latest from Tom's Hardware
Feed Link https://www.tomshardware.com/
Reply 0 comments