Article 6YYH3 Positron AI says its Atlas accelerator beats Nvidia H200 on inference in just 33% of the power — delivers 280 tokens per second per user with Llama 3.1 8B in 2000W envelope

Positron AI says its Atlas accelerator beats Nvidia H200 on inference in just 33% of the power — delivers 280 tokens per second per user with Llama 3.1 8B in 2000W envelope

ashilov@gmail.com (Anton Shilov)

from Latest from Tom's Hardware on 2025-07-28 16:38 (#6YYH3)

Cloudflare is testing Positron AI's Atlas machine based on Archer accelerators, an inference-only solution that claims to outperform Nvidia's H200 DGX using one-third the power.

External Content

Source	RSS or Atom Feed
Feed Location	https://www.tomshardware.com/feeds/all
Feed Title	Latest from Tom's Hardware
Feed Link	https://www.tomshardware.com/feeds.xml

0 comments