Cerebras launches inference for Llama 3.1; benchmarked at 1846 tokens/s on 8B
by from Hacker News on (#6Q9E3)
| Source | RSS or Atom Feed |
| Feed Location | https://news.ycombinator.com/rss |
| Feed Title | Hacker News |
| Feed Link | https://news.ycombinator.com/ |
| Reply | 0 comments |