Cerebras Inference now 3x faster: Llama3.1-70B breaks 2,100 tokens/s
from Hacker News (#6RQRE)
Source | RSS or Atom Feed |
Feed Location | https://news.ycombinator.com/rss |
Feed Title | Hacker News |
Feed Link | https://news.ycombinator.com/ |