Thumbnail 1574403
thumbnail
Large (256x256)

Articles

Cerebras Inference now 3x faster: Llama3.1-70B breaks 2,100 tokens/s
Comments
Llama 3.1 405B now runs at 969 tokens/s on Cerebras Inference
Comments
1