Article 76HT5 OpenAI and Broadcom announce chip designed for LLM inference at scale

OpenAI and Broadcom announce chip designed for LLM inference at scale

by
Samuel Axon
from Ars Technica - All content on (#76HT5)

OpenAI, the company behind ChatGPT and Codex and the models those tools utilize, and Broadcom, an established silicon supplier, have announced a new chip called Jalapeno, designed specifically for large language model inference in data centers.

The chip is intended to be deployed at large data centers, both companies claim this is just the first generation in a long-term project that will see chips refined over time.

Read full article

Comments

External Content
Source RSS or Atom Feed
Feed Location http://feeds.arstechnica.com/arstechnica/index
Feed Title Ars Technica - All content
Feed Link https://arstechnica.com/
Reply 0 comments