Article 6MG06 Intel, Ampere show running LLMs on CPUs isn't as crazy as it sounds

Intel, Ampere show running LLMs on CPUs isn't as crazy as it sounds

by
from The Register on (#6MG06)
Story ImageIf you lower you expectations, of course. Think more Llama2-7B, less GPT-4

Popular generative AI chatbots and services like ChatGPT or Gemini mostly run on GPUs or other dedicated accelerators, but as smaller models are more widely deployed in the enterprise, CPU-makers Intel and Ampere are suggesting their wares can do the job too - and their arguments aren't entirely without merit....

External Content
Source RSS or Atom Feed
Feed Location http://www.theregister.co.uk/headlines.atom
Feed Title The Register
Feed Link https://www.theregister.com/
Feed Copyright Copyright © 2024, Situation Publishing
Reply 0 comments