Article 6D30N Meta launches Llama 2, an open source AI model that allows commercial applications

Meta launches Llama 2, an open source AI model that allows commercial applications

by
Benj Edwards
from Ars Technica - All content on (#6D30N)
cyberllama-800x448.jpg

Enlarge / An AI-generated image of a cybernetic llama. (credit: Midjourney)

On Tuesday, Meta announced Llama 2, a new open source family of AI language models notable for its commercial license, which means the models can be integrated into commercial products, unlike its predecessor. They range in size from 7 to 70 billion parameters and reportedly "outperform open source chat models on most benchmarks we tested," according to Meta.

"This is going to change the landscape of the LLM market," tweeted Chief AI Scientist Yann LeCun. "Llama-v2 is available on Microsoft Azure and will be available on AWS, Hugging Face, and other providers."

According to Meta, its Llama 2 "pretrained" models (the bare-bones models) are trained on 2 trillion tokens and have a context window of 4,096 tokens (fragments of words). The context window determines the length of the content the model can process at once. Meta also says that the Llama 2 fine-tuned models, developed for chat applications similar to ChatGPT, have been trained on "over 1 million human annotations."

Read 10 remaining paragraphs | Comments

External Content
Source RSS or Atom Feed
Feed Location http://feeds.arstechnica.com/arstechnica/index
Feed Title Ars Technica - All content
Feed Link https://arstechnica.com/
Reply 0 comments