Google Rolls Out Updated AI Model Capable of Handling Longer Text, Video

msmash

from Slashdot on 2024-02-15 16:00 (#6JNE8)

An anonymous reader shares a report: Alphabet's Google is rolling out a new version of its powerful artificial intelligence model that it says can handle larger amounts of text and video than products made by competitors. The updated AI model, called Gemini 1.5 Pro, will be available on Thursday to cloud customers and developers so they can test its new features and eventually create new commercial applications. Google and its rivals have spent billions to ramp up their capabilities in generative AI and are keen to attract corporate clients to show their investments are paying off. [...] Gemini 1.5 can be trained faster and more efficiently, and has the ability to process a huge amount of information each time it's prompted, according to [Google DeepMind's Oriol] Vinyals. For example, developers can use Gemini 1.5 Pro to query up to an hour's worth of video, 11 hours of audio or more than 700,000 words in a document, an amount of data that Google says is the "longest context window" of any large-scale AI model yet. Gemini 1.5 can process far more data compared with what the latest AI models from OpenAI and Anthropic can handle, according to Google. In a pre-recorded video demonstration for reporters, Google showed off how engineers asked Gemini 1.5 Pro to ingest a 402-page PDF transcript of the Apollo 11 moon landing, and then prompted it to find quotes that showed "three funny moments."
