Elon Musk’s xAI is working on making Grok multimodal

by Kylie Robison, from The Verge - All Posts (#6MZCK)
Illustration by Kristen Radtke / The Verge; Getty Images

Elon Musk's AI company, xAI, is making progress on adding multimodal inputs to its Grok chatbot, according to public developer documents. This means users may soon be able to upload photos to Grok and receive text-based answers.

This was first teased last month in a blog post from xAI, which said Grok-1.5V will offer "multimodal models in a number of domains." The latest update to the developer documents appears to show progress on shipping a new model.

In the developer documents, a sample Python script demonstrates how developers can use the xAI software development kit library to generate a response based on both text and images. This script reads an image file, sets up a text prompt, and uses the xAI SDK to generate a...
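The script itself is not reproduced in this excerpt, so the following is only a rough sketch of the general pattern it describes: read an image file, pair it with a text prompt, and hand both to a model call. It does not use the xAI SDK. The function names `build_multimodal_request` and `fake_generate` and the field names in the request are hypothetical, and the model call is replaced by a local stub so the example runs without network access.

```python
import base64
from pathlib import Path


def build_multimodal_request(image_path: str, prompt: str) -> dict:
    """Bundle an image file and a text prompt into one request dictionary.

    The field names below are hypothetical; they are not taken from the
    xAI SDK or its developer documents.
    """
    path = Path(image_path)
    image_bytes = path.read_bytes()  # read the raw image from disk
    return {
        "prompt": prompt,
        "image_name": path.name,
        # Base64 keeps binary image data safe inside a text-based payload.
        "image_base64": base64.b64encode(image_bytes).decode("ascii"),
    }


def fake_generate(request: dict) -> str:
    """Hypothetical stand-in for a model call; it only summarizes the request."""
    size = len(base64.b64decode(request["image_base64"]))
    return (
        f"[stub] prompt={request['prompt']!r}, "
        f"image={request['image_name']} ({size} bytes)"
    )
```

In a real integration, the stub would be replaced by whatever call the SDK actually exposes, and the request shape would follow the SDK's own documentation rather than the fields assumed here.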
