XAI Grok 1.5 Vision Leverages Tesla FSD to Understand the Real World
by Brian Wang from NextBigFuture.com on (#6M80P)
Grok-1.5V is competitive with existing frontier multimodal models in a number of domains, ranging from multi-disciplinary reasoning to understanding documents, science diagrams, charts, screenshots, and photographs. Grok has capabilities in understanding our physical world. Grok outperforms its peers in our new RealWorldQA benchmark that measures real-world spatial understanding. For all datasets below, they evaluate Grok ...