Article 71HSX XAI Releases Grok 4.1 and It Tops the LMArena Leaderboard

XAI Releases Grok 4.1 and It Tops the LMArena Leaderboard

by
Brian Wang
from NextBigFuture.com on (#71HSX)
In LMArena, Grok4.1 (Thinking) and Grok4.1 ranks first. In the earlier benchmark tests, Grok4.1 (Thinking) ranked first with a score of 1510. Currently, it is still first but with a score of 1483. Grok 4.1 is second. There is a massive reduction in hallucination. It drops from 12% to about 4%. This version scored more ...

Read more

External Content
Source RSS or Atom Feed
Feed Location http://feeds.feedburner.com/blogspot/advancednano
Feed Title NextBigFuture.com
Feed Link https://www.nextbigfuture.com/
Reply 0 comments