XAI Releases Grok 4.1 and It Tops the LMArena Leaderboard
by Brian Wang from NextBigFuture.com on (#71HSX)
In LMArena, Grok4.1 (Thinking) and Grok4.1 ranks first. In the earlier benchmark tests, Grok4.1 (Thinking) ranked first with a score of 1510. Currently, it is still first but with a score of 1483. Grok 4.1 is second. There is a massive reduction in hallucination. It drops from 12% to about 4%. This version scored more ...