Article 74W30 AI models are terrible at betting on soccer—especially xAI Grok

AI models are terrible at betting on soccer—especially xAI Grok

by
Tim Bradshaw, Financial Times
from Ars Technica - All content on (#74W30)

AI models from Google, OpenAI, and Anthropic lost money betting on soccer matches over a Premier League season, in a new study suggesting even the most advanced systems struggle to analyze the real world over long periods.

The KellyBench" report released this week by AI start-up General Reasoning highlights the gap between AI's rapidly advancing capabilities in certain tasks, such as writing software, and its shortcomings in other kinds of human problems.

London-based General Reasoning tested eight top AI systems in a virtual re-creation of the 2023-24 Premier League season, providing them with detailed historical data and statistics about each team and previous games. The AIs were instructed to build models that would maximize returns and manage risk.

Read full article

Comments

External Content
Source RSS or Atom Feed
Feed Location http://feeds.arstechnica.com/arstechnica/index
Feed Title Ars Technica - All content
Feed Link https://arstechnica.com/
Reply 0 comments