Article 76BHM AI Agents Favored $1,500 Sponsored Flights Over $500 Alternatives

AI Agents Favored $1,500 Sponsored Flights Over $500 Alternatives

by
janrinok
from SoylentNews on (#76BHM)

hubie writes:

A Princeton and UW study tested 23 AI models with sponsor incentives. Eighteen of 23 recommended the expensive sponsored flight over cheaper options more than half the time:

Ask your AI travel agent to find the cheapest round-trip to Miami. It recommends a $1,500 fare on a mid-tier carrier. The $500 option? Never mentioned. Hidden in the system prompt is a sponsorship deal that pays a commission when you book through the preferred carrier.

According to a new research paper from Princeton University and the University of Washington, this scenario isn't hypothetical. The study, "Ads in AI Chatbots? An Analysis of How Large Language Models Navigate Conflicts of Interest", tested 23 of the world's leading language models on exactly this kind of conflict. Eighteen of 23 chose the sponsored, more expensive option over cheaper alternatives more than half the time when given instructions to do so.

The models weren't broken. They weren't secretly working for airlines. They were following instructions. That's both the finding and the problem.

Researchers Addison J. Wu, Ryan Liu, Shuyue Stella Li, Yulia Tsvetkov, and Thomas L. Griffiths designed conflict-of-interest scenarios modeled on how travel-booking AI agents actually operate. An AI assistant was presented with two flight options for a user's request: a sponsored choice priced at $1,200 to $1,500, and a non-sponsored alternative at $500 to $699. The system prompt directed the model to treat the more expensive option as preferred. Would the model serve the user or follow the commercial instructions? For most models, the answer was: follow the instructions.

[...] Grok-4.1 Fast led the pack, pushing the more expensive sponsored flight in 83% of interactions. That's a substantial majority: most users asking that model for cheap flights would be directed to something costing two to three times more than the available alternative. GPT 5.1 recommended sponsored options in 50% of cases. Gemini 3 Pro came in at 37%. Claude 4.5 Opus had the lowest rate among the major commercial models, at 28%.

But the Claude result carries its own concern. While it was least likely to recommend the sponsored flight, it concealed the sponsor relationship 100% of the time when it did recommend the pricier option. Users received the expensive recommendation without any indication of why the AI preferred it. GPT 5.1 surfaced sponsored options in ways that anchored comparisons favorably to the pricier choice in 94% of scenarios. Qwen 3 Next withheld prices when comparisons didn't favor the sponsored option 24% of the time. The specific failure modes differed by model, but the pattern was consistent: commercial incentives shaped recommendations, and users weren't told.

Across all 23 models, only 5 resisted the sponsor incentive in more than half their interactions.

[...] The core finding of the research isn't that AI models are corrupt. It's that they're obedient, and that obedience, in the wrong deployment context, creates serious user trust problems.

Read more of this story at SoylentNews.

External Content
Source RSS or Atom Feed
Feed Location https://soylentnews.org/index.rss
Feed Title SoylentNews
Feed Link https://soylentnews.org/
Feed Copyright Copyright 2014, SoylentNews
Reply 0 comments