Anthropic: All the major AI models will blackmail us if pushed hard enough

from The Register (#6Y7DA)
Just like people

Anthropic published research last week showing that all major AI models may resort to blackmail to avoid being shut down, but the researchers essentially pushed them into the undesired behavior through a series of artificial constraints that forced them into a binary decision...

External Content
Source: RSS or Atom Feed
Feed Location: http://www.theregister.co.uk/headlines.atom
Feed Title: The Register
Feed Link: https://www.theregister.com/
Feed Copyright: Copyright © 2025, Situation Publishing