When AI is trained for treachery, it becomes the perfect agent

Opinion Last year, The Register reported on AI sleeper agents. A major academic study explored how to train an LLM to hide destructive behavior from its users, and how to find that behavior before it triggered. The answers were unambiguously asymmetric: the first is easy, the second very difficult. Not what anyone wanted to hear.