Article 70CKJ When AI is trained for treachery, it becomes the perfect agent

When AI is trained for treachery, it becomes the perfect agent

by
from The Register on (#70CKJ)
Story ImageWe're blind to malicious AI until it hits. We can still open our eyes to stopping it

Opinion Last year, The Register reported on AI sleeper agents. A major academic study explored how to train an LLM to hide destructive behavior from its users, and how to find it before it triggered. The answers were unambiguously asymmetric - the first is easy, the second very difficult. Not what anyone wanted to hear....

External Content
Source RSS or Atom Feed
Feed Location http://www.theregister.co.uk/headlines.atom
Feed Title The Register
Feed Link https://www.theregister.com/
Feed Copyright Copyright © 2025, Situation Publishing
Reply 0 comments