Article 6H2KQ Boffins fool AI chatbot into revealing harmful content – with 98 percent success rate

Boffins fool AI chatbot into revealing harmful content – with 98 percent success rate

by
from The Register on (#6H2KQ)
Story ImageThis one weird trick works every time, most of the time

Investigators at Indiana's Purdue University have devised a way to interrogate large language models (LLMs) in a way that that breaks their etiquette training - almost all the time....

External Content
Source RSS or Atom Feed
Feed Location http://www.theregister.co.uk/headlines.atom
Feed Title The Register
Feed Link https://www.theregister.com/
Feed Copyright Copyright © 2024, Situation Publishing
Reply 0 comments