Article 6ZPZ0 LegalPwn: Tricking LLMs by burying badness in lawyerly fine print

from The Register (#6ZPZ0)
Trust and believe - AI models trained to see 'legal' doc as super legit

Researchers at security firm Pangea have discovered yet another way to trivially trick large language models (LLMs) into ignoring their guardrails. Stick your adversarial instructions somewhere in a legal document to give them an air of unearned legitimacy - a trick familiar to lawyers the world over....

External Content
Source: RSS or Atom Feed
Feed Location: http://www.theregister.co.uk/headlines.atom
Feed Title: The Register
Feed Link: https://www.theregister.com/
Feed Copyright: Copyright © 2025, Situation Publishing