Hackers are learning to exploit chatbot ‘personalities’

by Robert Hart, from The Verge (#75V83)

This is The Stepback, a weekly newsletter breaking down one essential story from the tech world. For more on AI mischief, follow Robert Hart. The Stepback arrives in our subscribers' inboxes at 8AM ET.

How it started

Hacking the first generation of AI chatbots was a laughably simple affair. You didn't need any technical know-how, backdoor access, or even a basic understanding of what a large language model was. You didn't need to code. To get an AI system that had cost billions to build to abandon its safety instructions, sometimes all you had to do was ask.

These attacks, known as jailbreaks, had the quality ...

Read the full story at The Verge.
