Article 6MXV6 AI chatbots’ safeguards can be easily bypassed, say UK researchers

AI chatbots’ safeguards can be easily bypassed, say UK researchers

by
Dan Milmo Global technology editor
from Technology | The Guardian on (#6MXV6)

All five systems tested were found to be highly vulnerable' to attempts to elicit harmful responses

Guardrails to prevent artificial intelligence models behind chatbots from issuing illegal, toxic or explicit responses can be bypassed with simple techniques, UK government researchers have found.

The UK's AI Safety Institute (AISI) said systems it had tested were highly vulnerable" to jailbreaks, a term for text prompts designed to elicit a response that a model is supposedly trained to avoid issuing.

Continue reading...
External Content
Source RSS or Atom Feed
Feed Location http://www.theguardian.com/technology/rss
Feed Title Technology | The Guardian
Feed Link https://www.theguardian.com/us/technology
Feed Copyright Guardian News and Media Limited or its affiliated companies. All rights reserved. 2024
Reply 0 comments