Article 6FGH5 AI safety guardrails easily thwarted, security study finds

AI safety guardrails easily thwarted, security study finds

by
from The Register on (#6FGH5)
Story ImageOpenAI GPT-3.5 Turbo chatbot defenses dissolve with '20 cents' of API tickling

The "guardrails" created to prevent large language models (LLMs) such as OpenAI's GPT-3.5 Turbo from spewing toxic content have been shown to be very fragile....

External Content
Source RSS or Atom Feed
Feed Location http://www.theregister.co.uk/headlines.atom
Feed Title The Register
Feed Link https://www.theregister.com/
Feed Copyright Copyright © 2024, Situation Publishing
Reply 0 comments