Meta's AI safety system defeated by the space bar

from www.theregister.com - Articles on 2024-07-29 21:01 (#6PK23)

'Ignore previous instructions' thwarts Prompt-Guard model if you just add some good ol' ASCII code 32

Meta's machine-learning model for detecting prompt injection attacks - special prompts to make neural networks behave inappropriately - is itself vulnerable to, you guessed it, prompt injection attacks....

Source	RSS or Atom Feed
Feed Location	http://www.theregister.co.uk/headlines.atom
Feed Title	www.theregister.com - Articles
Feed Link	https://www.theregister.com/