Article 6RDT3 Anthropic's Claude vulnerable to 'emotional manipulation'

Anthropic's Claude vulnerable to 'emotional manipulation'

from www.theregister.com - Articles on 2024-10-12 10:30 (#6RDT3)

AI model safety only goes so far

Anthropic's Claude 3.5 Sonnet, despite its reputation as one of the better behaved generative AI models, can still be convinced to emit racist hate speech and malware....

External Content

Source	RSS or Atom Feed
Feed Location	http://www.theregister.co.uk/headlines.atom
Feed Title	www.theregister.com - Articles
Feed Link	https://www.theregister.com/

0 comments