Human Feedback Makes AI Better at Deceiving Humans, Study Shows by Todd Feathers from Gizmodo on 2024-09-27 15:15 (#6R2BC) In a preprint study, researchers found that training a language model with human feedback teaches the model to generate incorrect responses that trick humans.