Largest Dataset Powering AI Images Removed After Discovery of Child Sexual Abuse Material
samleecole writes: The LAION-5B machine learning dataset used by Google, Stable Diffusion, and other major AI products has been removed by the organization that created it after a Stanford study found that it contained 3,226 suspected instances of child sexual abuse material, 1,008 of which were externally validated. LAION told 404 Media on Tuesday that out of "an abundance of caution," it was taking down its datasets temporarily "to ensure they are safe before republishing them." According to a new study by the Stanford Internet Observatory shared with 404 Media ahead of publication, the researchers found the suspected instances of CSAM through a combination of perceptual and cryptographic hash-based detection and analysis of the images themselves.
Read more of this story at Slashdot.