Article 6DNR2 How to spot OpenAI's crawler bot and stop it slurping sites for training data

from The Register (#6DNR2)
Story image: Aww, c'mon, let us scrape your pages, we've got billions at stake

OpenAI, the maker of machine learning models trained on public web data, has published the specifications for its web crawler so that publishers and site owners can opt out of having their content scraped....
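The opt-out mentioned above works through robots.txt. As a rough sketch (not taken from the article itself), the Python below checks whether a robots.txt file would turn the crawler away, and does a naive spot-check of a request's User-Agent header. It assumes the crawler identifies itself with the token "GPTBot", as OpenAI's published documentation describes; the sample robots.txt, the example.com URLs, the sample header string, and both helper names are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumption: "GPTBot" is the user-agent token OpenAI documents for its crawler.
CRAWLER_TOKEN = "GPTBot"

# Hypothetical robots.txt: block the crawler site-wide, allow everyone else.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: *
Allow: /
"""


def is_blocked(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt forbids user_agent from fetching url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return not parser.can_fetch(user_agent, url)


def looks_like_crawler(user_agent_header: str) -> bool:
    """Naive spot-check: does the User-Agent header mention the crawler token?"""
    return CRAWLER_TOKEN.lower() in user_agent_header.lower()


if __name__ == "__main__":
    page = "https://example.com/article"  # hypothetical URL
    print(is_blocked(ROBOTS_TXT, CRAWLER_TOKEN, page))
    print(is_blocked(ROBOTS_TXT, "SomeOtherBot", page))
    # Hypothetical header value, for illustration only.
    print(looks_like_crawler("Mozilla/5.0 (compatible; GPTBot/1.0)"))
```

A User-Agent header is trivially spoofed, so the header check only identifies crawlers that announce themselves honestly, and robots.txt is a request that well-behaved bots honor rather than an enforcement mechanism.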

External Content
Source: RSS or Atom Feed
Feed Location: http://www.theregister.co.uk/headlines.atom
Feed Title: The Register
Feed Link: https://www.theregister.com/
Feed Copyright: Copyright © 2024, Situation Publishing