What are the platforms on the Fediverse doing to prevent data scraping and prevent bots?
(piefed.blahaj.zone)
A community to talk about the Fediverse and all its related services using ActivityPub (Mastodon, Lemmy, Mbin, etc).
If you want help moderating your own community, head over to !moderators@lemmy.world!
Learn more at these websites: Join The Fediverse Wiki, Fediverse.info, Wikipedia Page, The Federation Info (Stats), FediDB (Stats), Sub Rehab (Reddit Migration)
I don't think Anthropic or OpenAI have spent the time developing a custom ingest pipeline for such a small dataset. It doesn't seem like it'd give much of a return on investment.
I dunno, we had 1.8 billion posts and 50 million comments from 1.1 million MAUs in June, according to Fediverse Observer. It's not nothing.
Yeah, for them that's small potatoes.
Given that they are scrabbling around like drug addicts looking for anything they've spilled, including checking the cracks in the floorboards...
For some models, it's obvious they've long since scraped the erotic fan fic sites!