971
Reddit stock falls for second day as references to its content in ChatGPT responses plummet
(finance.yahoo.com)
This is a most excellent place for technology news and articles.
heh I wonder if all the "old" content getting messed with and/or removed is causing issues with the algorithm/scraper.
For unauthorized scrapers? Definitely
For paid API usage? That tends to not be public for obvious reasons but, allegedly, people have, allegedly, done tests and found "deleted" content in the results.
Ive heard the same but I haven't seen real evidence anywhere, so im skeptical. But yes I agree, if they CAN get that data, it means the training data is better-ish....
But we are still on this site for a reason :)
It's all relative I guess. I can see why the original GPT's used the Reddit corpus for training. However I've always been a little sceptical about the quality of the training set in any social media given how much it exaggerates the extremes of people's behaviour.