this post was submitted on 11 Aug 2025
889 points (99.0% liked)

Technology

[–] Midnight1938@reddthat.com 5 points 15 hours ago (1 children)

Circumventing sites' "no AI scraping" rules

[–] General_Effort@lemmy.world 1 points 15 hours ago (1 children)

And what do I care about Reddit getting paid?

If the IA doesn't complain about being used, then it's fine by me. The ideal outcome would be for the archive to make some arrangement where it scrapes the data and provides it to everyone. That way, sites only get scraped once instead of being constantly hammered.

[–] buddascrayon@lemmy.world 2 points 4 hours ago

Plenty of sites out there that aren't owned by major conglomerates have norobots and noscrape tags, and AI companies can use the Wayback Machine to circumvent those policies.

This isn't about Reddit. It's about AI companies stealing everything on the internet and then selling it back to you while taking your job away.