this post was submitted on 19 Aug 2025
850 points (99.3% liked)

Technology

[–] dzajew@piefed.social 7 points 2 days ago

Cry me a river

[–] poopkins@lemmy.world -3 points 1 day ago* (last edited 1 day ago) (2 children)

I've developed my own agent for assisting me with researching a topic I'm passionate about, and I ran into the exact same barrier: Cloudflare intercepts my request and is clearly checking if I'm a human using a web browser. (For my network requests, I've defined my own user agent.)

So I use that as a signal that the website doesn't want automated tools scraping their data. That's fine with me: my agent just tells me that there might be interesting content on the site and gives me a deep link. I can extract the data and carry on my research on my own.
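A minimal sketch of the fallback described above, in Python. It is not the commenter's actual agent: the names `FetchResult`, `looks_like_challenge`, and `handle_response` are hypothetical. Cloudflare documents a `cf-mitigated: challenge` response header on challenge pages; the status-code check is an extra heuristic I'm assuming, and it may misfire on ordinary error pages.

```python
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass
class FetchResult:
    url: str                 # deep link to hand to the human
    content: Optional[str]   # page text when the fetch was allowed
    needs_human: bool        # True when the site appears to block automation


def looks_like_challenge(status: int, headers: Mapping[str, str]) -> bool:
    """Heuristic: does this response look like a bot challenge?"""
    lowered = {k.lower(): v.lower() for k, v in headers.items()}
    # Header Cloudflare sets on challenge-page responses.
    if lowered.get("cf-mitigated") == "challenge":
        return True
    # Assumption: a 403/429/503 served by Cloudflare is treated as a block.
    return status in (403, 429, 503) and lowered.get("server") == "cloudflare"


def handle_response(url: str, status: int,
                    headers: Mapping[str, str], body: str) -> FetchResult:
    """Respect the block: no retries, no spoofing, just report the link."""
    if looks_like_challenge(status, headers):
        return FetchResult(url=url, content=None, needs_human=True)
    return FetchResult(url=url, content=body, needs_human=False)
```

The point of the design is that a challenge is treated as an answer ("no automated access"), not as an obstacle to work around: the agent discards the body and surfaces the URL so the human can open it in a browser.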

I completely understand where Perplexity is coming from, but at scale, implementations like Perplexity's are awful for the web.

(Edited for clarity)

[–] starchylemming@lemmy.world 3 points 2 days ago

next step: cloudflare sends hit squads to blow up the source of these slimy data grabber attacks
