Ask Lemmy
A Fediverse community for open-ended, thought provoking questions
Rules: (interactive)
1) Be nice and; have fun
Doxxing, trolling, sealioning, racism, and toxicity are not welcomed in AskLemmy. Remember what your mother said: if you can't say something nice, don't say anything at all. In addition, the site-wide Lemmy.world terms of service also apply here. Please familiarize yourself with them
2) All posts must end with a '?'
This is sort of like Jeopardy. Please phrase all post titles in the form of a proper question ending with ?
3) No spam
Please do not flood the community with nonsense. Actual suspected spammers will be banned on site. No astroturfing.
4) NSFW is okay, within reason
Just remember to tag posts with either a content warning or a [NSFW] tag. Overtly sexual posts are not allowed, please direct them to either !asklemmyafterdark@lemmy.world or !asklemmynsfw@lemmynsfw.com.
NSFW comments should be restricted to posts tagged [NSFW].
5) This is not a support community.
It is not a place for 'how do I?', type questions.
If you have any questions regarding the site itself or would like to report a community, please direct them to Lemmy.world Support or email info@lemmy.world. For other questions check our partnered communities list, or use the search function.
6) No US Politics.
Please don't post about current US Politics. If you need to do this, try !politicaldiscussion@lemmy.world or !askusa@discuss.online
Reminder: The terms of service apply here too.
Partnered Communities:
Logo design credit goes to: tubbadu
view the rest of the comments
I came here specifically to get away from the chatbot daycare hellhole that reddit became. Share some of your insights about these accounts and I'll tell you a little about why reddit got so bad. Fediverse doesn't really offer the same kind of incentive to somebody who's trying to train an LLM on comments but who knows.
On reddit, the biggest incentive for people to want to train LLM's is just the sheer amount of data there. Reddit is insanely big and the karma system is basically a "weight" value similar to how neural networks already categorize info. Even if somebody notices the obvious bot account, enough people there will still interact with the bot sincerely that it gets the interaction it's trying to provoke every time.
Also it's easy as hell to set one up to run on reddit. Simply verify an email address, subscribe to r/newtoreddit and and bunch of other subs that don't require karma to comment, and then only give votes for the first month before finally starting to leave comments. Reddit claims to screen for bot accounts but deviating from this specific pattern of conduct is something that gets new users comments flagged for review. Reddit is actually only screening real people.
If you want to talk real tinfoil hat shit, this is probably by design. Chatbots drive up traffic and interaction not just with eachother but specifically with the humans that will also severely inflate usage statistics to look good to advertisers. the ones who leave comments following common "redditisms" and patterns of discussion over and over and over and never get sick of saying the same things.
Basically, I'm hoping none of these conditions exist here. So far doesn't seem like it since fediverse isn't hiding ads as posts, blocking VPN users, or taking such a heavy handed involvement in moderation.
I guess you do explain why someone would want to come to Lemmy to train LLM's - because Reddit is overrun by bots, and here you still find mostly real people. Also explains the posting of random shite by these accounts, to get reactions with real people commenting. Sorry if I sound rather naive, it's because I am. I wasn't meant to exist in this timeline.
That is a possibility. Data from interacting with actual humans reduces the rate of model degradation. Maybe somebody does feel like they would get better results here. But they'd have to go to the trouble of sending requests to join instances and federate communities. It's not a whole lot of work but it's slightly more overhead for a website that gets way less hits than reddit as of now.
You're not naive dude, you're living in unprecedented times. It's sad to see people get jumpy at the idea that all of our interactions are becoming simulations of real ones but in some places it literally happened. I don't even fuck with instagram, facebook, or tiktok because I've seen the brainrot there that got created because the platform incentivised it. Stay curious and don't let the bastards grind you down👍