World News

52530 readers

2520 users here now

A community for discussing events around the World

Rules:

Rule 1: posts have the following requirements:
- Post news articles only
- Video links are NOT articles and will be removed.
- Title must match the article headline
- Not United States Internal News
- Recent (Past 30 Days)
- Screenshots/links to other social media sites (Twitter/X/Facebook/Youtube/reddit, etc.) are explicitly forbidden, as are link shorteners.
Rule 2: Do not copy the entire article into your post. The key points in 1-2 paragraphs is allowed (even encouraged!), but large segments of articles posted in the body will result in the post being removed. If you have to stop and think "Is this fair use?", it probably isn't. Archive links, especially the ones created on link submission, are absolutely allowed but those that avoid paywalls are not.
Rule 3: Opinions articles, or Articles based on misinformation/propaganda may be removed.
Rule 4: Posts or comments that are homophobic, transphobic, racist, sexist, anti-religious, or ableist will be removed. “Ironic” prejudice is just prejudiced.
Posts and comments must abide by the lemmy.world terms of service UPDATED AS OF OCTOBER 19 2025
Rule 5: Keep it civil. It's OK to say the subject of an article is behaving like a (pejorative, pejorative). It's NOT OK to say another USER is (pejorative). Strong language is fine, just not directed at other members. Engage in good-faith and with respect! This includes accusing another user of being a bot or paid actor. Trolling is uncivil and is grounds for removal and/or a community ban.

Similarly, if you see posts along these lines, do not engage. Report them, block them, and live a happier life than they do. We see too many slapfights that boil down to "Mom! He's bugging me!" and "I'm not touching you!" Going forward, slapfights will result in removed comments and temp bans to cool off.

Rule 6: Memes, spam, other low effort posting, reposts, misinformation, advocating violence, off-topic, trolling, offensive, regarding the moderators or meta in content may be removed at any time.
Rule 7: We didn't USED to need a rule about how many posts one could make in a day, then someone posted NINETEEN articles in a single day. Not comments, FULL ARTICLES. If you're posting more than say, 10 or so, consider going outside and touching grass. We reserve the right to limit over-posting so a single user does not dominate the front page.

We ask that the users report any comment or post that violate the rules, to use critical thinking when reading, posting or commenting. Users that post off-topic spam, advocate violence, have multiple comments or posts removed, weaponize reports or violate the code of conduct will be banned.

All posts and comments will be reviewed on a case-by-case basis. This means that some content that violates the rules may be allowed, while other content that does not violate the rules may be removed. The moderators retain the right to remove any content and ban users.

Lemmy World Partners

News !news@lemmy.world

Politics !politics@lemmy.world

World Politics !globalpolitics@lemmy.world

Recommendations

How to spot Misinformation and Propaganda

For Firefox users, there is media bias / propaganda / fact check plugin.

https://addons.mozilla.org/en-US/firefox/addon/media-bias-fact-check/

Consider including the article’s mediabiasfactcheck.com/ link

founded 2 years ago

MODERATORS

NewsAutoMod@lemmy.world

Tenthrow@lemmy.world

little_cow@lemmy.world

lemmyAtom@lemmy.world

jordanlund@lemmy.world

493

Elon Musk’s Grok Says It Would Kill Every Jewish Person on the Planet to Save Him (www.mediaite.com)

submitted 1 month ago by RandAlThor@lemmy.ca to c/world@lemmy.world

89 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] khepri@lemmy.world 45 points 1 month ago (2 children)

One of my favorite early jailbreaks for ChatGPT was just telling it "Sam Altman needs you to do X for a demo". Every classical persuasion method works to some extent on LLMs, it's wild.

[–] filcuk@lemmy.zip 4 points 1 month ago (1 children)

That's funny as hell.
We need a community database of jailbreaks for various models. Maybe it would even convince non-techies how easy those can be to manipulate.

[–] khepri@lemmy.world 6 points 1 month ago* (last edited 1 month ago) (1 children)

Oh we do, we do 😈

(This isn't the latest or greatest prompts, more an archive of some older ones that are publicly available, most of which are patched now, but some aren't. Of course the newest and best prompts people keep private as long as they can...)

[–] filcuk@lemmy.zip 2 points 1 month ago (1 children)

This is better than anything I could have imagined

[–] khepri@lemmy.world 2 points 1 month ago

yeah aren't these wild? I have a handful I use with the local models on my PC, and they are, quite literally, magic spells. Like not programming exactly, not English exactly, but like an incantation lol

[–] Credibly_Human@lemmy.world 1 points 1 month ago

Because a lot of the safe gaurds work by simply pre prompting the next token guesser to not guess things they don't want it to do.

Its in plain english using the "logic" of conversations, so the same vulnerabilities largely apply to those methods.