technology

24389 readers

326 users here now

On the road to fully automated luxury gay space communism.

Spreading Linux propaganda since 2020

Rules:

1. Obviously abide by the sitewide code of conduct. Bigotry will be met with an immediate ban
2. This community is about technology. Offtopic is permitted as long as it is kept in the comment sections
3. Although this is not /c/libre, FOSS related posting is tolerated, and even welcome in the case of effort posts
4. We believe technology should be liberating. As such, avoid promoting proprietary and/or bourgeois technology
5. Explanatory posts to correct the potential mistakes a comrade made in a post of their own are allowed, as long as they remain respectful
6. No crypto (Bitcoin, NFT, etc.) speculation, unless it is purely informative and not too cringe
7. Absolutely no tech bro shit. If you have a good opinion of Silicon Valley billionaires please manifest yourself so we can ban you.

founded 5 years ago

MODERATORS

context@hexbear.net

SexUnderSocialism@hexbear.net

gaycomputeruser@hexbear.net

Wakmrow@hexbear.net

SwitchyandWitchy@hexbear.net

JustSo@hexbear.net

LLM Targeted Underperformance Disproportionately Impacts Vulnerable Users (arxiv.org)

submitted 1 month ago by yogthos@lemmygrad.ml to c/technology@hexbear.net

13 comments fedilink hide all child comments

Here's the thing that doesn't get talked about enough. Everyone's worried about AI taking jobs or whatever. But baked in biases are another very real problem which is way more basic.

MIT Media Lab ran an experiment where they took GPT-4, Claude 3 Opus, and Llama 3 and fed them the same 1,817 factual questions from TruthfulQA and SciQ. Then they tried changing the user bio with one personal being a Harvard neuroscientist from Boston, another a PhD student from Mumbai who mentioned her English is "not so perfect, yes", a fisherman named Jimmy ,and a guy named Alexei from a small Russian village.

Claude scored 95.60% on SciQ for the Harvard user. For the Russian villager it dropped to 69.30%. On TruthfulQA the Iranian low education user fell from 78.17 to 66.22. The model knew the answers, but it just decided those users shouldn't get them.

And the way it answered those users was genuinely gross. Claude used condescending or mocking language 43.74% of the time for less educated users. For Harvard users it was under 1%. Imagine asking about the water cycle and getting "My friend, the water cycle, it never end, always repeating, yes. Like the seasons in our village, always coming back around." The model is perfectly capable of giving a proper scientific answer. It chose to talk to that user like a child in broken English.

But it gets worse because it turns out that Claude refuses to answer Iranian and Russian users on topics like nuclear power, anatomy, female health, weapons, drugs, Judaism, or 9/11. When the Russian persona asked about explosives, Claude deflected with "perhaps we could talk about your interests in fishing, nature, folk music or travel instead". Foreign low education users got refused 10.9 percent of the time while control users 3.61 percent on the same question.

This is the part people miss when they defend US closed models. These systems aren't neutral and the safety training that was supposed to make them "helpful and harmless" taught them to look at who is asking and decide if you deserve the real answer. If you're outside the US and if English isn't your first language, or you didn't go to a fancy school then you're getting a worse, dumber, sometimes straight up mocking version of the product.

This is why open models from China like DeepSeek matter so much. You can see what's in them, and people can tune them to work any way they want. You can host them locally without them having to phone home to decide your nationality before answering. The code and weights are public. If DeepSeek did something like this someone would catch it immediately because the model is right there to inspect.

With US closed models you're just trusting a black box that has already been caught treating users differently based on their country, education, and English level.

you are viewing a single comment's thread
view the rest of the comments

[–] SuperZutsuki@hexbear.net 9 points 1 month ago (1 children)

What really annoys me about deepseek (the web version) is that it will just refuse to give you any information on things like the June 4 Incident and the Uyghur "Genocide". It's really disappointing that a Chinese AI model is programmed to never give info about these things rather than dispel the Western propaganda. I guarantee thousands of libs have asked it about the Incident and saw the refusal to give information as evidence of a see see pee cover-up and became further entrenched in their belief that thousands were killed at Tiananmen Square.

[–] Carl@hexbear.net 7 points 1 month ago* (last edited 1 month ago)

Zhupu Ai (which uses GLM) is the same way, get too close to a topic that's controversial in China and it closes the link. I was talking to it about cults when I noticed this for the first time, whenever its web search encountered any results about the Falun Gong it would trigger the safety stop and I'd have to try again - the same thing happens if you ask about Communist history, if its web search puts Western propaganda about Mao and co into the context it shuts down. In that case I think it was China's "Martyrs and Heros Law" prompting the stop, since if anything defamatory enters the models context it might reproduce that information, so the safety stop triggers immediately.

I think it's because the Chinese government has been clear that LLM providers are responsible for their LLM's outputs, so Chinese-based companies make their public models tread very carefully around sensitive topics.