this post was submitted on 15 May 2025
25 points (93.1% liked)

No Stupid Questions

40697 readers
1352 users here now

No such thing. Ask away!

!nostupidquestions is a community dedicated to being helpful and answering each others' questions on various topics.

The rules for posting and commenting, besides the rules defined here for lemmy.world, are as follows:

Rules (interactive)


Rule 1- All posts must be legitimate questions. All post titles must include a question.

All posts must be legitimate questions, and all post titles must include a question. Questions that are joke or trolling questions, memes, song lyrics as title, etc. are not allowed here. See Rule 6 for all exceptions.



Rule 2- Your question subject cannot be illegal or NSFW material.

Your question subject cannot be illegal or NSFW material. You will be warned first, banned second.



Rule 3- Do not seek mental, medical and professional help here.

Do not seek mental, medical and professional help here. Breaking this rule will not get you or your post removed, but it will put you at risk, and possibly in danger.



Rule 4- No self promotion or upvote-farming of any kind.

That's it.



Rule 5- No baiting or sealioning or promoting an agenda.

Questions which, instead of being of an innocuous nature, are specifically intended (based on reports and in the opinion of our crack moderation team) to bait users into ideological wars on charged political topics will be removed and the authors warned - or banned - depending on severity.



Rule 6- Regarding META posts and joke questions.

Provided it is about the community itself, you may post non-question posts using the [META] tag on your post title.

On fridays, you are allowed to post meme and troll questions, on the condition that it's in text format only, and conforms with our other rules. These posts MUST include the [NSQ Friday] tag in their title.

If you post a serious question on friday and are looking only for legitimate answers, then please include the [Serious] tag on your post. Irrelevant replies will then be removed by moderators.



Rule 7- You can't intentionally annoy, mock, or harass other members.

If you intentionally annoy, mock, harass, or discriminate against any individual member, you will be removed.

Likewise, if you are a member, sympathiser or a resemblant of a movement that is known to largely hate, mock, discriminate against, and/or want to take lives of a group of people, and you were provably vocal about your hate, then you will be banned on sight.



Rule 8- All comments should try to stay relevant to their parent content.



Rule 9- Reposts from other platforms are not allowed.

Let everyone have their own content.



Rule 10- Majority of bots aren't allowed to participate here. This includes using AI responses and summaries.



Credits

Our breathtaking icon was bestowed upon us by @Cevilia!

The greatest banner of all time: by @TheOneWithTheHair!

founded 2 years ago
MODERATORS
 

@nostupidquestions How does generative AI handle creoles? Does it get confused and respond like it's one of the parent languages?

top 10 comments
sorted by: hot top controversial new old
[–] yesman@lemmy.world 4 points 1 day ago (1 children)

What you won't find in the dictionary difference between dialects or creoles is historically racialized. A good example is Louisiana. Technically, Cajun is a subset of creole. But in practice Cajuns are white and Creoles are black.

https://hnoc.org/publishing/first-draft/whats-difference-between-cajun-and-creole-or-there-one

Black dialects (In America) have historically been treated as bastardizations, pollution, or just plan ignorance of proper grammar and syntax. I witnessed the short life of "Ebonics" until it was ridiculed into oblivion. I can remember plenty of people who spoke a Southern dialect only loosely associated with English mercilessly mocking and doing impressions of AAVE.

Common parlance like "They don't think it be that way, but it do" is a contemporary example of black dialect with syntax and rules as complex as any language being mocked as as stupidity, ignorance, and an inability to speak English.

I'm not the racist police. If you've laughed at these jokes or told them, I don't think your a bad person. I've laughed and spread them myself. This is just an FYI.

@yesman hmm that talks about people creoles specifically american though. not language creoles. Once upon a time, our country also had creoles so I know about the term.

[–] gratux@lemmy.blahaj.zone 8 points 2 days ago

I am no expert, but I think it depends on how present those are in the training data. If you have barely any representation, I wouldn't be surprised if the LLM thinks it is ome if the parent languages instead.

[–] droning_in_my_ears@lemmy.world 5 points 2 days ago (1 children)

I don't know about creoles but with dialects in my experience it mixes the standard language, the dialect I use with it and other dialects. Probably confusing that they all use the same writing system and have some common vocab.

[–] neidu3@sh.itjust.works 6 points 1 day ago* (last edited 1 day ago) (1 children)

As an aside, I was surprised to see that when I wrote chatgpt a sentence written in an obscure (mine) norwegian dialect, peppered with slang and nonstandard contractions, chatgpt actually understood me very well. It seems LLMs are really good at checking possible root words and their origins to infer meaning when it's not recognizing the word.

For example, I wrote "veitkji", which is a contractions of "veit ikkji", which is normally written "vet ikke". However, cgpt correctly interpreted it as "some scandinavian non-standard dialect for "don't know" "

Oh yeah it understood me too, but its reply was a weird mix of dialect and non dialect

[–] whaleross@lemmy.world 2 points 1 day ago (1 children)

I know that the current generation of LLM have a language agnostic knowledge base, which is damn awesome, but I don't know how the language layer works.

[–] Reshirams_Rad_Slam@mastodo.neoliber.al 1 points 1 day ago (1 children)

@whaleross what does that agnosticism mean? Don't you need to store that knowledge somehow in say maybe a language?

[–] whaleross@lemmy.world 1 points 1 day ago (1 children)

Yeah but not one "master" language. All knowledge it can relate across languages is available in all languages. LLM (and computers) don't care about the data, they just process it. Humans would translate it and compile it in ordered encyclopedia. For LLM it is all just an insane number of references and cross references all over that is available from anywhere that the link has been established. The input/output of desired language and formulation and whatever is a different part of it.

As far as I understand it.

@whaleross but those references would be in a language, wouldn't it? Unfortunately I can't test it since I don't think the most obscure language I know is obscure enough