Asklemmy

54673 readers

234 users here now

A loosely moderated place to ask open-ended questions

Search asklemmy 🔍

If your post meets the following criteria, it's welcome here!

Open-ended question
Not offensive: at this point, we do not have the bandwidth to moderate overtly political discussions. Assume best intent and be excellent to each other.
Not regarding using or support for Lemmy: context, see the list of support communities and tools for finding communities below
Not ad nauseam inducing: please make sure it is a question that would be new to most members
An actual topic of discussion

Looking for support?

Looking for a community?

Lemmyverse: community search
sub.rehab: maps old subreddits to fediverse options, marks official as such
!lemmy411@lemmy.ca: a community for finding communities

~Icon~ ~by~ ~@Double_A@discuss.tchncs.de~

founded 7 years ago

MODERATORS

Do LLMs "have" the "abillity" to be told they are wrong or incorrect and be able to contest that? (lemmy.world)

submitted 3 days ago by cheese_greater@lemmy.world to c/asklemmy@lemmy.ml

35 comments fedilink hide all child comments

I think i've only once flat out told one it was wrong about a specific assertion I quoted and it immediately was able to find its way to what I knew to be the correct claim.

I just wonder what would happen if i was in fact mistaken and I told it confidently it was wrong without elaborating

you are viewing a single comment's thread
view the rest of the comments

[–] lucg@lemmy.world 1 points 5 hours ago

I remember reading something about LLMs not being able to learn "x is y" equivalence relations. Can't find it now but limitations like this are what make differences clear between what humans do and what we've managed to teach the neural network (which will be used to iterate and improve the model further, of course)

In the Chinese box analogy, this would be like them knowing cats are considered cute but not whether considered-cute animals include cats (if I remember the limitation type correctly). If you happen to slip the right instructions/questions, something they've seen before or something they're capable of extrapolating, then nothing seems off; but if someone can say in one paragraph that cats are cute but they know of no cute animal, you'd not think they're understanding what they're saying, and so don't really understand the language even if they give you plausible words in all other cases

(For cats it'll work because there's a billion example sentences out there. LLM vendors are also trying to sidestep such problems by having it generate a bunch of tangential text (in which it might happen to regurgitate the tokens it needs to piece together the answer) before answering the prompt, but that's still not being able to apply logic)