this post was submitted on 28 May 2026
243 points (97.6% liked)
A Boring Dystopia
16796 readers
774 users here now
Pictures, Videos, Articles showing just how boring it is to live in a dystopic society, or with signs of a dystopic society.
Rules (Subject to Change)
--Be a Decent Human Being
--Posting news articles: include the source name and exact title from article in your post title
--If a picture is just a screenshot of an article, link the article
--If a video's content isn't clear from title, write a short summary so people know what it's about.
--Posts must have something to do with the topic
--Zero tolerance for Racism/Sexism/Ableism/etc.
--No NSFW content
--Abide by the rules of lemmy.world
founded 3 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Short youtube video explaining why tokenisation causes this bug. It's an older video, so it talks about tokens as being whole-word rather than chunks of words, which is how most modern models work.
https://youtube.com/shorts/7pQrMAekdn4
The other persons explanation doesn't acknowledge that emergent reasoning does kind-of exist in LLMs. That's why theyre able to say how many 5's are in a large number, despite never seeing that problem before. They dont 'just' repeat things they've been trained on, though they often do.
Of course, if that problem did exist significantly in the training data, it would be more likely to get it right. But you could say the same about any number of things an LLM doesn't know.