Damn we're really sticking it to Google by repeatedly using their service to try to one-up each other's screenshots. We should keep doing that and sharing the screenshots which will convince more people to do the same. They'd hate getting those usage metrics!
A Boring Dystopia
Pictures, Videos, Articles showing just how boring it is to live in a dystopic society, or with signs of a dystopic society.
Rules (Subject to Change)
--Be a Decent Human Being
--Posting news articles: include the source name and exact title from article in your post title
--If a picture is just a screenshot of an article, link the article
--If a video's content isn't clear from title, write a short summary so people know what it's about.
--Posts must have something to do with the topic
--Zero tolerance for Racism/Sexism/Ableism/etc.
--No NSFW content
--Abide by the rules of lemmy.world
This is why the steam deck is $1,000 btw
Interestingly, it would probably do a better job of writing a piece of code to count how many T's there are, and then reading output of that.
Yeah, it's pretty efficient at that. When the strawberry version was around, CGPT wrote some python and executed it after asking it programatically
I'll never tire of LLM aneurysms.
"Coloniatism" is my favourite

And this shit is "taking people's jobs"
Hahahahahahahahahahahahahahahaha
Is this actually real though?
Just tested with similar results, output was:
There are exactly 2 't's in the word 'colonialism'. C-o-l-o-n-i-a-l-i-s-m Would you like to check the spelling or character count of any other words? Let me know!
Wow, I didn't think it was still that stupid
I don't think this particular genre of stupid will ever be fully fixed in LLMs to be honest, it's fairly structural
I hope so
What causes this?
LLMs break words up into chunks of letters which commonly appear - suffixes like "-tion" and "-ism" are obvious examples. They then predict which chunk comes next based on the ones before, or whether the word will end.
This is very useful for generating sensible-looking text while at the same time correlating concepts associated with different words. However, it also means that the dont really "see" the letters that make up each word, just the chunks of letters, which are stored as mathematical vectors. This is why they struggle so much with analysing the makeup of words.
However, with numbers they generally store each digit individually, so they shouldnt have as much of a problem saying how many 5's are in 1,589,005, for example.
Two very different answers to this question...
Short youtube video explaining why tokenisation causes this bug. It's an older video, so it talks about tokens as being whole-word rather than chunks of words, which is how most modern models work.
https://youtube.com/shorts/7pQrMAekdn4
The other persons explanation doesn't acknowledge that emergent reasoning does kind-of exist in LLMs. That's why theyre able to say how many 5's are in a large number, despite never seeing that problem before. They dont 'just' repeat things they've been trained on, though they often do.
Of course, if that problem did exist significantly in the training data, it would be more likely to get it right. But you could say the same about any number of things an LLM doesn't know.
Simply put, LLMs are great at forming sentences but can't do math. Like, any math. If they get 60+21=81 right, it's only going to be because it's textually written somewhere in the training data that 60+21=81. However, it's very unlikely for counting the number of Ts in colonialism to be in there, so it just hallucinates what it thinks is a correct response.
i seen without "thinking", it tells you if there is 2.
google's search ai does not have "thinking"
the looping thing i also seen before.
It took several tries but I got one that looped. Most of the time it gives the "there are 2" and puts random arrows.
This used to happen on chatgpt with "Is there a seahorse emoji". Here's a video explaining why this happens.

Wait the seahorse emoji is not real???
Whenever I see these, I try them out. Sometimes I can reproduce them, this one I can't. However, I remain extremely skeptical and believe that whenever one of these screenshots goes around, someone at LLM Company hard codes a fix for that specific fuck up. Against how many of these hard fixes does each LLM answer get checked nowadays?!
funnily enough, usually I can't reproduce these, but this one I could. It's not quite as unhinged but still definetly very wrong

I tried it, and got similiar shorter answers but not the exact same answer. Sometimes it ends up getting it right at the end after fumbling a lot, and sometimes it just fails completely.
Searching on Google directly sometimes doesn't produce the AI Overview on stuff like these in my experience, but passing the search to Google from DDG with the bang (!g) almost always produces the AI Overview.
edit: I tried it again and it grew the ability of humor:
There are 2 't's in the word colonialism. colt-a-ca-l-i-s-m (just kidding) C-o-l-o-n-i-a-l-i-s-m:
- t = 0 (If you were thinking of colonization, there is still only 1 't' in the word.)
Interestingly, on one of my attempts it used python to count the number of t's and still ended up getting the "verbal" explanation wrong.
This might be worse then just giving the wrong answer....
On the very long list of shitty things about AI is the fact that they are non-deterministic.
I was however able to get this fuckup on first try:

AI can make mistakes, so double-check responses
I'm sure it's fine. After all, the AI triple-checked its answer!
Wow, you're right. I got three different answers over five queries.
Right. Cause they aren’t answering the question. They are just determining the next most likely word.
Here's what I got:

I can't believe Google missed that third 't' the first time around. So sloppy.