this post was submitted on 29 Jul 2025
893 points (99.3% liked)

Technology

73534 readers
3057 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] Auli@lemmy.ca 2 points 3 days ago (1 children)

And what is processing that information?

[–] coach_cheese@lemmy.world 2 points 3 days ago (1 children)

Computer vision commonly uses convolutional neural networks on the input, which is different from the transformer neural networks used in LLMs. If you have more info indicating LLMs are used here please share

[–] mojofrododojo@lemmy.world -2 points 2 days ago (1 children)

If you have more info indicating LLMs are used here please share

two seconds of research would reveal LLMs are ALL OVER COMPUTER VISION. Are convolutional networks used? Yes. Are LLMs used? Yes. And MLLMs.

Tell you what sparky: you find me a source that says ONLY CNNs are used, then you can act like a subject matter expert.

https://arxiv.org/abs/2311.16673

https://techcommunity.microsoft.com/blog/educatordeveloperblog/its-not-just-words-llms-in-computer-vision/3927912

https://medium.com/@tenyks_blogger/multimodal-large-language-models-mllms-transforming-computer-vision-76d3c5dd267f

https://github.com/OpenGVLab/VisionLLM

https://www.chooch.com/blog/how-to-integrate-large-language-models-with-computer-vision/

[–] coach_cheese@lemmy.world 3 points 2 days ago (1 children)

I was actually referring to UVEye which was referenced in the article. I looked into UVEye and nowhere did it say it used LLMs with their computer vision. That’s why I asked if anyone had any info on them using it. The comment I replied to assumed LLMs were used but supplied no evidence. None of the links you shared have anything to do with UVEye either.

[–] mojofrododojo@lemmy.world -1 points 2 days ago (1 children)

Computer vision commonly uses convolutional neural networks on the input,

no where do you specify UVEye.

You could admit they're all over, but instead double down on how I assumed lol

[–] coach_cheese@lemmy.world 3 points 2 days ago (1 children)

Except they are using computer vision, not an LLM

That’s what I initially said, referring to the article. If you have nothing to say regarding the technology in this article that’s fine, but don’t just assume that since there is research of incorporating LLMs into computer vision means it was used in this specific case.

[–] mojofrododojo@lemmy.world 1 points 2 days ago

If you have more info indicating LLMs are used here please share

so I did. whine about it, but they're used in this field, if not this particular case. you asked, I provided.