this post was submitted on 07 Jun 2026
723 points (99.1% liked)

Technology

85515 readers
4191 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 3 years ago
MODERATORS
 

The injured teenage survivor of a January 2025 shooting at a Nashville, Tennessee high school recently sued the manufacturer of an “AI gun detection” system that failed to detect the handgun that left two dead, including the shooter.

According to the lawsuit, which was filed in Davidson County court last month, the security company Omnilert either knew or should have known that there were “significant operational limitations in its gun detection system that could result in detection failures during actual emergencies, including limitations based on camera placement, proximity of the weapon to camera sensors, camera angle, lighting, and weapon visibility.”

Omnilert cofounder Ara Bagdasarian declined Ars’ invitation to answer questions about the lawsuit. System Integrations, the other defendant in the case, which resold the Omnilert system, also did not respond to Ars’ request for comment.

you are viewing a single comment's thread
view the rest of the comments
[–] db2@lemmy.world -5 points 1 week ago (2 children)

Do you think LLMs are being used for this sort of thing?

Yes. It took all of five seconds to find out too.

No, datacentres are not being used for real-time gun detection

You've already been wrong once, care to try for two?

[–] Wispy2891@lemmy.world 4 points 1 week ago (1 children)

Using a LLM for detecting a specific object on an image is possible but stupid: if your object is always the same (like in this case) it's several orders of magnitude cheaper to train once on that specific object then use the computer vision model running directly on the local server that's recording the video.

Otherwise:

  1. the api costs would be colossal, 0.001$ per each image, at 30 fps it's $100 per hour, nobody would pay that
  2. The detection latency would be several seconds vs almost instant
  3. Without internet connection the system wouldn't work

Use cases for LLM-based image recognition is if the object changes at every request or it's ultra specific with brands and colors

[–] db2@lemmy.world -1 points 1 week ago (2 children)

if your object is always the same (like in this case)

It isn't the same though. A large gauge shotgun and a small gauge pistol are pretty different looking. Compare those to a .22 rifle with a scope, and those to a decked out ar15. That's a lot of different always the sames. What if it's a revolver? Or has a folded stock? Or a sawed off stock? Will it recognize a derringer or a mac10 with a large capacity mag as guns?

We can because they make us dead. We have valid reason to fear them which is a great motivator for most species to learn to recognize the danger. You'd still recognize a ring gun as a gun, without getting specifically trained to do so a machine will identify it as jewelry.

[–] Wispy2891@lemmy.world 5 points 1 week ago (1 children)

so, train the computer vision model for a gun and train again for a shotgun. Run the two detection models at the same time.

Your approach is the typical "but if you really want you can use an atomic bomb to kill mosquitoes" - yes, you could do that, but nobody is paying $1 mil/year in inference costs (+some expensively licensed software to wrap around that) when it can be done locally with a $300 GPU (+ some expensively licensed software to wrap around that)

[–] db2@lemmy.world -1 points 1 week ago

I gave a lot more than two examples and it was hardly exhaustive.

[–] CeeBee_Eh@lemmy.world 3 points 1 week ago

A large gauge shotgun and a small gauge pistol are pretty different looking. Compare those to a .22 rifle with a scope, and those to a decked out ar15. That's a lot of different always the sames. What if it's a revolver? Or has a folded stock? Or a sawed off stock? Will it recognize a derringer or a mac10 with a large capacity mag as guns?

You seem to think that computer vision models can only be trained on a single thing. You simply train your modem on as many object types as you want it to be aware of. That's it.

[–] CeeBee_Eh@lemmy.world 3 points 1 week ago (1 children)

Yes. It took all of five seconds to find out too.

Didn't I just say that slapping an LLM vision model on to dozens of camera streams would be a near impossible technical hurdle?

I never said vLLM models don't exist. I said they're impractical for this use case.

You've already been wrong once, care to try for two?

Haven't been wrong yet. You on the other hand...

[–] db2@lemmy.world -2 points 1 week ago (1 children)

There are several examples of exactly what I said, contradicting your repeated claim. Since I don't want to talk to someone with the conversational ability of Donald Trump demanding things be true in spite of evidence they're not im going to be blocking you now. Have a nice day.

[–] CeeBee_Eh@lemmy.world 2 points 1 week ago

There are several examples of exactly what I said

No one is denying the existence of vision based LLM models. The issue is performance. It takes in the order of double (or even triple) digit seconds to process an image through an LLM. Even if it took a single second to process an image using decent server-grade hardware (which starts at about $10k per card), that's way too much and still not fast enough.

On just 10 cameras at a facility it would require north of $100k on just GPUs alone.

Whereas a specialized computer vision model could process several dozen camera streams, in real-time, on just one of those $10k cards.

An LLM would process an image in 10 seconds (generous) whereas a computer vision model operates in the milliseconds. We're talking about a 1000x difference in required processing power.

That's why you're wrong and have zero clue what you're talking about.

You're arguing that that family uses a fully loaded semi-trailer to go 200m to the local park. It's a clueless and asinine argument.