this post was submitted on 17 Jun 2026

[–] VonReposti@feddit.dk 9 points 3 hours ago (1 children)

Already is. Take a look at devstral, qwen3.6, deepseek coder. All of them can be run on a high-end GPU, and if you're a developer you likely have one.

[–] makeshift0546@lemmy.today 12 points 2 hours ago* (last edited 2 hours ago) (2 children)

The vast majority of users ain't running anything bigger than 27b max, more likely 14b, and that shit just ain't nearly as good as older SaaS models, much less the dominant ones like opus. Maybe for small shit, but complex tasks just ain't fitting on home hardware.

[–] VonReposti@feddit.dk 1 points 10 minutes ago

Completely agree, I forgot to mention that part. I am testing a few models ranging from 18b to 26b on my 7900xt. It is far from "make this complete system", but it can handle some smaller tasks. I think that will be the end goal anyway, since cloud models fail a lot at maintainability, security, and the other higher levels of thought that go into coding. They can make a convincing prototype, but I wouldn't hook it up to production.
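
For a rough feel of why model size matters on home hardware, here is a back-of-the-envelope sketch in Python. Everything in it is an assumption for illustration: the 20 GB VRAM figure for the card, the quantization levels (16, 8, and 4 bits per weight), and the flat 2 GiB allowance for context cache and runtime overhead. It only counts the weights, so treat the numbers as a lower bound, not a benchmark.

```python
def estimate_weight_gib(params_billions: float, bits_per_weight: float) -> float:
    """Approximate memory needed for the model weights alone, in GiB."""
    total_bytes = params_billions * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


def fits(params_billions: float, bits_per_weight: float,
         vram_gib: float, overhead_gib: float = 2.0) -> bool:
    """True if weights plus a flat overhead allowance fit in VRAM.

    overhead_gib is a hypothetical placeholder for context cache and
    runtime buffers; real usage depends on context length and runtime.
    """
    needed = estimate_weight_gib(params_billions, bits_per_weight) + overhead_gib
    return needed <= vram_gib


if __name__ == "__main__":
    VRAM_GIB = 20  # assumption: a 20 GB card
    for size in (14, 18, 26, 27):      # model sizes mentioned in the thread
        for bits in (16, 8, 4):        # assumed quantization levels
            gib = estimate_weight_gib(size, bits)
            verdict = "fits" if fits(size, bits, VRAM_GIB) else "too big"
            print(f"{size}b @ {bits}-bit: ~{gib:.1f} GiB weights, {verdict}")
```

Under these assumptions, the 18b to 27b range only fits once it is quantized down to around 4 bits per weight, which is part of why quality drops compared to hosted models.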

Local models are already functioning well as a force multiplier. They can help explain logic, do minor refactoring, debugging etc., but with a bit of latency. I do think this is where we're headed, since the frontier models required for generating a full prototype can't make production quality code and it is prohibitively expensive to run them. As far as I've heard, they're generally spending ten times as much as they earn per token.

[–] naeap@sopuli.xyz 2 points 2 hours ago

Sadly, that's true

Tried to refactor a spaghetti code state machine and thought, well, AI should handle this well. All the logic is there, just separate it into small functions to clean up the large one.

None was able to, if only because of the context window.

To be fair though, I tried Mistral online and it also stumbled around. ChatGPT was a complete clusterfuck - haven't tried Claude.

To be even fairer... it's a really large state machine, which was written on site while I had a fever and was under stress. So, to defend myself a bit, that's how it even came to that ;-)

But it seems I'll need to go through this myself.
I actually thought this would be a perfect use case for AI...
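
For anyone wondering what "separate it into small functions" could look like, here is a minimal sketch in Python. The states, events, and handler names are all hypothetical, since the real state machine isn't shown in this thread. The idea is one small handler per state plus a lookup table, instead of one large function with nested branches.

```python
from enum import Enum, auto


class State(Enum):
    # Hypothetical states, for illustration only.
    IDLE = auto()
    RUNNING = auto()
    DONE = auto()


def on_idle(event: str) -> State:
    """Handle events while idle: only 'start' moves us forward."""
    return State.RUNNING if event == "start" else State.IDLE


def on_running(event: str) -> State:
    """Handle events while running: finish, abort, or keep running."""
    if event == "finish":
        return State.DONE
    if event == "abort":
        return State.IDLE
    return State.RUNNING


def on_done(event: str) -> State:
    """DONE is terminal in this sketch, so every event is ignored."""
    return State.DONE


# Dispatch table: each state maps to exactly one small handler.
HANDLERS = {
    State.IDLE: on_idle,
    State.RUNNING: on_running,
    State.DONE: on_done,
}


def step(state: State, event: str) -> State:
    """Advance the machine by one event."""
    return HANDLERS[state](event)


def run(events, state: State = State.IDLE) -> State:
    """Feed a sequence of events through the machine and return the final state."""
    for event in events:
        state = step(state, event)
    return state
```

A side effect of this layout is that each handler is small enough to review, test, or paste into a limited context window on its own, which the original large function apparently was not.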