this post was submitted on 07 Mar 2026
833 points (98.9% liked)

Technology

82460 readers
3036 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
 

Over the past few weeks, several US banks have pulled off from lending to Oracle for expanding its AI data centres, as per a report.

you are viewing a single comment's thread
view the rest of the comments
[โ€“] moto@programming.dev 3 points 1 day ago (1 children)

I like local LLMs as much as the next person but the issue is that doesn't scale the way companies need it to.

As a personal assistant? Sure, I agree. They're useful at times. But as soon as you need multiple to run simultaneously you're gonna hit resource issues.

What Oracle and others were banking on is that you have engineers and others running a lot of agents in parallel composing different things together. Or having one input that multiple serverside agents take and execute numerous tasks on. That's something you can't run on an individual machine right now. And with the way they currently work I don't envision they will anytime soon.

[โ€“] vacuumflower@lemmy.sdf.org 1 points 19 hours ago

There are lightweight models as good as some heavier ones. It's a bit like Intel's tick-tock advertised process. Heavy memory-hungry models are "tick", but there's "tock"- say, "lfm2.5-thinking" model, the light version, in the ollama repository seems almost as good as qwen3.5 for me, except it's very lightweight and lightning-fast compared to that.

These things are being optimized. It's just that in the market capture phase nobody bothered.

That they are not being used correctly - yeah, absolutely, my idea of their proper use is some graph-based system with each node being processed by a select LLM (or just piece of logic) with select set of tools and actions and choices available for each. A bit like ComfyUI, but something saner than a zoom-based web UI. Like MacOS Automator application, rather.