Technology

86745 readers

3744 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 3 years ago

MODERATORS

L3s@lemmy.world

enu@lemmy.world

technopagan@lemmy.world

L4s@lemmy.world

L3s@hackingne.ws

677

Anthropic/OpenAI may be spending more than $1000 for every $100 you pay them (ea.rna.nl)

submitted 1 month ago by Trilogy3452@lemmy.world to c/technology@lemmy.world

176 comments fedilink hide all child comments

(page 3) 50 comments

sorted by: hot top controversial new old

[–] captain_solanum@sh.itjust.works 2 points 1 month ago* (last edited 1 month ago) (5 children)

looks inside

But if you use the $100 a month Claude Max plan, and you would use it to the weekly limit by going full ‘agentic coding’ (so almost no human in the loop) you would use an amount of tokens that would cost you more than $1000 at API-pricing.

If I watch 600 movies every day on my netflix subscription I am using more energy than I pay them for. Obviously everyone is like me. Therefore they are losing money overall.

Wait, their (netflix) earnings say they made a profit last quarter. But my calculations were waterproof!

Probably anthropic are not net positive, but they are not spending 10x what people pay them for tokens.

load more comments (5 replies)

[–] Burninator05@lemmy.world 2 points 1 month ago (1 children)

I wish that was inversely proportional. The less I pay, somehow it costs them more money.

[–] andallthat@lemmy.world 2 points 1 month ago

Yes, that's called a "marketing budget"

[–] vermaterc@lemmy.ml 2 points 1 month ago (27 children)

So are we assuming here that LLMs won't become more efficient over time? GPT-3 has been a frontier model just a few years ago and it's performance blew everyone's mind at that time. I can now run equivalent LLM on my personal computer. Why can't we expect that after a few years Claude Sonnet level of capability won't be possible to accomplish locally?

[–] greyscale@lemmy.grey.ooo 3 points 1 month ago

It already happened, small language models are busy dragging their nutsack on frontier models, running on a macbook and costing nothing

Where's the fucking product, Sam?

[–] pinball_wizard@lemmy.zip 1 points 1 month ago* (last edited 1 month ago)

So are we assuming here that LLMs won't become more efficient over time?

Mostly. Moore's law ran up against the physical limits of the materials we make chips out of - so desktops of today just do what the desktops of yesterday do, mostly.

We should keep seeing improvements in highly specialized models. There's interesting outcomes to have here, with the right setup and ollama.

but -

The really promising impressive models today are just running with long contexts on shithloads of hardware - which is neither coming to home PCs any time soon nor going to actually be profitable to run any time soon.

There's an argument to be made that running the really interesting model on a ton of hardware might make money for really specific uses - but then when we talk about specific uses that are worth lots of money, those use cases tend to tolerate difficult interfaces and reward accuracy. LLMs invariably reduce accuracy in exchange for ease of use. There might be a sweet spot for a huge expensive hallucination prone LLM in some of these uses, but I doubt it (the entire approach) competes, long term.

There's a few specific use cases where inaccuracy is desirable - largely forms of shifting accountability and some kinds of gambling. Things that either are or should be crimes have a higher tolerance for AI hallucination.

But - a small cheap local model has all the desirable attributes for doing these things (crimes) poorly as a big expensive model. So there's probably not even much money to be made there.

I expect that this tech is not going away, but it's also not earning back the current investment.

load more comments (24 replies)

[–] bangupjobasusual@lemmy.world 1 points 1 month ago

I’m not going to read this whole article, is that opex??

load more comments