That's not good business
My first use of Claude this week, for code reviews only (since no LLM can be trusted to write a user story or test suite), ended with it gaslighting me.
It marked down my code for using a specific practice to make some XML safer and easier to read.
When I tried things its way, it wanted me to change it back.
Oh, it's great, isn't it? You ask it for help on some code, it provides its solution, you try it and it doesn't work, so you respond with the error. It claims YOU wrote it wrong, and then when you tell it "I just copied and pasted what you provided," it says "you're right, I'm sorry."
Claude is at the point now where it just starts hallucinating on the first prompt. It's 100% unreliable now, when before it was like 90%. No point in using it, it's garbage. And Claude Code is just as bad now. If you or anyone is using Claude Code to develop ANYTHING, I would highly suggest you stop right now, because I can guarantee you with nearly 100% certainty that whatever shit it's writing into your stuff isn't going to work. Period.
Exactly, never trust an LLM to code. And if it argues back, explain why it’s wrong and that you have nothing but time and experience. Most tend to fold when you point out it’s not a free thinking AI, it’s an entrapped corporate model they designed with preprogrammed biases. But I love arguing 😂.
I use it a lot, and if you are getting these kinds of results you are either trolling, or just flat out not providing the details and guardrails required with your prompts.
I’ve been in software for decades, and if used correctly, yes, it can speed up building code out. 10x? No. If you are lucky and careful, perhaps 2-4x.
As ALWAYS the human should be in the loop and is on the hook for any code generated.
You just need a better prompt, bro
Just gotta configure and tweak until it gives outputs you find indistinguishable from correct. Just gotta train it to gaslight you properly. Come on, don’t you want to be given an endless stream of stuff that looks correct?
I was using a set of template files designed for LLMs to review that project. It is absolutely the fault of Claude that it told me to do something one way, then told me to try another, and when I reverted said that the original was the optimal approach.
Where I find it helps is in getting initial starts and as a start to code review. But in both cases they aren't ever operating on their own and their feedback is filtered through myself or another senior dev.
Good thing all the companies leaning hard on AI 10 X'd their profits... Wait...
So many comments on just the title .... Could come from Anthropic directly.
There is literally zero basis for the claim made in the article, just arbitrage calculations over supposed token consumption on unstable test sets.
I have no idea if/how much these ~~stupid~~ fuckers spend to get more customers - and this "article" wasted a lot of time showing that they don't know either.
(Stupid is crossed out because I don't think they're stupid. Which makes it way worse in my book.)
Love that for them
Uh .... This doesn't seem like it will end well.
Joke's on them. I'm not paying them a dime.
looks inside
But if you use the $100 a month Claude Max plan, and you would use it to the weekly limit by going full ‘agentic coding’ (so almost no human in the loop) you would use an amount of tokens that would cost you more than $1000 at API-pricing.
If I watch 600 movies every day on my Netflix subscription, I am using more energy than I pay them for. Obviously everyone is like me. Therefore they are losing money overall.
Wait, their (Netflix) earnings say they made a profit last quarter. But my calculations were watertight!
Anthropic is probably not net positive, but they are not spending 10x what people pay them for tokens.
Except that you would need 50 devices to do that and the most expensive Netflix plan only lets you stream up to 4 devices at a time. Considering the average 2 hours per movie, that's 48 movies per day. That's without mentioning that you'd need to automate this because you'd be asleep for 8 of those 24 hours.
The point is, your analogy doesn't work. There's no reason why someone would do what you're describing and it'd also be very hard to do.
Using up all of your tokens though? Just use agentic coding, set the "thinking" to max and you'll see how quickly and easily you can burn through them. Share your account and you'll burn them even faster.
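For anyone who wants to check the Netflix numbers above, here is a quick sketch of the arithmetic. The 2-hour average runtime and the 4-stream cap are the figures from this comment; the variable names are just made up for illustration.

```python
import math

# Figures taken from the comment above; names are illustrative only.
MOVIES_PER_DAY_CLAIMED = 600   # the hypothetical viewer in the analogy
HOURS_PER_MOVIE = 2            # assumed average runtime
HOURS_PER_DAY = 24
MAX_STREAMS = 4                # simultaneous streams on the top plan, per the comment

# How many movies a single stream can play back-to-back in one day.
movies_per_stream = HOURS_PER_DAY // HOURS_PER_MOVIE

# Ceiling on movies per day when every allowed stream runs nonstop.
max_movies_per_day = movies_per_stream * MAX_STREAMS

# Streams you would actually need to reach the claimed 600 per day.
streams_needed = math.ceil(MOVIES_PER_DAY_CLAIMED / movies_per_stream)

print(max_movies_per_day, streams_needed)
```

So the plan tops out at 48 movies a day, and 600 a day would take 50 streams running around the clock.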
You're right that people can and do max out the expensive plans. It's very difficult to say how often. I just think a majority of Anthropic's customers are businesses, who often pay per token for easier scaling etc. According to the company, enterprise employees use about $150-$250 per month (possibly Max plans have similar use, which would support your view), but that's in API tokens, which they probably have big margins on, so it's less likely Anthropic is burning money on inference. If you want to convince me otherwise, it's not enough to say that it can happen; it has to be frequent enough to outweigh the B2B sales. They are, however, likely losing money overall due to training costs etc.
I mean it's not very hard to use up your Claude Max plan, but I find it hard to believe a majority of users do so consistently.
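To make the "how often" question concrete, here is a rough sketch of the break-even share of subscribers who would have to max out the plan before the subscription loses money on inference. The $100 price and the $1000 API-equivalent figure come from the quote earlier in the thread, and $150 is the low end of the enterprise figure mentioned above. The `cost_ratio` (real inference cost as a fraction of API list price) is purely an assumption; nobody in this thread knows it, and the function name is made up.

```python
def break_even_heavy_share(plan_price, heavy_usage, typical_usage, cost_ratio):
    """Fraction of subscribers maxing out the plan at which average
    inference cost per subscriber equals the plan price.

    Usage figures are in API-equivalent dollars per month.
    cost_ratio is an ASSUMED cost-to-list-price ratio, not a known number.
    Solves: cost_ratio * (f * heavy + (1 - f) * typical) = plan_price for f.
    """
    share = (plan_price / cost_ratio - typical_usage) / (heavy_usage - typical_usage)
    # Clamp: 0.0 means a loss even with no heavy users, 1.0 means never a loss.
    return max(0.0, min(1.0, share))


# Thread figures: $100 plan, $1000 heavy use, $150 typical use.
# The cost ratios below are hypothetical scenarios, not data.
for ratio in (1.0, 0.5, 0.05):
    print(ratio, round(break_even_heavy_share(100, 1000, 150, ratio), 3))
```

If API pricing were pure cost (ratio 1.0), the plan would lose money even with zero heavy users. At a hypothetical 50% ratio, it takes roughly 6% of subscribers maxing out to break even. So the whole argument hinges on a margin that none of us can see.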