Technology

83990 readers

3243 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 2 years ago

MODERATORS

L3s@lemmy.world

enu@lemmy.world

technopagan@lemmy.world

L4s@lemmy.world

L3s@hackingne.ws

162

Linux kernel czar says AI bug reports aren't slop anymore (www.theregister.com)

submitted 3 weeks ago by General_Effort@lemmy.world to c/technology@lemmy.world

98 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] riskable@programming.dev 5 points 3 weeks ago (12 children)

Same places as usual: Academia and open source foundations.

That's where 99% of all advancements in AI come from. You don't actually think Big AI is paying as many people to do computer science and mathematics research as all the universities in the world (with computer science programs)?

It's the same shit as always: Big companies commercialize advancements and discoveries made by scientist and researchers from academia (mostly) and give almost nothing back.

Big AI has partnerships with tons of schools and if it weren't for that, they wouldn't be advancing the technology as fast as they are. In fact, the only reason why many of these discoveries are made public at all is because of the agreements with the schools that require the discoveries/papers be published (so their school, professors, researchers, and students can get credit).

Like I was saying before: You don't need a trillion dollars in data centers to do this stuff. Almost all the GPUs and special chips being used (and preordered, sigh) by Big AI are being used to serve their customers (at great expense). Not for training.

Training used to be expensive but so many advancements have been made this is no longer the case. Instead, most of the resources being used in "AI data centers" (and research) is all about making inference more efficient. That's the step that comes after you give an AI a prompt.

Training a super modern AI model can be done with a university's data center or a few hundred thousand to a few million dollars of rented GPUs/compute. It doesn't even take that long!

Generative AI improves at a ridiculously fast rate. In nearly all the ways you could think of: Training, inference (e.g. figuring out user intent), knowledge, understanding, and weirder, fluffier stuff like "creativity" (the benchmarks of which are dubious, BTW).

[–] XLE@piefed.social -2 points 3 weeks ago* (last edited 3 weeks ago) (11 children)

Before we spin into a tangent about theory and "what ifs" etc, care to link me to all these great models from academics and open-source institutions?

Because right now, the only companies I see making advancements in "AI" are burning through obscene amounts of cash, with no end in sight.

And there is no evidence the cost of inference is going down, and even Anthropic admits training will continue burning resources.

[–] riskable@programming.dev 1 points 3 weeks ago (4 children)

You seem to be unaware that it only takes about four NVIDIA HGX H100 nodes (32 GPUs) to train something like qwen3.5:122b. That model is about as good as ChatGPT was six months to a year ago (for the usual use cases). That would take a long ass time though (over a year) so you'd want probably 50-100 HGX H100s (or lots of the newer, cheaper ARM-based hardware devices).

The weights for qwen3.5:122b are open. That means that if you've got the hardware (loads of universities and non-profits have waaaay TF more than 4 HGX H100 nodes) you can continue modern AI development. Everything you need is right there on Huggingface! Deepseek's stuff is also open I think but I forget. Aside: In my head, I hold the qwen models as "the gold standard" based on many articles I've read about them but AI moves so fast, there might be better stuff out on any given day! I haven't read AI news in like a week so I could be all wrong and qwen3.5 is now sooo obsolete, hehe (that's how it feels to follow AI news, anyway 🤣).

Even more interesting: qwen3.5:122b isn't just an LLM. It does visual reasoning (e.g. give it a picture of a plant and ask it to identify it, count the number of screws in an image, estimate distances, etc) as well as the usual LLM stuff. You can read all about it here:

https://ollama.com/library/qwen3.5:122b

...and if you install ollama and spend $20 on ollama.com's cloud service you can actually try it out without having to own enough GPUs to cover the 245+GB requirement. I highly recommend that service! You can try out all the latest & greatest models on your local PC (or phone!) for any purpose you want for a $20. Whenever a new model is out they usually have it up on their servers within a day or two and it's fast, too.

FYI: I've used ollama cloud to evaluate models for coding (web dev with Python back end) and qwen3.5:122b is fantastic. It's not as good as Claude Opus 4.6 but it's close (and cheap) enough that you can just make up for the mistakes with extra instances that check the output with a critical eye (the latest trick in AI-based coding to get good output).

For reference, the University of Texas at Austin has data centers with 4,000 NVIDIA Blackwell (B200/GB200) GPUs, Harvard has 1,144 GPUs, and the University of Cambridge & Bristol (in the UK) has some monstrous mix of Intel and AMD GPUs. All three are perfectly capable of training new models from scratch or using continuing development on existing open-weight models like Deepseek and Qwen.

Generative AI isn't going anywhere. Furthermore, advancements in that space happen so fast that it's likely that in a few years we won't need so many GPUs/VRAM to train models. Especially if ternary models (and similar, like Google's TurboQuant tech) take off.

I know this is a long comment but I want to point something else out: If OpenAI and Anthropic go bust, that would flood the market with cheap GPUs. It would be a total price collapse and you can bet your ass that clever universities and service providers (like Amazon compute, but 3rd party) would snap those up and bring down prices across the board.

load more comments (3 replies)

load more comments (9 replies)