Hardware

All things related to technology hardware, with a focus on computing hardware.

Rules:

Follow the Lemmy.world Rules - https://mastodon.world/about
Be kind. No bullying, harassment, racism, sexism etc. against other users.
No spam, illegal content, or NSFW content.
Please stay on topic; adjacent topics (e.g. software) are fine if they are strongly relevant to technology hardware. Another example would be business news for hardware-focused companies.
Please try to post original sources when possible (as opposed to summaries).
If posting an archived version of the article, please include a URL link to the original article in the body of the post.

Icon by "icon lauk" under CC BY 3.0

founded 2 years ago

MODERATORS

Alphane_Moon@lemmy.world

Rekall_Incorporated@piefed.social

Anthropic in early talks to buy DRAM-less AI inference chips from UK startup — Fractile's SRAM architecture reduces need for pricey memory during extreme pricing and shortage crunch (www.tomshardware.com)

submitted 9 hours ago by Rekall_Incorporated@piefed.social to c/hardware@lemmy.world

The Claude developer is exploring a fourth chip supplier alongside Nvidia, Google, and Amazon.

cyrl@lemmy.world 1 point 1 hour ago

Anything that reduces the footprint of LLMs is welcome, however...

- Making LLM compute cheaper in datacentres won't lower total power, cooling, space, or water consumption. As with adding lanes to a road and getting more traffic, cheaper compute will just mean more usage (and a short-term bump in margins for the LLM owners).
- These are still highly dedicated chips that are always going to be bound up in mega-scale datacentre deployments.
- What happens if there is a paradigm shift in the exact compute architecture? Loads of junk servers and no applications able to make use of such a glut.
- These do nothing to push LLMs out of the datacentre and into non-corporate hands, which is the only place where we might see fewer privacy concerns, less corporate control, etc.

If we're stuck with the current compute/corporate paradigms, at least having alternatives nibble at the unhealthy dominance of Nvidia and the cloud giants is some small benefit.