this post was submitted on 08 May 2026
62 points (97.0% liked)

Piracy: ꜱᴀɪʟ ᴛʜᴇ ʜɪɢʜ ꜱᴇᴀꜱ

69109 readers
189 users here now

⚓ Dedicated to the discussion of digital piracy, including ethical problems and legal advancements.

Rules • Full Version

1. Posts must be related to the discussion of digital piracy

2. Don't request invites, trade, sell, or self-promote

3. Don't request or link to specific pirated titles, including DMs

4. Don't submit low-quality posts, be entitled, or harass others



Loot, Pillage, & Plunder

📜 c/Piracy Wiki (Community Edition):

🏴‍☠️ Other communities

FUCK ADOBE!

Torrenting/P2P:

Gaming:


💰 Please help cover server costs.

Ko-Fi Liberapay
Ko-fi Liberapay

founded 2 years ago
MODERATORS
top 5 comments
sorted by: hot top controversial new old
[–] SnoringEarthworm@sh.itjust.works 38 points 4 days ago (1 children)

Besides selling the most sought-after hardware, NVIDIA is also developing its own models, including NeMo Megatron models. These were trained using NVIDIA’s own hardware and with help from large text libraries, much like other tech giants do.

...

As the case progressed, the authors also brought up NVIDIA’s contacts with Anna’s Archive, inquiring about “high-speed access” to the shadow library’s massive collection of pirated books.

This is probably why Anna's Archive hasn't been taken down yet - the big fish are pirating, too.

[–] Grumpus_Maximus@thelemmy.club -3 points 4 days ago

these guys gonna lose to china. already chinese coding models almost the same and 1/10 the price. check out z.ai and others

[–] nutbutter@discuss.tchncs.de 14 points 4 days ago (2 children)

Tldr? What is shadow library scripts?

[–] starweasel@hexbear.net 11 points 4 days ago

scripts that NVIDIA distributed to clients so they could automatically download and preprocess The Pile dataset.

sounds like they allegedly wrote some stuff to get faster downloads/avoid throttling while they were allegedly pirating books from shadow libraries for their AI

[–] chahk@beehaw.org 10 points 4 days ago

In addition, the motion also targets the contributory copyright infringement allegations, which center on scripts and tools NVIDIA allegedly distributed so corporate customers could automatically download ‘The Pile,’ the dataset that contains Books3.