Selfhosted

60093 readers

622 users here now

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.

Rules:

Be civil: we're here to support and learn from one another. Insults won't be tolerated. Flame wars are frowned upon.
No spam.
Posts here are to be centered around self-hosting. Please ensure it is clear in your post how it relates to self-hosting.
Don't duplicate the full text of your blog or git here. Just post the link for folks to click.
Submission headline should match the article title.
No trolling.
Promotion posts require your active participation in selfhosting or related communities, or the post will be removed. No more than 10% of your posts or comments may be self-promotional, or your post will be removed. F/LOSS Exception: If your post is about a project that is completely open source & can be self-hosted in full without payment, and your account is at least 30 days old, your post is exempt from this rule as long as you continue to engage in comments.

Resources:

selfh.st Newsletter and index of selfhosted software and apps
awesome-selfhosted software
awesome-sysadmin resources
Self-Hosted Podcast from Jupiter Broadcasting

Any issues on the community? Report it using the report flag.

Questions? DM the mods!

founded 3 years ago

MODERATORS

curbstickle@anarchist.nexus

curbstickle_lw@lemmy.world

401

Western Digital details 14-platter 3.5-inch HAMR HDD designs with 140 TB and beyond (www.tomshardware.com)

submitted 4 months ago by veeesix@lemmy.ca to c/selfhosted@lemmy.world

134 comments fedilink hide all child comments

cross-posted from: https://beehaw.org/post/24650125

Because nothing says "fun" quite like having to restore a RAID that just saw 140TB fail.

Western Digital this week outlined its near-term and mid-term plans to increase hard drive capacities to around 60TB and beyond with optimizations that significantly increase HDD performance for the AI and cloud era. In addition, the company outlined its longer-term vision for hard disk drives' evolution that includes a new laser technology for heat-assisted magnetic recording (HAMR), new platters with higher areal density, and HDD assemblies with up to 14 platters. As a result, WD will be able to offer drives beyond 140 TB in the 2030s.

Western Digital plans to volume produce its inaugural commercial hard drives featuring HAMR technology next year, with capacities rising from 40TB (CMR) or 44TB (SMR) in late 2026, with production ramping in 2027. These drives will use the company's proven 11-platter platform with high-density media as well as HAMR heads with edge-emitting lasers that heat iron-platinum alloy (FePt) on top of platters to its Curie temperature — the point at which its magnetic properties change — and reducing its magnetic coercivity before writing data.

you are viewing a single comment's thread
view the rest of the comments

[–] thejml@sh.itjust.works 7 points 4 months ago (1 children)

Rebuild time is the big problem with this in a RAID Array. The interface is too slow and you risk losing more drives in the array before the rebuild completes.

[–] rtxn@lemmy.world 8 points 4 months ago* (last edited 4 months ago) (3 children)

Realistically, is that a factor for a Microsoft-sized company, though? I'd be shocked if they only had a single layer of redundancy. Whatever they store is probably replicated between high-availability hosts and datacenters several times, to the point where losing an entire RAID array (or whatever media redundancy scheme they use) is just a small inconvenience.

[–] enumerator4829@sh.itjust.works 4 points 4 months ago

Fairly significant factor when building really large systems. If we do the math, there ends up being some relationships between

disk speed
targets for ”resilver” time / risk acceptance
disk size
failure domain size (how many drives do you have per server)
network speed

Basically, for a given risk acceptance and total system size there is usually a sweet spot for disk sizes.

Say you want 16TB of usable space, and you want to be able to lose 2 drives from your array (fairly common requirement in small systems), then these are some options:

3x16TB triple mirror
4x8TB Raid6/RaidZ2
6x4TB Raid6/RaidZ2

The more drives you have, the better recovery speed you get and the less usable space you lose to replication. You also get more usable performance with more drives. Additionally, smaller drives are usually cheaper per TB (down to a limit).

This means that 140TB drives become interesting if you are building large storage systems (probably at least a few PB), with low performance requirements (archives), but there we already have tape robots dominating.

The other interesting use case is huge systems, large number of petabytes, up into exabytes. More modern schemes for redundancy and caching mitigate some of the issues described above, but they are usually onlu relevant when building really large systems.

tl;dr: arrays of 6-8 drives at 4-12TB is probably the sweet spot for most data hoarders.

[–] brygphilomena@lemmy.dbzer0.com 2 points 4 months ago

I'd imagine they are using ceph or similar.

You have disk level protection for servers. Server level protection for racks. Rack level protection for locations. Location level protection for datacenters. Probably datacenter level protections for geographic regions.

It's fucking wild when you get to that scale.

[–] thejml@sh.itjust.works 1 points 4 months ago (1 children)

True, but that's going to really be pushing your network links just to recover. Realistically, something like ZFS or a RAID-6 with extra hot spares would help reduce the risks, but it's still a non trivial amount of time. Not to mention the impact to normal usage during that time period.

[–] frongt@lemmy.zip 3 points 4 months ago (1 children)

Network? Nah, the bottleneck is always going to be the drive itself. Storage networks might pass absurd numbers of Gbps, but ideally you'd be resilvering from a drive on the same backplane, and SAS-4 tops out at 24 Gbps, but there's no way you're going to hit that write speed on a single drive. The fastest retail drives don't do more than ~2 Gbps. Even the Seagate Mach.2 only does around twice that due to having two head actuators.

[–] thejml@sh.itjust.works 1 points 4 months ago

100%. But the post i was responding to was talking about recovering a failed array from other copies, not locally.