What is the Watts-per-Petabyte shift in storage?

Hyperscalers used to price storage in dollars per gigabyte. AI starves data centers of power, so the binding constraint becomes watts per petabyte. Pure Storage CEO Charlie Giancarlo named this shift directly, and it reorders every purchasing decision around energy instead of raw capacity.

Why does this threaten Seagate and Western Digital?

Hyperscalers buy most of the bulk HDDs. As they move to ultra-dense flash, HDD volumes drop and fixed costs spread across fewer units, so disk loses its cost advantage. Western Digital's diverging flash and HDD revenue already shows this is structural, and the gap keeps widening.

How Pure Storage Won a Hyperscaler by Selling Back Megawatts

Q: Why did a hyperscaler buy proprietary flash instead of commodity SSDs?

AI GPU clusters left data centers starved for power. Pure Storage's DirectFlash skips the commodity SSD controller and drops over-provisioning and garbage-collection overhead, so each rack reclaims watts. When the constraint is watts per petabyte, that power saving beats the cost edge of commodity drives.

A Hyperscaler Just Bought Proprietary Hardware. That Should Terrify Seagate.

For fifteen years, hyperscalers set the terms of infrastructure buying. Build it yourself. Buy commodity parts. Treat every box like a replaceable unit. That white-box model crushed margins across servers, networking, and storage. So when one of the top four cloud providers signed a deal to pull out hard drives and replace them with Pure Storage's DirectFlash modules, most coverage defaulted to the old explanation: flash got cheap enough, disk finally lost on cost.¹ That misses the real driver.

Power constraints, not flash prices, drove the purchase. AI clusters are eating data center power budgets, and the fastest way to free capacity is to remove racks of spinning disks that sit there drawing electricity just to hold cold data. Pure won because its architecture gives back more watts per rack than commodity SSDs can. Call it the Joule-Byte Inversion: the point where storage stops being judged by dollars per gigabyte and starts being judged by watts per petabyte.

The power constraint, not the media cost curve, explains the hyperscaler's break from commodity storage. It also explains why Pure can plausibly say the last hard drive gets sold in 2028, and why Seagate and Western Digital are staring at something worse than a normal downturn.

Why the AI Power Wall Made Watts the New Currency

SemiAnalysis put it plainly: "The limiting factor for AI scaling is no longer silicon availability, but the ability to secure gigawatts of power and cool the resulting infrastructure."² That one sentence changes storage buying downstream. The IEA says electricity use from data centers, AI, and crypto could double by 2026.³ Every watt inside a data center is now contested.

Operators boxed in by grid limits usually do one of two things: wait years for new capacity or rip watts out of existing racks. Storage is the easiest place to start. A rack full of hard drives storing archive data burns power nonstop and takes up space that GPU systems now claim more profitably. Meta's next-generation design is, in its own words, "focused entirely on supporting the power and cooling requirements of liquid-cooled AI hardware."⁴ Anything not feeding that build-out is exposed.

Charlie Giancarlo made the tradeoff explicit: "AI is power-hungry. To feed these GPUs, data centers have to find power savings elsewhere, and the easiest place to find it is by replacing spinning mechanical disks with flash."⁵ That shift puts storage budgets in the same internal fight as cooling systems, power distribution, and GPU deployment.

a glowing GPU rack and a row of spinning hard drives connected to the same single power meter, the GPU side pulling the needle into the red — Inside a power-capped data center, every watt a hard drive draws is a watt the AI cluster cannot have.

The Joule-Byte Inversion: From Dollars-per-Gigabyte to Watts-per-Petabyte

Hyperscalers used to price storage in dollars per gigabyte. Now they price it in watts per petabyte.⁶ Once power becomes the limiting metric, HDDs lose the one comparison that kept them attractive.

In a $/GB world, spinning disk still wins on raw bulk capacity. In a Watts/PB world, the motor inside every drive becomes a tax you pay every second. The SNIA benchmark matters because, at scale, moving from HDD to high-density QLC NAND delivers "a non-linear reduction in total cost of ownership when factoring in power, cooling, and rack footprint."⁷ Storage planning used to scale roughly with capacity; once power and cooling dominate, that math stops behaving linearly. Backblaze's field data points at the same problem: as HDD capacities rise, failure rates and power use become "critical variables in total cost of ownership."⁸ Bigger drives help on paper, but hyperscalers do not run storage on paper.

Metric	$/GB era	Watts/PB era
Binding constraint	Capital per byte	Power per rack
Winning media	Nearline HDD	High-density QLC flash
Cost model	Linear with capacity	Non-linear (power + cooling + space)
Buyer's question	How cheap per byte?	How many megawatts does it free?

For the buyer, the real output was reclaimed power budget, not merely a media swap.

The Controller Bypass Moat: Why Commodity SSDs Couldn't Win This Deal

More striking than the move from HDDs to flash was the move from commodity SSDs to Pure's proprietary modules. That is the part of the deal vendors should lose sleep over. Pure says its systems cut energy use and emissions "by up to 85% compared to competitive all-flash systems, and even more when replacing traditional spinning disk."⁹ If that claim is even directionally right at hyperscale, then normal SSD arrays were never close to winning this account.

The key difference is the controller: commodity SSDs still emulate hard drives through legacy interfaces. That means each drive carries its own controller, does its own over-provisioning, and burns power on garbage collection just to preserve compatibility. DirectFlash removes that layer. Pure's engineers describe it this way: "By speaking directly to the NAND flash and bypassing the traditional SSD controller, DirectFlash modules eliminate the over-provisioning and garbage collection overhead that plagues commodity SSDs."¹⁰ Pure's 10-K says the design delivers "higher density, better performance, and lower power consumption."¹¹ In practice, Pure packs more NAND into each rack while wasting less energy on translation overhead.

That is the Controller Bypass Moat. Under the old buying model, SSD controller overhead looked trivial. Under a watts-per-petabyte model, it looks like dead weight. Blocks & Files called the deal "a watershed moment," and they are right. A major hyperscaler choosing proprietary enterprise flash over commodity SSDs or HDDs signals a break in cloud storage doctrine.¹²

a sleek flash module speaking directly to a wall of raw memory chips, while a bulky translator box labeled controller sits unplugged and discarded on the floor — DirectFlash bypasses the SSD controller entirely, eliminating the garbage-collection overhead that commodity drives carry.

The HDD Volume Death Spiral and the Eviction of Spinning Disk

If hyperscalers stop buying bulk HDDs, the scale economics that kept disk cheap start to crack. Hyperscalers are the anchor customers for bulk storage. Take them out of the market and the whole cost structure weakens.

You can already see the split in company filings. Western Digital said flash revenue rose while HDD revenue reflected "ongoing cloud inventory digestion."¹³ Seagate described "a complex demand environment in the mass capacity market as cloud providers optimize their infrastructure."¹⁴ That is not normal cyclical weakness. If this were just macro softness, both product lines would be under pressure in the same way.

Projected data center, AI and crypto electricity use — The IEA projects consumption could double between 2023 and 2026, the macro force evicting power-hungry disks.

This is the death spiral. As hyperscaler HDD orders fall, factories spread fixed costs over fewer drives, raising unit economics and weakening disk's price edge. Then flash gets relatively easier to justify, especially as QLC prices keep moving down. TrendForce says QLC enterprise SSD adoption is "accelerating rapidly as the price-per-capacity gap with nearline HDDs narrows, driven by AI server demand."¹⁵ Pure's claim that the last hard drive sells in 2028 looked aggressive in 2023.¹⁶ It looks less aggressive once hyperscaler demand stops supporting the economics of the disk business.

The Broader Rule: Sell the Bottleneck Relief

The lesson here reaches beyond storage. When one resource becomes scarce, every buying decision around it gets repriced. The winners are the companies that give the scarce resource back.

That is what Pure sold. Not flash. Not even storage performance. It sold recovered power budget that the customer could redirect into GPUs.

Founders should pay attention to that pattern. Audit the waste your customer has learned to ignore. Find the metric they used to optimize for, then identify the one they are now forced to ration. The moment that changes, old incumbents start defending the wrong thing.

And buyers should stop pretending commodity is a principle. It is a tactic. Hyperscalers stuck with commodity hardware for fifteen years because it fit the binding constraint of the era. The moment proprietary gear paid better in megawatts, doctrine vanished.

If AWS, Meta, or Microsoft start treating idle storage watts as GPU budget, Seagate and Western Digital won't be fighting a cycle. They'll be fighting demand collapse.

We are moving from an era where storage was measured in dollars per gigabyte to an era where it is measured in watts per petabyte.

Charlie Giancarlo · theCUBE Interview, Pure Accelerate 2024

Key Takeaways

1A top-four hyperscaler swapped its hard drives for Pure Storage DirectFlash to reclaim megawatts for AI GPU clusters. Cost-per-terabyte never entered the decision.
2Pure Storage says its systems cut energy and emissions by up to 85% against rival all-flash arrays, and the gap widens when they replace spinning disk.
3SNIA found that moving from HDD to high-density QLC NAND drops total cost of ownership sharply once you count power, cooling, and rack footprint.
4The IEA projects that data centers, AI, and crypto could double their electricity use by 2026.
5Western Digital's flash revenue climbed while its HDD revenue fell on cloud inventory digestion, so the storage market is now split by media type.

Keywords

DirectFlashHyperscaler StorageQLC NANDData Center PowerPure StorageHDD Decline

Back to Articles

Share:

X LinkedIn WhatsApp Facebook