Link Copied!

内存雇佣兵:谁从 5,000 美元的 RTX 5090 中获利?

当消费者指责英伟达时,一个沉默的内存制造商卡特尔正在策划 80% 的价格上涨,这已将旗舰 RTX 5090 推向 5,000 美元。

🌐
语言说明

本文以英文撰写。标题和描述已自动翻译以方便您阅读。

一款高端 NVIDIA GeForce RTX 5090 显卡被拆解,露出 GPU 核心周围的 GDDR7 内存芯片,发出金色的光芒。

Key Takeaways

  • The Memory Cartel: Three companies (Samsung, Micron, and SK Hynix) control 95% of the global DRAM market, allowing them to dictate costs for Nvidia’s partners.
  • The AI Tax: High-Bandwidth Memory (HBM) demand for AI servers has sucked up nearly all advanced lithography capacity, leaving consumer GDDR7 in a permanent supply deficit.
  • Margin Manipulation: Internal manufacturing data suggests memory makers are intentionally shifting wafers away from AI HBM to simpler DDR5 and GDDR7 chips to force higher margins on “budget” components.
  • Historical Rhyme: The current crisis mirrors the 1993 Sumitomo factory explosion that crippled the global epoxy resin supply; however, this time the “explosion” is the sudden vacuum of AI infrastructure demand.

The $5,000 Sticker Shock

If you walked into a high-end PC hardware boutique in late December 2025, you might have seen a price tag that looked like a typo: $4,999 for an NVIDIA GeForce RTX 5090. This wasn’t a scalper’s eBay listing or a gold-plated custom build. It was the retail reality for a card that launched just months earlier with an already controversial $1,999 MSRP.

While the gaming community has spent the last year flaming Jensen Huang and Nvidia for “greed,” a deeper look into the bill of materials (BoM) reveals a different villain. Nvidia’s margins on the Blackwell series are indeed healthy; however, the recent 80% price explosion isn’t padding Nvidia’s pockets. Instead, it is being devoured by a trio of “Memory Mercenaries” (Samsung, Micron, and SK Hynix) who have successfully unbundled the cost of VRAM from the GPU silicon itself.

Analysis of Q3 Nvidia Earnings reveals that the hardware market is increasingly decoupling from consumer reality. The RTX 5090 is no longer just a graphics card; it is a victim of a geopolitical and industrial squeeze that is making high-end PC gaming a luxury hobby for the 1%.

The Escalation: A History of Scarcity

The 2025 GPU market was supposed to be a return to normalcy. After the crypto-mining craze and the pandemic supply chain snarls, the “Blackwell” generation promised massive performance gains at stable, if premium, prices. The RTX 5090 launched in late January 2025, featuring 32GB of cutting-edge GDDR7 memory.

For the first six months, the plan held. Cards were available at or near MSRP. However, by November 2025, the floor fell out. GDDR7 prices began a vertical climb, rising nearly 90% in a single quarter. Unlike previous shortages driven by miners or lockdowns, this was a systemic “capacity shift.” The memory makers realized they could make more money by not making GDDR7.

The Early Days of the Shortage

Historically, graphics memory (GDDR) was a high-volume, relatively stable commodity. But the move to the GDDR7 standard changed the physics. GDDR7 uses Pulse Amplitude Modulation (PAM3) to achieve its staggering bandwidth, making the chips harder to yield than the older GDDR6x used in the 40-series. This technical hurdle gave the “Big Three” memory makers an excuse to tighten the screws.

Understanding the Memory Mercenaries

The global DRAM (Dynamic Random-Access Memory) market is an effective oligopoly. Samsung (45% share), SK Hynix (28%), and Micron (22%) control the world’s supply of bits. When all three decide to pivot their strategy simultaneously, the impact is equivalent to an industrial heart attack.

The HBM Vortex

The primary driver of the GDDR shortage is the insatiable demand for High-Bandwidth Memory (HBM3e). HBM is the “special sauce” that makes AI accelerators, such as the Nvidia H100 and Blackwell B200, work. Because HBM yields are notoriously low (Samsung has struggled to get HBM3e yields above 50%), the memory makers have to allocate a disproportionate number of silicon wafers to meet their AI contracts.

When an AI server company like Meta or Microsoft offers to pay a a 5x premium for HBM, the memory makers have no choice but to rob Peter (the gamer) to pay Paul (the data center).

The Forbidden Question: Shortage or Sabotage?

Is the “VRAM shortage” a genuine supply constraint, or a coordinated capacity shift to force higher margins? Within the 50-person specialized Slack for “Hardware Supply Chain Analysts,” the consensus is shifting toward the latter.

The economics of a massive capacity shift are telling. Samsung is reportedly moving up to 80,000 wafers per month, nearly half of its HBM3E capacity, back to “standard” DDR5 and GDDR7. Why? Because the yields on HBM are so challenging that the effective margin on a successful GDDR7 chip, inflated by a “shortage tax,” is now actually higher than the margin on AI memory.

By creating an artificial scarcity in the consumer market, the memory mercenaries are forcing Nvidia and its AIB (Add-in Board) partners like ASUS and MSI to accept “unbundled” costs. In previous generations, Nvidia bought “kits” (GPU + Memory) and sold them to manufacturers. Now, many partners are being forced to source memory separately at spot prices, meaning they—and you—are paying the “Mercenary Tax.”

The Historical Rhyme: Sumitomo and the 1993 Resin Crisis

History doesn’t repeat, but it certainly rhymes. In July 1993, an explosion at a Sumitomo Chemical factory in Niihama, Japan, triggered a global panic. That single factory produced over 60% of the world’s supply of epoxy resin, which is the glue used to package almost every semiconductor on Earth.

Within weeks, RAM prices quadrupled. PC makers like Apple and Dell had to hike prices overnight or face bankruptcy. In late 2025, the “explosion” isn’t a factory fire; it’s the AI demand for HBM. The result is the same: a single bottleneck (wafer allocation at Samsung and SK Hynix) is dictating the cost of a device as complex as the RTX 5090.

Second-Order Effect: The Death of the $1,000 Build

The most tragic consequence of this memory squeeze is the death of the “budget” PC. In 2020, people could build a decent 1440p gaming machine for $1,000. In 2026, that same $1,000 barely covers the VRAM and the power supply.

The analysis of the Transformer Crisis shows that the infrastructure costs of the digital age are rising across the board. When VRAM costs alone exceed the entire Bill of Materials of a mid-range 2020 card, the “entry-level” gamer is effectively priced out of the hobby. The hardware market is seeing a “K-shaped” recovery: the wealthy buy $5,000 GPUs, while most others are forced onto cloud streaming services like GeForce Now.

The Data: The $10 Tax

The “Quantitative Hook” that supply chain analysts are watching is the specific “memory tax” already being passed down. As of December 2025, AMD has begun charging its partners an additional $10 per 8GB of VRAM as a “sourcing surcharge.”

This tax is set to double in January 2026.

To put that in perspective, an RTX 5090 with 32GB of VRAM will see its raw material cost increase by $80 just from this surcharge alone. By the time that reaches the consumer after AIB margins, distributor cuts, and retail markups, that $80 turns into a $300 hike at the register.

Component2024 Cost (Est.)2025 Cost (Late)2026 Projected
GPU Silicon$600$650$700
32GB GDDR7$180$340$520
PCB & VRM$120$150$180
The “Mercenary Tax”$0$100$250
Total BoM$900$1240$1650

Note: These are estimates for the raw manufacturing cost before R&D, marketing, and profit margins.

Expert Perspectives

“The industry is entering an era of ‘Resource Nationalism’ in silicon,” says an analyst from the Hardware Supply Chain Slack. “Samsung and Hynix know they hold the keys to the kingdom. If Nvidia wants 32GB of VRAM for every gamer, they have to pay the ‘AI Opportunity Cost’ for every one of those bits.”

Nvidia, for its part, is staying silent. Publicly, they blame “unprecedented demand.” Privately, they are reportedly scrambling to qualify smaller memory makers like Winbond for their lower-tier cards to break the “Big Three” stranglehold.

What’s Next?

The outlook for 2026 is grim for those hoping for a price drop. The January 2026 tax doubling is a hard ceiling on any potential discounts.

Short-Term (1-2 years)

Market analysts anticipate the “Super” refresh of the 50-series to come with less VRAM than expected. The market may see an RTX 5080 “Lite” with 12GB or 16GB as Nvidia tries to keep the shelf price under $1,200.

Medium-Term (3-5 years)

The hope lies in “Co-Packaged Optics” and new packaging techniques that allow GPU makers to use cheaper, slower RAM in larger quantities, offset by massive on-die caches. However, this architectural shift is years away.

What This Means for You

If you’re a Gamer:

  • Stop waiting for a “sale.” The BoM costs are rising too fast for retailers to offer deep discounts.
  • Consider 16GB as the “new 8GB.” If you’re building a new PC, do not buy a card with less than 16GB of VRAM, or your system will be obsolete by the time the memory prices stabilize in 2027.

If you’re an Investor:

  • Watch the Samsung earnings reports for “DRAM Margin Expansion.” If their profits are up while their wafer shipments are flat, you’re seeing the Mercenary Tax in action.
  • Be wary of AIB partners like ASUS and MSI. They are being squeezed from both sides: Nvidia’s rigid MSRPs and the memory makers’ skyrocketing spot prices.

The Final Bill

The $5,000 RTX 5090 is not an anomaly; it is a prophecy. It represents the moment the consumer PC market collided with the AI infrastructure boom. While it is easy to blame Nvidia for the price of their logo, the true architects of this crisis are the memory makers in Suwon and Boise. Until the “Memory Mercenaries” decide that gamers are as profitable as AI researchers, the PC will continue to drift into the exclusive possession of the ultra-high-net-worth individual.


Sources

🦋 Discussion on Bluesky

Discuss on Bluesky

Searching for posts...