结构化总结：内存短缺正在引发消费电子产品的重新定价

一句话摘要

AI 对高带宽内存（HBM）的爆炸性需求正在蚕食全球 DRAM 产能，导致消费电子产品——尤其是廉价智能手机——价格飙升，贫穷世界的数亿人正在被挤出智能手机拥有权。

核心论点

AI 数据中心对 HBM 的巨大需求 + 内存行业仅剩三家寡头的极度集中 + 制造商”永远让需求不被完全满足”的资本纪律 = 消费级 DRAM（DDR/LPDDR）供应骤降、价格飙升，消费电子产品正在经历历史性的重新定价。

关键数据

指标	数值
全球 DRAM 生产商数量	3 家（三星、SK 海力士、美光），占全球 90%+
HBM 晶圆分配占比变化	2023 年 2% → 2026 年底预计 20%
HBM 单位晶圆消耗量	1GB HBM = 3GB+ 普通 DRAM 的晶圆产能
LPDDR4 价格涨幅（2025Q1-2026Q1）	+250%
DDR5 价格涨幅（德国，一年内）	+414%
内存占廉价安卓手机 BOM 比例	从约 15% 飙升至 50%
全球智能手机出货量预计降幅（2026）	-13%（史上最大单年降幅）
印度 <$100 智能手机市场崩溃幅度（2026Q1）	-59% YoY
传音 2025 年净利润降幅	-54%
苹果为 iPhone LPDDR5X 支付的溢价	100%（向三星）
三大内存商 2025 年合计利润	$700 亿

因果链

AI 需求爆发 -> GPU/TPU 需要 HBM -> HBM 极度消耗晶圆（1GB = 3GB+ 普通 DRAM）
-> 制造商转移产能至 HBM（利润率 70% vs 20-30%）-> 三大寡头拒绝扩产
-> 普通 DRAM 供应骤降 -> DDR/LPDDR 价格飙升 200-400%
-> 廉价手机 BOM 中内存占比 15% -> 50% -> $50 手机变 $120+
-> 贫穷世界被挤出 -> 趋势向高端蔓延（三星亏损、苹果延期、Dell 涨价）
-> 2027 年 Vera Rubin 消耗 LPDDR > Apple + Samsung -> 富裕世界也将被波及

章节要点

1. 内存墙

处理器年进步 60%，DRAM 仅 7%（1980-90年代）
电容器缩小难度远超晶体管
单座晶圆厂造价 $150-200 亿

2. 寡头垄断与资本纪律

1990s ~20 家制造商整合至 3 家
核心教训：“永远让需求不被完全满足”

3. HBM 大淘金

SK 海力士 HBM 收入占 DRAM 收入 40%+
超大规模企业 30%+ 资本支出投向 DRAM

4. 廉价智能手机消亡

传音、OPPO、vivo、小米大幅削减出货
非洲 81% 智能手机出货属 <$200 类别

5. 危机向高端蔓延

三星手机事业部无法从自家内存事业部拿到长期协议
苹果 iPhone 18 延期至 2027 春

6. 唯一希望：中国长鑫存储

占中国 LPDDR 30%+，但也计划 20% 产能转 HBM

延伸思考

AI 以讽刺方式加深数字不平等——声称”民主化智能”的技术正在剥夺最贫穷人群的基本数字接入权
内存产能是刚性约束，AI 与消费电子的竞争是零和的，直到新产能上线（2027-2028）
三家公司控制 90% 全球产能 + “永不过度投资”行为模式 = 结构性供应限制
DeepSeek 多头潜在注意力、量化等是需求侧变通，但速度赶不上需求增长

内存短缺正在引发消费电子产品的重新定价

Simon Willison 评注：David Oks 给出了我所见过的最清晰的解释——为什么使用内存的消费产品在未来几年可能会显著涨价。简短的版本是：内存制造商——仅剩三家大公司——在任何时间点上能加工的晶圆数量是固定的。这些固定的晶圆产能被分配给三类产品：DDR（用于台式机和服务器）、LPDDR（用于手机和低功耗设备）和 HBM（用于 GPU）。直到最近，HBM 只占晶圆分配的 2%。AI 数据中心的巨大增长已将这一比例推高至 2026 年底的预期 20%，而”一个千兆字节的 HBM 消耗的晶圆产能是一个千兆字节 DDR 或 LPDDR 的三倍以上”。

以下为 David Oks 原文翻译：

AI 正在杀死廉价智能手机

过去几十年最令人瞩目的事情之一，是电脑变得多么便宜。

1985 年，如果你是一个相当富裕的美国人，你能买到的最好电脑是 IBM PC AT。PC AT 售价约 6,000 美元——按 2026 年美元计算约 19,400 美元——大约相当于美国人年收入中位数的四分之一；它搭载 Intel 80286 处理器，每秒大约能执行 90 万条指令。而今天，如果你在内罗毕或拉各斯的市场摊位上转转，你可以找到一部廉价智能手机——比如中国传音制造的 Tecno Spark Go——价格在 30 到 120 美元之间。这部手机的处理器每秒能执行数十亿次计算。

换句话说：你可以用 40 年前最佳消费设备 0.3% 的价格，买到一台强大数千倍的电脑。历史上没有其他商品经历过如此规模的成本下降：穷人现在口袋里装着的电脑，比几十年前世界上最富裕阶层能买到的设备强大许多个数量级。而消费电子产品的这场伟大降价，使得计算能力向全球人口扩散，堪称奇迹。数以亿计的世界最贫困人口能够访问互联网，正是因为像 Tecno Spark Go 这样的廉价智能手机。

这个时代正在走向终结。

2026 年，追踪智能手机市场的国际数据公司（IDC）预测，全球智能手机出货量将下降 13%，创下有史以来最大单年降幅。非洲和中东的跌幅最为剧烈，智能手机出货量将下降超过 20%，并且集中在智能手机行业最廉价的产品线上。这场冲击不是暂时的波动，而是”整个市场的结构性重置”：全球很大一部分人口正在被智能手机的价格挡在门外。

过去几十年消费电子产品越来越好、越来越便宜的趋势面临急剧逆转：贫穷世界正在进入一场智能手机危机。

这背后的原因很简单。

智能手机和其他电脑一样需要内存，而全球内存供应的弹性极低，因为内存真的非常难以生产。长期以来，大部分内存流向了智能手机和笔记本电脑；但在过去几年里，AI 作为内存的巨大且利润丰厚的消费者崛起。这导致了内存从消费电子大规模转移到 AI。不可避免的结果是，智能手机的制造成本比几年前贵了很多。短期内，这意味着那种将计算和互联网访问带给世界最贫困地区的廉价智能手机已经死了。

但按照目前的趋势，贫穷世界只是第一个被冲击的。如果 AI 消耗继续以当前速度增长——或者加速增长，这似乎显而易见——智能手机危机蔓延到富裕世界只是时间问题。消费电子产品即将变得昂贵得多。

关键就在于你拥有的内存

智能手机就是电脑。它们是非常小的电脑，还有触摸屏和无线电之类的东西。但就内部架构而言，智能手机基本上和笔记本电脑或服务器是一样的。它们有执行计算和运行逻辑的处理器。有保存处理器当前正在处理的数据的内存。有在设备关机时保留数据的存储。还有把所有这些东西连接在一起的电路板。

过去几十年计算领域的大故事是处理器。你可以把处理器想象成一个巨大的晶体管阵列——微小的开关，通过翻转开/关来执行逻辑运算。我们在缩小晶体管、提高其效率方面做得非常好，这意味着过去几十年处理器以指数级速率改进。这就是摩尔定律。

但处理器只能处理它能访问到的数据：而它能访问到的数据来自内存：具体来说，在现代电脑中来自 DRAM（“动态随机存取存储器”）。在这方面，故事大不相同。DRAM 确实在进步，但进步速度远不及处理器：在 1980 和 90 年代，处理器速度每年提高 60%，而 DRAM 速度每年只提高 7%。

这意味着过去几十年，计算机性能的主要瓶颈一直是内存。计算机科学家称之为”内存墙”。过去几十年计算机体系结构的大量工作都在寻找各种方法来绕过处理器和 DRAM 之间的不匹配。

那么为什么 DRAM 的进步没有处理器快？

简单来说：这就是一个非常难的问题。就像处理器是一个巨大的晶体管阵列，内存芯片基本上是一个巨大的存储单元阵列：每个存储单元既有一个晶体管，又有一个叫做电容器的存储单元，用于保存对应一个比特数据的电荷。我们知道如何缩小晶体管。但缩小电容器要困难得多。电容器越小，就越难可靠地存储电荷：电荷可能会泄漏、消失，或被相邻单元的干扰所改变。因此，如果你想让 DRAM 更高效，你需要采用各种越来越奇特的架构。

这正是所发生的。DRAM 需要变得更高效，以跟上处理器的改进步伐。所以现代 DRAM 制造是一个极其复杂和昂贵的过程。建造一座最先进的 DRAM 制造工厂（“晶圆厂”）需要约 150 到 200 亿美元；购买所有必要设备（如光刻工具和蚀刻机）还需要几十亿美元；然后你还需要几年时间生产不合格和有缺陷的内存芯片，直到良率开始具有竞争力。

这引出了 DRAM 制造商——“内存制造商”——的特殊经济学。

关于内存，除了它昂贵且难以制造之外，最重要的一点是它具有可替代性。处理器是定制的：你不能把 Intel 芯片换成 Apple 芯片。但内存芯片不是定制的。DRAM 芯片都遵循相同的行业标准，所以一家内存制造商的芯片可以插入与另一家芯片相同的设备中。换句话说，DRAM 是一种大宗商品。

而”资本密集型制造加上可替代性”是一个残酷的组合。因为内存可替代，这个行业具有强烈的周期性：整个 DRAM 行业的历史就是繁荣-萧条超级周期的历史。首先，来自某个领域的强劲需求——比如 1990 年代 Windows PC 的普及——推动价格飙升和每个玩家的投资浪潮；对无差异化商品的累积过度投资产生供过于求；然后供过于求导致价格崩溃。

因为生产如此昂贵，这些下行周期往往是生死存亡的：内存行业充满了不断的残骸。Intel 在 1970 年代初期主导内存游戏，但在 1980 年代退出，选择专注于处理器。德州仪器和 IBM 也曾是主要玩家，在 1990 年代退出。德国的 Qimonda 在 2009 年倒闭；日本的 Elpida，曾经世界第三大 DRAM 制造商，在 2012 年宣布破产。

数十年的倒闭和整合只留下了几家公司。1990 年代，全球大约有 20 家有意义的 DRAM 生产商；今天只有三家占全球产量 90% 以上。韩国有两家，SK 海力士和三星；美国有一家，美光。

这些内存制造商从行业的无情历史中学到了一个非常特殊的教训：永远让需求得不到完全满足。在资本密集型和周期性行业中生存的唯一方式是展示几乎超人般的资本纪律。需求现在可能上升，但总会下降。所以让价格飙升、让边际内存消费者被挤出，也好过扩大产能然后在需求不可避免地走软时面临毁灭。

而这，对智能手机客户来说是一个残酷的算计。

HBM 大淘金

前面我说内存是”可替代的”。这需要一个补充说明。内存在制造商之间是可替代的：三星的芯片可以插入与 SK 海力士芯片相同的设备中。但这并不意味着所有电脑以相同方式使用内存。我正在用来写这篇文章的 MacBook Pro 需要能跟上强大处理器同时运行多个程序的内存：所以它使用一种叫做 DDR（“双倍数据速率”）的标准，以较高电压运行并提供高带宽。我 iPhone 上的处理器不那么强大，所以在任何给定时刻需要更少的数据；但电压非常重要，因为分配给内存的每一毫瓦都从电池中消耗。所以智能手机使用 LPDDR（“低功耗双倍数据速率”），DDR 的一个变体，设计在更低电压下运行。而在运行 Claude 和 ChatGPT 的数据中心中，使用的是完全不同的标准：HBM（“高带宽内存”），这个我稍后再谈。

这三种都是以相同方式、从相同起始材料制造的。内存制造商接收叫做晶圆的薄硅片；在几个月内，他们在上面蚀刻数十亿个存储单元；然后把晶圆切割成单独的芯片并发货。

因此，内存制造商面临的关键问题是如何在 DDR、LPDDR 和 HBM 之间分配晶圆。一定比例的晶圆分配通过与主要购买者（如 Apple 或 Dell）的长期协议锁定；还有一些在现货市场出售，给那些希望灵活性或规模不够签长期协议的买家。所以每个季度，三星、SK 海力士和美光的晶圆分配团队会根据价格、合同和他们对未来需求方向的最佳猜测来决定如何在三个类别之间分配晶圆。

在行业历史的大部分时间里，这种分配很直接。在 2010 年代末，DDR、LPDDR 和 HBM 的利润率大致相似；内存制造商最感兴趣的是数量，晶圆分配基本上跟踪终端市场需求。手机是内存的最大单一市场，所以 LPDDR 获得了最多的晶圆。DDR 占了大部分剩余份额。而 HBM 是面向高性能计算客户的小众产品，只获得了很小的份额。

AI 的出现彻底改变了这一切。

训练和运行 AI 模型的计算强度极高。即使简单的查询也需要数十亿次矩阵乘法，按顺序和并行方式反复执行。AI 工作负载需要能够并行执行大量操作的计算机——这就是为什么像 Nvidia 的 GPU 和 Google 的 TPU 这样的专用硬件变得如此重要。但因为 GPU 和 TPU 同时执行如此多的计算，它们需要以相应的巨大速率获取数据。否则昂贵的硬件就会闲置。换句话说，需要的是专门设计用于同时向许多处理器传输大量数据、以极高速度运行的内存。

这正是 HBM 的设计目的。

HBM 的核心思想很简单。你取很多 DRAM 裸片，把它们堆叠在一起，用数千个微小的垂直通道连接它们，使许多数据路径可以并行运行，然后把整个堆栈放在 GPU 或 TPU 旁边。实际做到这一点非常困难。但如果成功了，你可以传输比 DDR 多一个数量级的数据。

HBM 的问题（除了生产困难之外）是它极其消耗晶圆。不仅仅是你在堆叠很多裸片。由于所有外围电路和所有垂直通道，一个千兆字节的 HBM 消耗的晶圆产能是一个千兆字节 DDR 或 LPDDR 的三倍以上。每生产一个千兆字节的 HBM，实际上就是三个千兆字节的普通内存没有被生产出来。

很长一段时间里，这并不重要，因为 HBM 需求很小。当 ChatGPT 在 2022 年 11 月发布时，内存制造商正处于需求低迷期，他们花了一些时间才意识到情况已经发生了变化。2023 年初，行业贸易媒体仍在观望，报道仅限于 “AI 聊天机器人可能有助于缩短 DRAM 市场低迷期” 这样的建议。

但 HBM 需求的增长远超内存制造商的预期。AI 使用量持续爆发；随着使用从聊天机器人转向更密集的模型——长时间运行的代理——很明显 HBM 的需求将比任何人最初预期的大得多。内存制造商措手不及。到 2024 年底，全面的 HBM 短缺已经出现；到 2025 年，HBM 利润率达到 70% 或更高，而 DDR 和 LPDDR 的利润率在 20% 到 30% 之间。

对内存制造商来说，理性的应对很明显：大量生产 HBM。于是他们重新分配了大量产能。2023 年，HBM 占内存制造商晶圆的 2%；2024 年 5%；2025 年 10%；到 2026 年底，预计将达到 20%，另有 3% 分配给面向 AI 服务器的高密度 DDR。短短三年内，HBM 从一个边缘产品类别变成了内存行业的核心。率先实现前沿 HBM 节点量产的 SK 海力士，仅 2024 年 HBM 收入就增长了四倍；到当年年底，HBM 占公司 DRAM 收入的 40% 以上，两年前这一比例约为 5%。

但即使这种重新分配也不够。需求继续超过供应，内存短缺仍然是 AI 建设的标志性特征之一。（这反过来也催生了各种变通方案，如量化或 DeepSeek 的多头潜在注意力。）内存争夺战如此激烈，以至于在 2025 年底，微软和谷歌等超大规模企业的高管据报道”几乎常驻韩国”，游说三星和 SK 海力士争取份额。超大规模企业超过 30% 的资本支出现在都花在 DRAM 上。

这对内存制造商来说是极好的消息。2025 年，它们共赚了 700 亿美元利润；2026 年预计将赚取两倍以上。三星、SK 海力士和美光现在是世界上最赚钱的公司之一。

但对普通 DRAM 的购买者来说，情况就没那么美好了。

AI 正在吞噬廉价智能手机……

回想一下我们前面说的关于内存制造商的资本纪律。SK 海力士、三星和美光靠拒绝——几乎是原则性地——供应足够的芯片来满足所有客户的需求来度过之前的 DRAM 周期：Elpida 和 Qimonda 的教训是闲置的晶圆厂是致命的，而未满足的需求则不是。

所以当内存制造商在 2024 年和 2025 年初看到 HBM 订单的上升潮时，他们采取了刻意保守的态度，拒绝扩大产能。直到 2025 年，随着内存价格开始史无前例的飙升，内存制造商才开始建设针对 HBM 的新晶圆厂，全部计划在 2027 或 2028 年开始投产。即使现在他们也小心翼翼地不过度扩大产能。迟至 2025 年 12 月，三星仍强调将”优先考虑长期盈利能力，而不是快速扩张产能”。

这意味着内存制造商满足 HBM 需求飙升的唯一方式是将晶圆从 DDR 和 LPDDR 转移出来。正如 Tom’s Hardware 在 2025 年底报道的：“在晶圆开工量持平和封装线锁定的情况下，每一片推入 HBM 的晶圆都在减少普通 DRAM 的产能。“到 2025 年底，SK 海力士将其晶圆产能的 30% 分配给了 HBM，几乎所有这些产能都是从 DDR 和 LPDDR 那里拿走的。与此同时，美光选择完全退出消费级 DRAM 市场。2025 年 12 月，美光停止了面向消费者的 Crucial 品牌，宣布将停止所有消费出货，将全部产能转向 AI 和企业。

因此，过去几年 DDR 和 LPDDR 的供应量急剧下降。价格相应飙升。2025 年第一季度到 2026 年第一季度之间，LPDDR4 标准的价格上涨了 250%；LPDDR5 价格上涨了 220%。在某些市场角落，涨幅更为剧烈：德国的 DDR5 价格在一年内上涨了 414%。

因此，内存已迅速成为消费电子产品中最昂贵的组件。廉价安卓手机物料清单中内存的占比已从大约 15% 飙升至高达 50%。

这意味着所有消费电子产品价格上涨。但对边际消费者——那些最无力承受涨价的人——打击尤为沉重。在内存的案例中，这意味着廉价智能手机的制造商和消费者。

长期以来，廉价智能手机公司——如传音、OPPO、vivo 和 Lava——遵循一个简单的模式。它们在现货市场购买上一代组件，廉价组装成安卓手机，然后以极低价格出售成品。廉价手机制造商利润率极薄，通常在低个位数百分比；但它们的销量巨大。例如传音在 2024 年出货了 1.05 亿部手机，相比苹果的 2.3 亿部。而在更便宜的市场，如非洲或南亚，这些公司占主导地位：传音仅在非洲就占据了 48% 的智能手机市场。

但当内存价格像现在这样飙升时，这种模式就崩溃了。低于 100 美元的智能手机面临着”永久性不经济”的风险。

这意味着廉价智能手机制造商被迫将内存成本转嫁给消费者：原来售价 50 美元的手机现在售价 120 美元或更高。而价格敏感的消费者的反应是干脆不买手机。2026 年初，传音宣布其 2025 年净利润下降了 54%，并将年出货目标削减 40%。其他中低端智能手机公司也面临同样的情况。OPPO 将出货目标削减了 20% 以上；同样处境的 vivo 削减了近 15%。2026 年第一季度，小米的年出货量同比下降了 19%。

这种重新定价对贫穷国家产生了严峻的影响。在印度，2026 年第一季度低于 100 美元的智能手机市场同比暴跌 59%：飙升的内存价格导致了印度智能手机市场的”强制高端化”。但在最贫穷的市场，这种高端化是不可能的。2025 年，非洲 81% 的智能手机出货量属于低于 200 美元的类别：随着智能手机价格飙升，许多非洲消费者将完全被挤出手机拥有权之外。

……也将吞噬昂贵的智能手机

这就是我们现在的处境。HBM 需求已经在挤压 DDR 和 LPDDR；这已经导致越来越多的消费者被挤出智能手机拥有权。

但没有理由认为这种趋势将仅限于最贫穷的消费者。DRAM 食物链上更高层的公司开始感受到内存价格上涨的痛苦；富裕世界的消费者感到自己被挤出电子产品市场只是时间问题。

我们已经看到了早期迹象。例如，三星的消费事业部发现自己无法从三星的内存事业部那里获得长期 LPDDR 协议；因此它不得不在 Galaxy S26 手机中搭载比预期更少的内存，并以更高的价格出售。这并没有太大帮助：三星高管警告说，公司将录得智能手机业务有史以来首次年度净亏损。（当然，这被其内存业务的巨额利润所抵消。）Dell 也出现了同样的重新定价，2025 年 12 月笔记本价格上涨了 15% 到 20%。

即使是电子产品世界的王者苹果，也开始感受到内存成本的冲击。苹果传统上在韩国内存制造商那里拥有显著的议价能力，通过谈判长期协议来平滑多年的价格；但现在内存制造商掌握了主动权。当苹果最新的长期协议在 2026 年 1 月到期时，内存制造商拒绝签署超过一个季度的协议。2 月，为了确保供应，苹果同意为 iPhone 使用的 LPDDR5X 内存向三星支付 100% 的溢价。

因此，过去六个月苹果面临的定价压力大幅增加。2025 年全年，为 iPhone 17 Pro 提供动力的 12GB LPDDR5X 芯片价格上涨了 230%；没有长期协议的保护，苹果将承受内存危机的全面冲击。为了应对，苹果在过去几个月宣布了一系列延迟。iPhone 18 标准版已推迟到 2027 年春季；新 Mac Studio 从夏季推迟到秋季。

没有迹象表明情况很快会好转。事实上，即使内存制造商停止将晶圆产能重新分配给 HBM，LPDDR 仍将面临巨大压力。2026 年最后一个季度，Nvidia 将推出其新的 Vera Rubin 平台，这是一个机架级 AI 超级计算机，将 Rubin GPU 与 Vera CPU 配对成一个为大规模 AI 训练和推理构建的单一系统。Vera CPU 将对 LPDDR 有巨大需求：到 2027 年，Vera Rubin 预计将消耗比苹果和三星加起来还多的 LPDDR。摩根大通的一份报告预测，到 2027 年内存可能占 iPhone 组件成本的 45%，而目前约为 10%。年内苹果将被迫做出决定：要么削减利润率以保卫市场份额，要么大幅提高产品价格。

综上所述：情况在好转之前会变得更糟。

我们已经到了贫穷世界的边际买家被挤出智能手机市场的地步。我们正在迅速接近富裕世界的买家也感受到同样压力的时刻。短期内，智能手机制造商可能通过显著减少每台设备的内存量（从而降低设备性能）来应对，或者干脆通过大幅涨价来摧毁需求。LPDDR 和 DDR 的利润率已经飙升，甚至可能高于 HBM 的利润率：但如此多的 HBM 产能已通过长期协议锁定，短期内不会有转向普通 DRAM 的可能。如果说有希望的话，那就来自中国。新兴的中国内存制造商——如长鑫存储，已经占据中国 LPDDR 市场 30% 以上的份额——正在迅速扩大规模，希望填补 DDR 和 LPDDR 的缺口。

但只要我们面临 AI 数据中心的内存短缺，DRAM 短缺的经济学就难以逃脱。超大规模企业愿意出价高过廉价手机制造商来获取 DRAM 的访问权：甚至长鑫也计划将约 20% 的产能转换为 HBM。

因此，未来几年消费电子产品的大重新定价将很难避免。我们已经身处一个贫穷世界消费者被挤出的世界；我们正在迅速接近一个富裕世界消费者也被挤出的世界。过去几十年的技术进步使计算民主化了；但那个时代已经结束。消费电子产品每年更快、更便宜、更强大的长期趋势已经逆转。首先感受到它、感受最深的是世界上的穷人：但不久之后我们也会感受到这种紧缩。

The memory shortage is causing a repricing of consumer electronics

The memory shortage is causing a repricing of consumer electronics (via) David Oks provides the clearest explanation I’ve seen yet of why consumer products that use memory are likely to get significantly more expensive over the next few years.

The short version is that memory manufacturers - of which there are just three remaining large companies - have a fixed capacity in terms of how many wafers they can process at any one time. This fixed wafer capacity is then split between DDR - used in desktops and servers, LPDDR - used in mobile phones and low-energy devices, and HBM - used with GPUs.

Until recently, HBM got just 2% of that wafer allocation. The enormous growth in AI data centers has pushed that up to an expected 20% by the end of 2026, and “a single gigabyte of HBM consumes more than three times the wafer capacity that a gigabyte of DDR or LPDDR does”.

Memory companies have learned from the extinction of their rivals that you should always under-provision rather than over-provision your fabricator capacity. The profit margins and demand for HBM (high-bandwidth memory) will constrain the production of consumer-device RAM for several years.

This is already being felt in the sub-$100 smartphone market, which is particularly important to markets like Africa and South Asia.

(The original title of the piece was “AI is killing the cheap smartphone” but I’m using the Hacker News rephrased title, which I think does more justice to the content.)

Posted 22nd May 2026 at 10:01 pm

AI is killing the cheap smartphone

One of the most remarkable things about the last few decades is how cheap computers have gotten.

In 1985, if you were a reasonably affluent American, the best computer that you could afford was the IBM PC AT. The PC AT would cost you about $6,000—$19,400 in 2026 dollars—and thus represented about a quarter of the median American’s annual income; and it ran on an Intel 80286 processor, capable of something like 900,000 instructions per second. Today, if you find yourself in a market stall in Nairobi or Lagos, you’ll be able to find a cheap smartphone—like the Tecno Spark Go, manufactured by China’s Transsion—for somewhere between $30 and $120. That phone will run on a processor capable of billions of calculations per second.

In other words: you can buy a computer thousands of times more powerful than the best consumer device from 40 years ago, for something like 0.3 percent of the price. No other good in history has experienced a decline in cost on that scale: poor people can now carry around in their pockets computers many orders of magnitude more powerful than what the richest slice of the world’s population could afford a few decades ago. And that great cheapening of consumer electronics has enabled a diffusion of computing power to the world’s population that is nothing short of miraculous. Hundreds of millions of the world’s poorest people are able to access the internet because of cheap smartphones like the Tecno Spark Go.

That era is now coming to an end.

In 2026, the International Data Corporation, which tracks the smartphone market, predicted that worldwide smartphone shipments would fall 13 percent, their largest single-year decline ever. The crash would be most intense in Africa and the Middle East, where smartphone shipments would fall by more than 20 percent, and would be concentrated in the cheapest end of the smartphone industry. This shock represented not a temporary blip but indeed “a structural reset of the entire market”: a huge share of the world’s population is getting priced out of smartphone ownership.

So the trend of the last few decades, of consumer electronics getting better and cheaper every year, faces a sharp reversal: the poor world is now entering a smartphone crisis.

This is happening for a simple reason.

Smartphones, like other computers, use memory: and the global supply of memory is remarkably inelastic, because memory is really hard to produce. For a long time, most memory went to smartphones and laptops; but in the last few years, AI has emerged as an enormous and hugely profitable consumer of memory. This has resulted in a huge reallocation of memory away from consumer electronics and toward AI. The inevitable result is that smartphones are much more expensive to make now than they were a few years ago. In the short term, this means that the cheap smartphone, which spread computing and internet access to the poorest parts of the world, is dead.

But at the rate that things are going, it seems like the poor world will only be the first to get hit. If AI consumption continues to grow at current rates—or if it accelerates, as seems manifestly possible—it won’t be long before the smartphone crisis spreads to the rich world. Consumer electronics are about to get much more expensive.

It’s just the memory that you have

Smartphones are computers. They’re very small computers, and also have things like touchscreens and radios. But in terms of their internal architecture, smartphones are basically the same as what you’d get with a laptop or a server. They have a processor that performs calculations and runs the logic that makes the device do what you tell it to do. They have memory that holds the data that the processor is currently working on. They have storage that retains data when the device is turned off. And they have a circuit board that connects all these different things together.

The big story of computing over the last few decades is the processor. You can think of the processor as a huge array of transistors—tiny switches that flip ON and OFF to perform logical operations. We’ve done a good job—a very good job—of figuring out ways to make transistors smaller and more efficient, which means that processors have improved at an exponential rate over the last few decades. This is Moore’s Law.

But processors can only process the data that they have access to: and the data that they have access to is what they get from memory: specifically, in modern computers, from DRAM, “dynamic random access memory.” Here the story is very different. DRAM has gotten better; but it hasn’t gotten better at anything like the rate that processors have: in the 1980s and ’90s, processor speeds improved at 60 percent per year, while DRAM speeds improved at just 7 percent per year.

And that means that for the last few decades, the main bottleneck for computer performance has been memory. Computer scientists call this the “memory wall.” A huge amount of the work in computer architecture over the last few decades has been finding various ways around the mismatch between processors and DRAM.

So why hasn’t DRAM improved as fast as processors?

Simply put: it’s just a really hard problem. Just like a processor is a huge array of transistors, a memory chip is basically a huge array of memory cells: and each memory cell has both a transistor and a storage unit called the capacitor, which holds the electrical charge corresponding to an individual bit of data. We know how to shrink the transistor. But shrinking the capacitor is a lot harder. As the capacitor gets smaller, it becomes harder for it to reliably store its electrical charge: the charge might leak out, or disappear, or be altered by interference from its neighbors. So if you want to make DRAM more efficient, you need to resort to all sorts of increasingly exotic architectures.

And that’s exactly what’s happened. DRAM needs to get more efficient, in order to keep up with the improvements in processors. So modern DRAM manufacturing is an extraordinarily complex and expensive process. Building a single state-of-the-art DRAM fabrication facility, a “fab,” will cost you about $15 to $20 billion; acquiring all the necessary equipment, like lithography tools and etching machines, will cost you another few billion; and then it’ll take you a few years of producing substandard and defective memory chips before your yields start to look competitive.

Which leads us to the peculiar economics of the companies that manufacture DRAM: the “memory makers.”

The most important thing to know about memory, beyond the fact that it’s expensive and difficult to make, is that it’s fungible. Processors are bespoke: you can’t swap an Intel chip for an Apple chip. But memory chips are not bespoke. DRAM chips all conform to the same industry-wide standards, so a chip from one memory maker will slot into the same device as a chip from any other. DRAM, in other words, is a commodity.

And that combination—capital-intensive manufacturing plus fungibility—is a punishing combination. Because memory is fungible, the industry is intensely cyclical: the entire history of the DRAM industry is a history of boom-and-bust supercycles. First, strong demand from one sector or another—like Windows PC adoption in the 1990s—drives surging prices and a wave of investment from every player; cumulative overinvestment in an undifferentiated good produces oversupply; and then oversupply leads to collapsing prices.

And because production is so expensive, those down-cycles turn out to be existential: the memory industry is marked by constant wreckage. Intel dominated the memory game in the early 1970s but left in the 1980s, opting to focus on processors. Texas Instruments and IBM, also once major players, left in the 1990s. Germany’s Qimonda collapsed in 2009; Japan’s Elpida, once the world’s third-largest DRAM manufacturer, declared bankruptcy in 2012.

And decades of collapse and consolidation left only a few players standing. In the 1990s, there were perhaps 20 meaningful producers of DRAM around the world; today there are three that account for more than 90 percent of global production. South Korea has two, SK Hynix and Samsung; and the United States has one, Micron.

And these memory makers have learned a very particular lesson from the unforgiving history of their industry: always leave demand unmet. The only way to survive in a capital-intensive and cyclical industry was to demonstrate an almost superhuman degree of capital discipline. Demand might rise now, but it would always fall. So it was better to let prices spike and see the marginal memory consumer priced out than to expand production and risk destruction when demand inevitably softened.

And this, it turns out, is a brutal calculus for smartphone customers.

The great HBM rush

Earlier, I said that memory is “fungible.” That requires a qualification. Memory is fungible between manufacturers: a chip from Samsung will slot into the same device as a chip from SK Hynix. But that doesn’t mean all computers use memory in the same way. The MacBook Pro on which I’m writing this piece needs memory that can keep up with a powerful processor running many programs at once: so it uses a standard called DDR, “double data rate,” which runs at a reasonably high voltage and offers high bandwidth. The processor on my iPhone is less powerful, so it needs less data at any given moment; but voltage matters enormously, since every milliwatt allocated to memory is drained from the battery. So smartphones use LPDDR, “low-power double data rate,” a variant of DDR engineered to operate at lower voltages. And in the data centers where Claude and ChatGPT are run, an entirely different standard is used: HBM, “high-bandwidth memory,” which I’ll get back to shortly.

All three of these are made the same way, from the same starting material. Memory makers receive thin silicon discs called wafers; over several months, they etch billions of memory cells onto them; and then they cut wafers into individual chips and ship them.

The key question facing a memory maker, then, is how to allocate its wafers between DDR, LPDDR, and HBM. Some percentage of wafer allocation is locked in through long-term agreements with major purchasers, like Apple or Dell; and some is sold on the spot market, to buyers who want flexibility or lack the scale for long-term agreement. So every quarter, the wafer allocation teams at Samsung, SK Hynix, and Micron decide—based on prices, contracts, and their best guesses about the direction of future demand—how to distribute their wafers across the three categories.

For most of the history of the industry, this allocation was straightforward. In the late 2010s, margins were broadly similar for DDR, LPDDR, and HBM; what interested the memory makers most was volume, and wafer allocation basically tracked end-market demand. Phones were the single largest market for memory, so LPDDR got most of the wafers. DDR took most of the rest. And HBM was a niche product for high-performance computing customers, so it got only a small sliver.

That changed dramatically with AI.

Training and running AI models is extraordinarily computationally intensive. Even simple queries require billions of matrix multiplications, done in sequence and in parallel, over and over again. AI workloads need computers that can do enormous numbers of operations in parallel — which is why specialized hardware like Nvidia’s GPUs and Google’s TPUs has become so important. But because GPUs and TPUs perform so many calculations at once, they need to be fed data at a correspondingly enormous rate. Otherwise the expensive hardware sits idle. What was needed, in other words, was memory engineered to deliver vast quantities of data to many processors at once, at extraordinarily high speeds.

That is exactly what HBM was designed to do.

The core idea of HBM is simple. You take lots of DRAM dies, stack them on top of each other, connect them with thousands of tiny vertical channels so that many data paths can operate in parallel, and then place the whole stack right next to the GPU or the TPU. Actually doing this is very hard. But if it works, you can transfer an order of magnitude more data than you could with DDR.

The catch with HBM, beyond the difficulty of producing it, is that it is enormously wafer-intensive. It is not just that you are stacking a lot of dies together. Because of all the peripheral circuits and all the vertical channels, a single gigabyte of HBM consumes more than three times the wafer capacity that a gigabyte of DDR or LPDDR does. Every gigabyte of HBM produced is, in effect, three gigabytes of commodity memory not produced.

For a long time, this didn’t really matter, because HBM demand was small. When ChatGPT was released in November 2022, the memory makers were in the middle of a demand slump, and it took them some time to register that something had shifted. In early 2023, the industry trade press was still hedging, with reporting limited to suggestions that “AI chatbots may help shorten the DRAM market slump.”

But HBM demand increased much faster than the memory makers expected. AI usage continued to explode; and as usage shifted to more intensive models—from chatbots to long-running agents—it became clear that demand for HBM would be much, much bigger than anyone had originally anticipated. The memory makers were caught flat-footed. By the end of 2024, a full HBM shortage had set in; by 2025, HBM margins were running at 70 percent or higher, while margins for DDR and LPDDR sat between 20 and 30 percent.

The rational response, for the memory makers, was obvious: pump out more HBM. And so they reallocated a massive amount of capacity. In 2023, HBM accounted for 2 percent of the memory makers’ wafers; in 2024, 5 percent; in 2025, 10 percent; and by the end of 2026, the share is expected to hit 20 percent, with an additional 3 percent allocated toward high-density DDR for AI servers. And so in the space of three years HBM went from a peripheral product category to the very core of the memory industry. SK Hynix, which had been first to reach volume production of the leading-edge HBM node, saw its HBM revenue increase fourfold in 2024 alone; by the end of that year, HBM accounted for more than 40 percent of the company’s DRAM revenue, up from roughly 5 percent two years earlier.

But even this reallocation hasn’t been enough. Demand continues to outrun supply, and the memory shortage remains one of the defining features of the AI buildout. (It has, in turn, produced all sorts of workarounds, like quantization or DeepSeek’s multi-head latent attention.) So heated has the race for memory become that at the end of 2025, executives from hyperscalers like Microsoft and Google were reportedly “practically taking up permanent residence in Korea” lobbying Samsung and SK Hynix for allocation. More than 30 percent of hyperscaler capital expenditure is now going to DRAM alone.

This has been fantastic news for the memory makers. In 2025, they earned a collective $70 billion in profit; in 2026 they’re expected to earn more than double that amount. Samsung, SK Hynix, and Micron are now among the most profitable companies in the world.

But things are not so happy for the purchasers of commodity DRAM.

AI is eating the cheap smartphone…

Recall what we said earlier about the capital discipline of the memory makers. SK Hynix, Samsung, and Micron survived previous DRAM cycles by refusing, almost as a matter of principle, to supply enough chips to meet all their customers’ demand: the lesson of Elpida and Qimonda was that idle fabs were fatal, while unmet demand was not.

So when the memory makers looked at the rising tide of HBM orders in 2024 and early 2025, they took a deliberately conservative approach and refused to expand production. It was only in 2025, as memory prices began an unprecedented surge, that the memory makers started to build new fabs targeted at HBM, all slated to start producing chips in 2027 or 2028. Even now they’ve been careful not to expand capacity too drastically. As late as December 2025, Samsung stressed that it would “prioritize long-term profitability over rapid capacity expansion.”

And that meant that the only way that the memory makers could meet surging HBM demand was to reallocate wafers away from DDR and LPDDR. As Tom’s Hardware reported at the end of 2025: “with wafer starts flat and packaging lines locked, every wafer pushed into HBM removes capacity from commodity DRAM.” By the end of 2025, SK Hynix was allocating 30 percent of its wafer capacity to HBM, with almost all of that capacity having been taken away from DDR and LPDDR. Micron, meanwhile, opted to simply exit the commodity DRAM market entirely. In December 2025, Micron discontinued its consumer-oriented Crucial brand and announced that it would cease all consumer shipments, redirecting all capacity to AI and enterprise.

And so the supply of memory available for DDR and LPDDR has cratered over the last few years. Accordingly prices have spiked. Between the first quarter of 2025 and the first quarter of 2026, prices for the LPDDR4 standard increased 250 percent; LPDDR5 prices increased by 220 percent. In some corners of the market, the spike was more severe: DDR5 prices in Germany increased 414 percent over the course of a year.

And so memory has rapidly become the most expensive component going into consumer electronics. The memory share of the bill of materials on a budget Android phone has gone from around 15 percent to as much as 50 percent.

This means higher prices for all consumer electronics. But it’s particularly damaging for the marginal consumers, those least able to pay higher prices. In the case of memory, that means the makers and consumers of budget smartphones.

For a long time, the budget smartphone companies—like Transsion, Oppo, Vivo, and Lava—followed a simple model. They would buy last-generation components on the spot market, assemble them cheaply as Android handsets, and then sell the finished product for an extremely low price. The budget phone makers had extremely thin margins, usually somewhere in the low single digits; but they sold phones at huge volumes. Transsion, for example, shipped 105 million phones in 2024, compared to Apple’s 230 million. And in cheaper markets, like Africa or South Asia, these companies were dominant: Transsion alone held 48 percent of the African smartphone market.

But that model breaks when memory prices spike as much as they’re now spiking. The sub-$100 smartphone risks becoming “permanently uneconomical” as a product.

And that means that the budget smartphone makers have been forced to pass memory costs onto consumers: smartphones that sold for $50 are now selling for $120 or more. And price-sensitive consumers have responded by simply not buying phones. In the early months of 2026, Transsion announced that its net profit for 2025 had fallen by 54 percent, and that it would cut its annual shipment target by 40 percent. We’re seeing the same with other low-market and mid-market smartphone companies. Oppo slashed its shipment target by more than 20 percent; Vivo, in the same position, cut by nearly 15 percent. In the first quarter of 2026, Xiaomi’s annual shipments fell 19 percent year over year.

And that repricing has had a stark effect in poor countries. In India, the sub-$100 smartphone market collapsed 59 percent year-on-year in the first quarter of 2026: surging memory prices resulted in a “forced premiumization” of the Indian smartphone market. But in the poorest markets, such premiumization isn’t a possibility. In 2025, 81 percent of smartphone shipments in Africa were in the sub-$200 category: as smartphone prices surge, many African consumers will simply be priced out of phone ownership entirely.

…and will eat the expensive one too

That’s where we are now. HBM demand is already crowding out DDR and LPDDR; and this is already resulting in a large and growing share of consumers being priced out of smartphone ownership.

But there’s no reason to think that this trend will stay confined to the poorest consumers. Companies higher up on the DRAM food chain are starting to feel the pain of higher memory prices; it won’t be long before the consumers of the rich world feel themselves being priced out of the electronics market.

We’re already seeing early signs of this. Samsung’s consumer division, for example, found itself unable to secure a long-term LPDDR agreement with Samsung’s memory division; it thus had to ship its Galaxy S26 phone with less memory than expected and at higher prices. This didn’t do much to help: Samsung executives warned that the company would record its first-ever annual net loss on smartphones. (More than balanced out, of course, by its enormous profits on memory.) We’re seeing the same repricing with Dell, which hiked laptop prices by 15 to 20 percent in December 2025.

Even Apple, the king of the electronics world, is starting to feel the bite of memory costs. Apple traditionally enjoyed significant bargaining power with the Korean memory makers, negotiating long-term agreements to smooth prices out for years at a time; but now the memory makers are the ones with the leverage. When Apple’s latest long-term agreement expired in January 2026, the memory makers refused anything lasting longer than a quarter at a time. In February, in order to secure supply at all, Apple agreed to pay Samsung a 100 percent premium on the LPDDR5X memory destined for the iPhone.

And so the pricing pressure on Apple has grown massively over the last six months. Over the course of 2025, the 12GB LPDDR5X chips that power the iPhone 17 Pro had increased in price by 230 percent; without its long-term agreements to protect it, Apple would face the full brunt of the memory crunch. In order to cope, Apple has announced a wave of delays over the last few months. The iPhone 18 standard model has been delayed to spring 2027; the new Mac Studio was delayed from summer to fall.

There’s no sign that things are going to get better anytime soon. Indeed, even if the memory makers stop reallocating wafer capacity to HBM, there will still be enormous pressure on LPDDR. In the last quarter of 2026, Nvidia will be launching its new Vera Rubin platform, a rack-scale AI supercomputer that pairs Rubin GPUs with Vera CPUs into a single system built for large-scale AI training and inference. The Vera CPUs will be enormously hungry for LPDDR: by 2027, Vera Rubin is projected to consume more LPDDR than Apple and Samsung combined. A report from JPMorgan projected that memory could account for 45 percent of the iPhone’s component cost by 2027, against roughly 10 percent today. Within the year Apple will be forced to make a decision: either it will cut into its margins to defend market share, or dramatically increase prices on its products.

All of which is to say: things are going to get a lot worse before they get better.

We’re already at the point where marginal buyers in the poor world are getting priced out of the smartphone market. We’re rapidly approaching the point where buyers in the rich world feel the same thing. In the short term, smartphone makers might be able to cope by significantly reducing the amount of memory per device, and thus degrading equipment performance, or simply by destroying demand through large price hikes. Margins for LPDDR and DDR have soared, and may even be higher than HBM margins: but so much HBM capacity has been secured via long-term agreements that there is no pivot to commodity DRAM coming anytime soon. If there’s any hope of relief, it’s coming from China. Upstart Chinese memory makers—like ChangXin Memory Technologies, which already commands more than 30 percent of China’s LPDDR market—are scaling up rapidly and hoping to fill the gap for DDR and LPDDR.

But as long as we are facing a shortage of memory for AI data centers, the economics of the DRAM shortage will be difficult to escape. Hyperscalers are simply willing to outbid budget phone manufacturers for access to DRAM: even ChangXin is planning to convert about 20 percent of its capacity to HBM.

So it will be hard to avoid a great repricing of consumer electronics in the coming years. We’re already in a world where poor-world consumers are getting priced out; we’re rapidly approaching a world in which rich-world consumers are getting priced out as well. The last few decades of technological progress democratized computing; but that era is now over. The long trend of consumer electronics getting faster, cheaper, and more powerful every year has reversed. The people who will feel it first, and feel it worst, are the world’s poor: but it won’t be too long before we, too, feel the crunch.

内存短缺正在引发消费电子产品的重新定价

结构化总结：内存短缺正在引发消费电子产品的重新定价

一句话摘要

核心论点

关键数据

因果链

章节要点

1. 内存墙

2. 寡头垄断与资本纪律

3. HBM 大淘金

4. 廉价智能手机消亡

5. 危机向高端蔓延

6. 唯一希望：中国长鑫存储

推荐理由（Simon Willison）

延伸思考

内存短缺正在引发消费电子产品的重新定价

内存短缺正在引发消费电子产品的重新定价

AI 正在杀死廉价智能手机

关键就在于你拥有的内存

HBM 大淘金

AI 正在吞噬廉价智能手机……

……也将吞噬昂贵的智能手机

The memory shortage is causing a repricing of consumer electronics

The memory shortage is causing a repricing of consumer electronics

AI is killing the cheap smartphone

AI is killing the cheap smartphone

It’s just the memory that you have

The great HBM rush

AI is eating the cheap smartphone…

…and will eat the expensive one too