Huawei CloudMatrix 384 supernode vs. NVIDIA GB200 NVL72.

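The following compares the Huawei CloudMatrix 384 supernode (Supernode 384) with the NVIDIA GB200 NVL72 in terms of architecture, performance, interconnect, memory, power consumption, and energy efficiency: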

1. Comparison table: Huawei CloudMatrix 384 vs. NVIDIA GB200 NVL72

| Feature | Huawei CloudMatrix 384 (Supernode 384) | NVIDIA GB200 NVL72 |
|---------|----------------------------------------|--------------------|
| Compute chips | 384 Ascend 910C NPUs (spread across 12 compute racks + 4 bus racks) (sources: [TechWire Asia][1], [Reuters][2]) | 72 Blackwell GPUs + 36 Grace CPUs, packaged as 36 GB200 superchips (sources: [NVIDIA][3], [Wikipedia][4]) |
| Peak compute (dense BF16) | About 300 PFLOPS, roughly 1.7× the NVIDIA figure (sources: [techblog.comsoc.org][5], [Tom's Hardware][6], [TechRadar][7]) | About 180 PFLOPS (sources: [techblog.comsoc.org][5], [TechRadar][7]) |
| Memory capacity | About 48 TB of high-bandwidth memory (sources: [TechWire Asia][1], [Financial Times][8]) | About 13 to 13.5 TB of HBM3e (sources: [NVIDIA][3], [Wikipedia][4]) |
| Memory bandwidth | About 2.1× that of the GB200 NVL72 (sources: [techblog.comsoc.org][5], [Tom's Hardware][6]) | Up to 576 TB/s aggregate HBM bandwidth (sources: [NVIDIA][3], [Together AI][9]) |
| Interconnect | Optical mesh fabric (6,912 silicon-photonics LPO modules at 800 Gb/s) (sources: [QSFPTEK][10], [Tom's Hardware][6], [Reuters][2]) | Fifth-generation NVLink + NVLink Switch fabric; high-speed electrical interconnect (sources: [DataCrunch][11], [SemiAnalysis][12], [Wikipedia][13], [NVIDIA][3]) |
| System power | About 559 kW (sources: [Capacity Media][14], [Financial Times][8], [Tom's Hardware][6]) | About 145 kW (or about 132 kW per rack) (sources: [Tom's Hardware][6], [HPE Store][15]) |
| Energy efficiency (performance per watt) | About 2.3× lower than the GB200 NVL72 (sources: [Tom's Hardware][6], [SemiAnalysis][16]) | About 2.3× higher (sources: [Tom's Hardware][6], [SemiAnalysis][16]) |
| Notable characteristics | 3.6× the memory capacity, 2.1× the memory bandwidth, optical interconnect, higher power draw (sources: [TechRadar][7], [Financial Times][8], [Tom's Hardware][6], [Blockchain Council][17]) | Ultra-dense NVLink Switch fabric, efficient liquid-cooled design (sources: [HPE Store][15], [primeline-solutions.com][18], [SemiAnalysis][12]) |
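The headline ratios in the table can be reproduced from its own rounded figures. The following is a minimal Python sketch, not something taken from the cited sources: the dictionary keys and function names are illustrative assumptions, and the inputs are the approximate values quoted in the table (300 vs. 180 PFLOPS, 48 vs. 13.5 TB, 559 vs. 145 kW).

```python
# Rounded system-level figures taken from the comparison table.
# Key names are illustrative; they do not come from any vendor API.
SYSTEMS = {
    "CloudMatrix 384": {"pflops_bf16": 300.0, "hbm_tb": 48.0, "power_kw": 559.0},
    "GB200 NVL72": {"pflops_bf16": 180.0, "hbm_tb": 13.5, "power_kw": 145.0},
}


def ratio(metric: str) -> float:
    """CloudMatrix 384 value divided by the GB200 NVL72 value."""
    return SYSTEMS["CloudMatrix 384"][metric] / SYSTEMS["GB200 NVL72"][metric]


def pflops_per_kw(name: str) -> float:
    """Dense BF16 PFLOPS delivered per kW of system power."""
    system = SYSTEMS[name]
    return system["pflops_bf16"] / system["power_kw"]


def efficiency_gap() -> float:
    """How many times more PFLOPS per kW the GB200 NVL72 delivers."""
    return pflops_per_kw("GB200 NVL72") / pflops_per_kw("CloudMatrix 384")


if __name__ == "__main__":
    for metric in ("pflops_bf16", "hbm_tb", "power_kw"):
        print(f"{metric}: {ratio(metric):.2f}x")
    print(f"perf/W gap (NVIDIA over Huawei): {efficiency_gap():.2f}x")
```

With these inputs the compute ratio comes out near 1.7×, memory capacity near 3.6×, power near 3.9×, and the performance-per-watt gap near 2.3× in NVIDIA's favor, which matches the table.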

2. Key takeaways

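The Huawei CloudMatrix 384 leads in raw compute and memory through sheer scale: *384 Ascend NPUs against 72 Blackwell GPUs in the NVIDIA system*. It delivers close to 300 PFLOPS along with markedly higher memory capacity and bandwidth. In exchange, it draws far more power (about 559 kW vs. about 145 kW) and is less energy efficient. Its all-optical interconnect achieves a large-scale mesh fabric at the cost of added complexity and energy consumption (sources: [Tom's Hardware][6], [TechRadar][7], [Financial Times][8], [QSFPTEK][10]).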
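To make the scale-over-per-chip trade-off concrete, the system totals can be divided by accelerator count. This is a rough sketch under stated assumptions: the `System` class and its field names are hypothetical, the totals are the rounded figures quoted in this comparison rather than vendor per-chip specifications, and system power includes CPUs, switches, and optics, so the per-accelerator power is an all-in figure, not a chip TDP.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class System:
    """Rounded system totals as quoted in this comparison (illustrative)."""
    name: str
    accelerators: int
    pflops_bf16: float
    hbm_tb: float
    power_kw: float

    def per_accelerator(self) -> dict[str, float]:
        """Divide each system total by the number of accelerators."""
        n = self.accelerators
        return {
            "pflops_bf16": self.pflops_bf16 / n,
            "hbm_gb": self.hbm_tb * 1000 / n,  # decimal TB to GB
            "power_kw": self.power_kw / n,     # all-in system power share
        }


HUAWEI = System("CloudMatrix 384", 384, 300.0, 48.0, 559.0)
NVIDIA = System("GB200 NVL72", 72, 180.0, 13.5, 145.0)

if __name__ == "__main__":
    for system in (HUAWEI, NVIDIA):
        per_chip = system.per_accelerator()
        print(
            f"{system.name}: "
            f"{per_chip['pflops_bf16']:.2f} PFLOPS, "
            f"{per_chip['hbm_gb']:.1f} GB HBM, "
            f"{per_chip['power_kw']:.2f} kW per accelerator"
        )
```

On these rounded totals, each Blackwell GPU delivers about 2.5 PFLOPS against roughly 0.78 PFLOPS per Ascend 910C. That is about a 3.2× per-chip gap, which Huawei offsets by deploying roughly 5.3× as many accelerators.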

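The NVIDIA GB200 NVL72 focuses on energy efficiency and tightly coupled electrical interconnect, built on fifth-generation NVLink and the NVLink Switch fabric, and delivers strong performance in a dense, power-efficient package. Its design aims to minimize latency and maximize coherence, but it trails the Huawei system in memory and peak throughput (sources: [DataCrunch][11], [SemiAnalysis][12], [NVIDIA][3], [primeline-solutions.com][18], [HPE Store][15]).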

3. Final summary

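The Huawei CloudMatrix 384 uses an optical supernode design that emphasizes extreme scale and density. It suits deployments where power is not a constraint and NVIDIA systems cannot be obtained.

The NVIDIA GB200 NVL72 offers superior energy efficiency and a more elegant system design. It suits hyperscale AI facilities that prioritize throughput per watt and a streamlined electrical interconnect.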

[1]: https://techwireasia.com/2025/05/huawei-supernode-384-sanctions-nvidia-ai-computing/?utm_source=chatgpt.com "Supernode 384: How Huawei turned sanctions into a threat to Nvidia"
[2]: https://www.reuters.com/world/china/huawei-shows-off-ai-computing-system-rival-nvidias-top-product-2025-07-26/?utm_source=chatgpt.com "Huawei shows off AI computing system to rival Nvidia's top product"
[3]: https://www.nvidia.com/en-us/data-center/gb200-nvl72/?utm_source=chatgpt.com "GB200 NVL72 | NVIDIA"
[4]: https://en.wikipedia.org/wiki/Nvidia_DGX?utm_source=chatgpt.com "Nvidia DGX"
[5]: https://techblog.comsoc.org/2025/07/27/huawei-launches-cloudmatrix-384-ai-system-said-to-compete-with-nvidias-most-advanced-ai-system/?utm_source=chatgpt.com "Huawei launches CloudMatrix 384 AI system said to compete with Nvidia's most advanced AI system"
[6]: https://www.tomshardware.com/tech-industry/artificial-intelligence/huaweis-new-ai-cloudmatrix-cluster-beats-nvidias-gb200-by-brute-force-uses-4x-the-power?utm_source=chatgpt.com "Huawei's new AI CloudMatrix cluster beats Nvidia's GB200 by brute force, uses 4x the power"
[7]: https://www.techradar.com/pro/no-nvidia-no-problem-huawei-debuts-ai-system-thats-apparently-faster-than-the-market-leader-the-gb200-nvl72?utm_source=chatgpt.com "No Nvidia, no problem? Huawei debuts AI system that's apparently faster than the market leader, the GB200 NVL72"
[8]: https://www.ft.com/content/cac568a2-5fd1-455c-b985-f3a8ce31c097?utm_source=chatgpt.com "Huawei delivers advanced AI chip cluster to Chinese customers cut off from Nvidia"
[9]: https://www.together.ai/nvidia-gb200-nvl72?utm_source=chatgpt.com "NVIDIA GB200 NVL72 cluster"
[10]: https://www.qsfptek.com/en/qt-news/400g-osfp-siph-lpos-in-huawei-ai-cloudmatrix384-super-node.html?utm_source=chatgpt.com "400G OSFP: silicon-photonics LPO modules in the Huawei CloudMatrix 384 supernode"
[11]: https://datacrunch.io/blog/nvidia-gb200-nvl72-for-ai-training-and-inference?utm_source=chatgpt.com "NVIDIA GB200 NVL72 for AI training and inference"
[12]: https://semianalysis.com/2024/04/10/nvidia-blackwell-perf-tco-analysis/?utm_source=chatgpt.com "Nvidia Blackwell performance and TCO analysis"
[13]: https://de.wikipedia.org/wiki/Blackwell_%28Grafikprozessor%29?utm_source=chatgpt.com "Blackwell (graphics processor)"
[14]: https://www.capacitymedia.com/article-huawei-unveils-cloudmatrix-384?utm_source=chatgpt.com "Huawei unveils CloudMatrix 384 to challenge Nvidia's AI dominance"
[15]: https://buy.hpe.com/us/en/Compute/Rack-Scale-System/Nvidia-NVL-System/Nvidia-NVL-System/NVIDIA-GB200-NVL72-by-HPE/p/1014890104?utm_source=chatgpt.com "NVIDIA GB200 NVL72 by HPE"
[16]: https://semianalysis.com/2025/04/16/huawei-ai-cloudmatrix-384-chinas-answer-to-nvidia-gb200-nvl72/?utm_source=chatgpt.com "Huawei AI CloudMatrix 384: China's answer to Nvidia GB200 NVL72"
[17]: https://www.blockchain-council.org/ai/huawei-launches-cloudmatrix-384-ai-system/?utm_source=chatgpt.com "Huawei launches CloudMatrix 384 AI system"
[18]: https://www.primeline-solutions.com/media/categories/server/nach-gpu/nvidia-hgx-h200/nvidia-blackwell-b200-datasheet.pdf?utm_source=chatgpt.com "NVIDIA Blackwell B200 technical white paper"

---

(Note: all figures are drawn from the original English-language reports; actual performance may vary with test environment and configuration.)