Nvidia's GB300 NVL72 Achieves 20x Efficiency Over H200 for AI Workloads
Nvidia's GB300 NVL72 system delivers 61,400 concurrent agents per megawatt, marking a significant efficiency advancement for data centers. This 20x improvement over the H200 enhances the economic viability of AI inference amidst rising energy costs.

The Nvidia GB300 NVL72 system can handle 61,400 concurrent AI agents per megawatt, representing a 20-fold increase in efficiency compared to the previous H200 model. Built on the Blackwell Ultra architecture, it incorporates 72 GPUs and 36 CPUs within a liquid-cooled rack, featuring approximately 20-21 TB of HBM3e memory and 130 TB/s NVLink bandwidth.
Performance metrics indicate a 50x output increase over older Hopper systems and enhanced throughput. Key commitments include large-scale deployments by Microsoft Azure for OpenAI workloads starting in late 2025. This efficiency gain presents a compelling ROI proposition for data center operators, and systems that optimize energy consumption may attract more investment as ESG considerations rise.




Comments