NVIDIA GB300 NVL72: 50x Faster AI with Blackwell Ultra GPUs

NVIDIA GB300 NVL72 is a test-time scaling inference platform with 36 Arm-based NVIDIA Grace CPUs and 72 NVIDIA Blackwell Ultra GPUs

NVIDIA Hopper platform, AI factories with GB300 NVL72, Quantum-X800 InfiniBand or Spectrum-X Ethernet, and ConnectX-8 SuperNICS infer reasoning models 50x more

Compared to NVIDIA Blackwell GPUs, NVIDIA Blackwell Ultra's Tensor Cores have 1.5 times more AI computation FLOPS and 2 times the attention-layer acceleration

NVIDIA Blackwell Ultra GPUs provide 1.5x bigger HBM3e memory, increasing AI reasoning throughput for the longest context lengths

The NVIDIA ConnectX-8 SuperNIC's input/output (IO) module houses two ConnectX-8 devices, giving each GPU in the NVIDIA GB300 NVL72 800 Gb/s of network access

It offers twice the energy efficiency of the top server processors of today along with exceptional performance and memory bandwidth

AI reasoning models can achieve faster performance to the fifth-generation NVIDIA NVLink scale-up interconnect

A massive GPU designed for the era of artificial intelligence reasoning is created by combining 18 superchips using NVIDIA NVLink Switch technology and NVIDIA BlueField-3 DPUs