NVIDIA GB300 NVL72: 50x Faster AI with Blackwell Ultra GPUs
NVIDIA GB300 NVL72 is a test-time scaling inference platform with 36 Arm-based NVIDIA Grace CPUs and 72 NVIDIA Blackwell Ultra GPUs
NVIDIA Hopper platform, AI factories with GB300 NVL72, Quantum-X800 InfiniBand or Spectrum-X Ethernet, and ConnectX-8 SuperNICS infer reasoning models 50x more
Compared to NVIDIA Blackwell GPUs, NVIDIA Blackwell Ultra's Tensor Cores have 1.5 times more AI computation FLOPS and 2 times the attention-layer acceleration
NVIDIA Blackwell Ultra GPUs provide 1.5x bigger HBM3e memory, increasing AI reasoning throughput for the longest context lengths
The NVIDIA ConnectX-8 SuperNIC's input/output (IO) module houses two ConnectX-8 devices, giving each GPU in the NVIDIA GB300 NVL72 800 Gb/s of network access
It offers twice the energy efficiency of the top server processors of today along with exceptional performance and memory bandwidth
AI reasoning models can achieve faster performance to the fifth-generation NVIDIA NVLink scale-up interconnect
A massive GPU designed for the era of artificial intelligence reasoning is created by combining 18 superchips using NVIDIA NVLink Switch technology and NVIDIA BlueField-3 DPUs