Generative AI SuperCluster

End-to-End AI Data Center Solutions

In the era of AI, a unit of compute is no longer measured by the number of servers alone. Today's AI infrastructure is built from interconnected GPUs, CPUs, memory, and storage, pooled across multiple nodes and racks. It requires high-speed, low-latency network fabrics, along with carefully designed cooling and power delivery, to sustain optimal performance and efficiency in each data center environment. Supermicro’s SuperCluster provides end-to-end AI data center solutions for rapidly evolving Generative AI and Large Language Model (LLM) workloads.

Complete Integration at Scale
Design and build of full racks and clusters with a global manufacturing capacity of up to 5,000 racks per month
Test, Validate, Deploy with On-site Service
Proven L11 and L12 testing processes thoroughly validate operational effectiveness and efficiency before shipping
Liquid Cooling/Air Cooling
Fully integrated liquid-cooling or air-cooling solutions with GPU and CPU cold plates, Cooling Distribution Units (CDUs), and manifolds
Supply and Inventory Management
One-stop shop that delivers fully integrated racks quickly and on time, reducing time-to-solution for rapid deployment

AI SuperClusters

The full turn-key data center solution accelerates time-to-delivery for mission-critical enterprise use cases and eliminates the complexity of building a large cluster, which was previously achievable only through the intensive design tuning and time-consuming optimization typical of supercomputing.

Liquid-Cooled 2-OU NVIDIA HGX B300 AI Cluster

Fully integrated liquid-cooled 144-node cluster with up to 1152 NVIDIA B300 GPUs

Unmatched AI training performance density from NVIDIA HGX B300 with compact 2-OU liquid-cooled system nodes
Supermicro Direct Liquid Cooling featuring 1.8MW capacity in-row CDUs (in-rack CDU options available)
Large HBM3e GPU memory capacity (288GB* of HBM3e memory per GPU) and system memory footprint for foundation model training
Scale-out with NVIDIA Quantum-X800 InfiniBand for ultra-low-latency, high-bandwidth AI fabrics
Dedicated storage fabric options with full NVIDIA GPUDirect RDMA and Storage or RoCE support
Designed to fully support NVIDIA AI Software Platforms, including NVIDIA AI Enterprise and NVIDIA Run:ai

* Physical GPU memory

Compute Node

Supermicro 2-OU Liquid-Cooled NVIDIA® HGX™ B300 8-GPU System (SYS-222GS-NB3OT-ALC)

144 NVIDIA® HGX™ B300 8-GPU, 2-OU Liquid-cooled Supermicro Systems (1152 GPUs) in 11 Racks along with two in-row Cooling Distribution Units (CDUs)

Liquid-Cooled 4U NVIDIA HGX B300 AI Cluster

Fully integrated liquid-cooled 72-node cluster with up to 576 NVIDIA B300 GPUs

Deploy high-performance AI training and inference with NVIDIA HGX B300 optimized for compute density and serviceability
Supermicro Direct Liquid Cooling designed for sustained high-power operation and improved energy efficiency
Large HBM3e GPU memory capacity (288GB* of HBM3e memory per GPU) and system memory footprint for foundation model training
Scale-out with NVIDIA Spectrum™-X Ethernet or NVIDIA Quantum-X800 InfiniBand
Dedicated storage fabric options with full NVIDIA GPUDirect RDMA and Storage or RoCE support
Designed to fully support NVIDIA AI Software Platforms, including NVIDIA AI Enterprise and NVIDIA Run:ai

* Physical GPU memory

Compute Node

Supermicro 4U Liquid-Cooled NVIDIA® HGX™ B300 8-GPU System (SYS-422GS-NB3RT-ALC or SYS-422GS-NB3RT-LCC)

72 NVIDIA® HGX™ B300 8-GPU, 4U Liquid-cooled Supermicro Systems (576 GPUs) in 11 Racks

Air-Cooled 8U NVIDIA HGX B300 AI Cluster

Fully integrated air-cooled 32-node cluster with up to 256 NVIDIA HGX B300 GPUs and 73.7TB total HBM3e memory

Full-stack solutions based on reference architectures including Supermicro systems, NVIDIA GPUs, NVIDIA software, and NVIDIA networking
Up to 256x NVIDIA HGX B300 GPUs providing up to 73.7TB total HBM3e memory (288GB HBM3e per GPU*)
Compatibility with the NVIDIA software stack (NVIDIA AI Enterprise and NVIDIA Run:ai)
Plug-and-play solution with systems fully integrated into racks and tested before shipment and on-site deployment
Scale-out with NVIDIA Spectrum-X Ethernet Compute Fabric or NVIDIA Quantum-X800 InfiniBand; Converged Network and Out-of-Band Management included
Supermicro AI Factory solutions endorsed by NVIDIA for Infrastructure Configuration, Spectrum-X networking, and Software Reference Stack based on the NVIDIA Enterprise Reference Architecture for HGX B300

* Physical GPU memory
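
The 73.7TB total quoted above follows from the GPU count and the per-GPU capacity. A minimal Python sketch of that arithmetic, assuming decimal units (1TB = 1000GB) and 8 GPUs per HGX B300 node; the function name `total_hbm_tb` is illustrative and not part of any Supermicro or NVIDIA tool:

```python
def total_hbm_tb(nodes: int, gpus_per_node: int = 8, gb_per_gpu: int = 288) -> float:
    """Aggregate physical HBM3e capacity in decimal terabytes (1 TB = 1000 GB)."""
    return nodes * gpus_per_node * gb_per_gpu / 1000


# 32-node air-cooled cluster: 32 nodes x 8 GPUs x 288 GB per GPU
nodes = 32
print(f"{nodes * 8} GPUs, {total_hbm_tb(nodes):.1f} TB HBM3e")
```

The same function can be applied to the other B300 node counts on this page by changing `nodes`; the result is physical capacity only and says nothing about usable memory.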

Compute Node

Supermicro 8U Air-Cooled NVIDIA® HGX™ B300 8-GPU System (SYS-822GS-NB3RT or AS -8126GS-NB3RT)

72 NVIDIA® HGX™ B300 8-GPU, 8U Air-cooled Supermicro Systems (576 GPUs) in 20 Racks

Liquid-Cooled NVIDIA HGX B200 AI Cluster

With up to 32 NVIDIA HGX B200 8-GPU, 4U Liquid-cooled Systems (256 GPUs) in 5 Racks

Deploy the pinnacle of AI training and inference performance with 256 NVIDIA B200 GPUs in one scalable unit (5 racks)
Supermicro Direct Liquid Cooling featuring 250kW capacity in-rack Coolant Distribution Unit (CDU) with redundant PSU and dual hot-swap pumps
45 TB of HBM3e memory in one scalable unit
Scale-out with 400Gb/s NVIDIA Spectrum-X Ethernet or NVIDIA Quantum-2 InfiniBand
Dedicated storage fabric options with full NVIDIA GPUDirect RDMA and Storage or RoCE support
Designed to fully support NVIDIA AI Software Platforms, including NVIDIA AI Enterprise and NVIDIA Run:ai

Compute Node

Supermicro 4U Liquid-Cooled 8-GPU System (SYS-422GA-NBRT-LCC or AS -4126GS-NBR-LCC)

32 NVIDIA HGX B200 8-GPU, 4U Liquid-cooled Systems (256 GPUs) in 5 Racks

Air-Cooled NVIDIA HGX B200 AI Cluster

With 32 NVIDIA HGX B200 8-GPU, 10U Air-cooled Systems (256 GPUs) in 9 Racks

Proven, industry-leading architecture with a new thermally optimized air-cooled system platform
45 TB of HBM3e memory in one scalable unit
Scale-out with 400Gb/s NVIDIA Spectrum-X Ethernet or NVIDIA Quantum-2 InfiniBand
Dedicated storage fabric options with full NVIDIA GPUDirect RDMA and Storage or RoCE support
NVIDIA Certified system nodes, fully supporting NVIDIA AI Software Platforms, including NVIDIA AI Enterprise and NVIDIA Run:ai

Compute Node

Supermicro 10U Air-Cooled 8-GPU System (SYS-A22GA-NBRT or AS -A126GS-TNBR)

32 NVIDIA® HGX™ B200 8-GPU, 10U Air-cooled Supermicro Systems (256 GPUs) in 9 Racks
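
The liquid-cooled and air-cooled HGX B200 scalable units hold the same 256 GPUs in 5 and 9 racks respectively. The sketch below works out the resulting average GPU density per rack. It assumes the rack counts are used exactly as quoted (they may include networking or other non-compute racks, which is not stated here), and the `ScalableUnit` class is a hypothetical helper for illustration only:

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ScalableUnit:
    """One scalable unit as quoted on this page (rack count taken at face value)."""
    name: str
    gpus: int
    racks: int

    @property
    def gpus_per_rack(self) -> float:
        # Average over all quoted racks, not only compute racks
        return self.gpus / self.racks


liquid = ScalableUnit("HGX B200, 4U liquid-cooled", gpus=256, racks=5)
air = ScalableUnit("HGX B200, 10U air-cooled", gpus=256, racks=9)

for unit in (liquid, air):
    print(f"{unit.name}: {unit.gpus_per_rack:.1f} GPUs per rack")

# How many times more racks the air-cooled unit needs for the same GPU count
rack_ratio = air.racks / liquid.racks
print(f"Air-cooled uses {rack_ratio:.1f}x the racks")
```

This is an average footprint comparison only; it does not model power, cooling capacity, or floor layout.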

NVIDIA GB300 NVL72

Liquid-cooled Exascale Compute in a Single Rack

Rack-scale solution with NVIDIA GB300 Grace™ Blackwell Superchip providing 72 NVIDIA B300 GPUs and 36 Grace CPUs per rack
NVIDIA Blackwell Ultra with 288GB HBM3e per GPU
Direct Liquid-Cooling for up to 40% reduction in electricity cost for the data center
Comprehensive Service from consultation to full-scale deployment, providing all necessary parts, networking solutions, and onsite installation services
Scale-out with 400Gb/s NVIDIA Spectrum-X Ethernet or NVIDIA Quantum-2 InfiniBand
Up to 800Gb/s NVIDIA Quantum-X800 InfiniBand or Spectrum-X Ethernet with integrated NVIDIA ConnectX®-8 SuperNICs

Rack Solution

Supermicro NVIDIA GB300 NVL72 SuperCluster (Liquid-to-Liquid)

SRS-GB300-NVL72

72 NVIDIA® GB300 Grace Blackwell Superchip–based Supermicro compute trays in 5 racks

NVIDIA GB200 NVL72

Liquid-cooled Exascale Compute in a Single Rack

72x NVIDIA Blackwell B200 GPUs acting as one GPU with a massive pool of HBM3e memory (13.5TB per rack)
9x NVLink Switch, 4 ports per compute tray connecting 72 GPUs to provide 1.8TB/s GPU-to-GPU interconnect
Supermicro Direct Liquid Cooling featuring 250kW capacity in-rack Coolant Distribution Unit (CDU) with redundant PSU and dual hot-swap pumps
Dedicated storage fabric options with full NVIDIA GPUDirect RDMA and Storage or RoCE support
Scale-out with 400Gb/s NVIDIA Spectrum-X Ethernet or NVIDIA Quantum-2 InfiniBand
Designed to fully support NVIDIA AI Software Platforms, including NVIDIA AI Enterprise and NVIDIA Run:ai

Rack Solution

Supermicro NVIDIA GB200 NVL72 SuperCluster (Liquid-to-Liquid)

SRS-GB200-NVL72

72 NVIDIA® GB200 Grace Blackwell Superchip–based Supermicro compute trays in 5 racks

NVIDIA RTX PRO™ SuperCluster

AI Factory Solutions with NVIDIA RTX PRO 6000 Blackwell Server Edition

Full-stack solutions based on reference architectures including Supermicro systems, NVIDIA GPUs, NVIDIA software, and NVIDIA networking
Up to 256x NVIDIA RTX PRO 6000 Blackwell Server Edition GPUs providing up to 24TB GDDR7 memory
Compatibility with the NVIDIA software stack (NVIDIA AI Enterprise, NVIDIA Omniverse, and NVIDIA Run:ai)
Plug-and-play solution with systems fully integrated into racks and tested before shipment and on-site deployment
NVIDIA Spectrum-X Ethernet Compute Fabric, Converged Network, and Out-of-Band Management included
Supermicro AI Factory solutions endorsed by NVIDIA for Infrastructure Configuration, Spectrum-X networking, and Software Reference Stack based on the NVIDIA Enterprise Reference Architecture for RTX PRO 6000 Blackwell Server Edition

Leading Liquid-Cooled AI Cluster

With 32 NVIDIA HGX H200 8-GPU, 4U Liquid-cooled Systems (256 GPUs) in 5 Racks

Doubles compute density through Supermicro’s custom liquid-cooling solution, with up to a 40% reduction in data center electricity cost
256 NVIDIA H200 GPUs in one scalable unit
36TB of HBM3e with H200 in one scalable unit
Dedicated storage fabric options with full NVIDIA GPUDirect RDMA and Storage or RoCE support
Scale-out with 400Gb/s NVIDIA Spectrum-X Ethernet or NVIDIA Quantum-2 InfiniBand
NVIDIA Certified system nodes, fully supporting NVIDIA AI Software Platforms, including NVIDIA AI Enterprise and NVIDIA Run:ai

Compute Node

Supermicro 4U Liquid-Cooled 8-GPU System (SYS-421GE-TNHR2-LCC or AS -4125GS-TNHR2-LCC)

32 NVIDIA® HGX™ H200 8-GPU, 4U Liquid-cooled Supermicro Systems (256 GPUs) in 5 Racks

Proven Design

With 32 NVIDIA HGX H200 8-GPU, 8U Air-cooled Systems (256 GPUs) in 9 Racks

Proven, industry-leading architecture for large-scale AI infrastructure deployments
256 NVIDIA H200 GPUs in one scalable unit
36TB of HBM3e with H200 in one scalable unit
Scale-out with 400Gb/s NVIDIA Spectrum-X Ethernet or NVIDIA Quantum-2 InfiniBand
Customizable AI data pipeline storage fabric with industry-leading parallel file system options
NVIDIA Certified system nodes, fully supporting NVIDIA AI Software Platforms, including NVIDIA AI Enterprise and NVIDIA Run:ai

Compute Node

Supermicro 8U Air-Cooled 8-GPU System (SYS-821GE-TNHR or AS -8125GS-TNHR)

32 NVIDIA® HGX™ H200 8-GPU, 8U Air-cooled Supermicro Systems (256 GPUs) in 9 Racks
