Skip to content
// FLEET
us-west-1GB300 · liquid
eu-central-1B300 · liquid
apac-sg-1GB200 · NVL72
me-uae-1VR200 · Rubin
// HOPPER
Available

H200 SXM

141 GB HBM3e for long-context inference without the spill.

Best for
long-context inferenceRAG
Available in 5 regions
us-west-1us-east-1eu-central-1apac-sg-1ap-ph-1
Pricing

$3.59/GPU-hr

Architecture
Hopper
Form factor
SXM
Memory
141 GB HBM3e
Bandwidth
4.8 TB/s
NVLink
Gen 4
Board power
700 W
// SOFTWARE STACK
CUDANVLinkTensorRT-LLMvLLMSGLangNVIDIA DynamoNVFP4KV-cachespeculative decoding