Skip to content
// FLEET
us-west-1GB300 · liquid
eu-central-1B300 · liquid
apac-sg-1GB200 · NVL72
me-uae-1VR200 · Rubin
// SEGAL CLOUD
control plane online

GPU compute, on tap.

On-demand, reserved, and spot access to NVIDIA GPU compute. On-demand, reserved, or spot — billed per GPU-second with the region multiplier applied at metering time.

// CAPABILITIES
01

Instances

Launch a GPU in a region from a template. Start, stop, resize, terminate. Cost-so-far meter.

02

Clusters

Multi-node NVLink / InfiniBand domains with shared volumes and chosen topology.

03

Managed endpoints

Deploy a vLLM or TensorRT-LLM endpoint to an HTTPS URL. Autoscale, scale-to-zero.

04

Storage

Persistent block volumes — attach, detach, snapshot — and S3-compatible buckets.

05

Networking

Private VPC, public IPs, firewall rules, and region peering.

06

Monitoring

GPU util, VRAM, temp, power, NVLink throughput, tokens/sec. Alert thresholds.

// AVAILABLE ON-DEMAND
// GET STARTED

Launch a GPU in minutes.

On-demand, reserved, or spot — billed per GPU-second with the region multiplier applied at metering time.