Pricing

Cerebrium pricing context

Human-reviewed pricing summary paired with DevTune’s public AI search visibility benchmark.

Reviewed pricing summary

  • Per-second, usage-based billing for all compute.
  • GPU rates range from $0.000164/s (T4) to $0.00167/s (B200), with A10 at $0.000306/s and H100 at $0.000944/s.
  • Memory is billed at $0.00000222/GB/s; CPU at $0.00000655/vCPU/s.
  • Storage costs $0.05/GB/month (first 100 GB free).
  • Three plan tiers:
      • Hobby: free base + compute, up to 3 apps, 5 concurrent GPUs.
      • Standard: $100/month + compute, unlimited apps, 30 concurrent GPUs, custom domains.
      • Enterprise: custom pricing, unlimited concurrency, dedicated Slack, volume discounts, ML engineering services.
  • Volume discounts and capacity guarantees (e.g., up to 50 H100s with $10,000/month minimum spend) are available for enterprise deployments.
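
To make the per-second rates easier to compare, here is a minimal Python sketch that turns them into a cost estimate. The rates are copied from the summary above. The function names (`compute_cost`, `storage_cost`) are hypothetical, and the sketch assumes that GPU, CPU, and memory charges simply add together and that storage is billed only on the amount above the free 100 GB. Neither assumption is confirmed by the vendor here, so treat the result as a rough estimate rather than a quote.

```python
# Rates copied from the reviewed summary; verify against the vendor's current pricing.
GPU_RATE_PER_S = {"T4": 0.000164, "A10": 0.000306, "H100": 0.000944, "B200": 0.00167}
CPU_RATE_PER_VCPU_S = 0.00000655
MEMORY_RATE_PER_GB_S = 0.00000222
STORAGE_RATE_PER_GB_MONTH = 0.05
FREE_STORAGE_GB = 100


def compute_cost(gpu, seconds, vcpus, memory_gb, gpu_count=1):
    """Estimate compute cost in USD for one workload.

    Assumption: GPU, CPU, and memory charges are additive per second.
    """
    per_second = (
        GPU_RATE_PER_S[gpu] * gpu_count
        + CPU_RATE_PER_VCPU_S * vcpus
        + MEMORY_RATE_PER_GB_S * memory_gb
    )
    return per_second * seconds


def storage_cost(stored_gb):
    """Estimate monthly storage cost in USD.

    Assumption: only storage above the free 100 GB is billed.
    """
    billable_gb = max(0, stored_gb - FREE_STORAGE_GB)
    return billable_gb * STORAGE_RATE_PER_GB_MONTH


if __name__ == "__main__":
    # One hour on a single H100 with 8 vCPUs and 32 GB of memory.
    print(f"H100 hour: ${compute_cost('H100', 3600, vcpus=8, memory_gb=32):.2f}")
    # 150 GB stored for a month.
    print(f"Storage:   ${storage_cost(150):.2f}")
```

Under these assumptions, the plan's base fee (for example, $100/month on Standard) would be added on top of the compute and storage totals.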

Benchmark context

Ranked #5 of 10 in LLM Inference & Serverless GPU, with 2.7% AI search visibility.

Sources and verification

Pricing changes often. Treat this page as evaluation context and verify contract terms, usage limits, and add-ons against the vendor’s current materials before making a buying decision.