Pricing
Lepton AI pricing context
Human-reviewed pricing summary paired with DevTune’s public AI search visibility benchmark.
Reviewed pricing summary
- Pre-acquisition, Lepton AI offered consumption-based per-token pricing for serverless LLM inference and hourly GPU rental rates for dedicated instances.
- Third-party benchmark analysis placed Lepton AI's blended per-token cost for Llama 3.1 70B at approximately $0.80 per 1 million tokens, comparable to Together AI ($0.88) and Fireworks AI ($0.90).
- The platform also offered GPU-backed dedicated instances with competitive hourly rates.
- Post-acquisition pricing is managed through NVIDIA DGX Cloud Lepton partner marketplaces (CoreWeave, Lambda, Nebius, etc.) and is not centrally published; current pricing should be verified directly with NVIDIA or individual cloud partners.
Benchmark context
#9
of 10 in LLM Inference & Serverless GPU
0.0%
AI search visibility
Sources and verification
Pricing changes often. Treat this page as evaluation context and verify contract terms, usage limits, and add-ons against the vendor’s current materials before making a buying decision.