Pricing
Pay for what you use. Nothing more.
Every plan includes a monthly usage allowance measured in real model spend. Exceed it and you only pay standard per-token rates — no overage penalties, no surprises.
Free
$0.00 /mo
Kick the tires. No card required.
- $1 of model usage / month
- 10 requests / minute
- 1 API key
- Access to all open models
- Web playground
- Community support
Starter
$19.00 /mo
For indie hackers and side projects.
- $25 of model usage included
- 60 requests / minute
- 5 API keys
- All models incl. vision & image
- Usage analytics & alerts
- Email support
- Pay-as-you-go overage
Most popular
Pro
$99.00 /mo
For production apps and growing teams.
- $150 of model usage included
- 300 requests / minute
- 25 API keys
- Priority routing & lower latency
- Per-key usage breakdowns
- Priority email support
- Pay-as-you-go overage
- 99.9% uptime target
Scale
$399.00 /mo
For high-volume, latency-sensitive workloads.
- $750 of model usage included
- 1,200 requests / minute
- 200 API keys
- Highest-priority routing
- Volume discounts on overage
- Dedicated support channel
- SSO & audit logs (on request)
- Custom SLAs available
Prices in USD. Cancel or change plans any time. Need higher volume or an SLA? Talk to us.
Per-token model rates
Usage is billed against your plan's included allowance at these rates (USD per 1M tokens).
| Model | Publisher | Input / 1M | Output / 1M |
|---|---|---|---|
DeepSeek V4 Pro deepseek-ai/deepseek-v4-pro | $0.55 | $2.19 | |
Nemotron Super 49B nvidia/llama-3.3-nemotron-super-49b-v1.5 | $0.35 | $0.40 | |
Nemotron Nano 9B nvidia/nvidia-nemotron-nano-9b-v2 | $0.10 | $0.10 | |
Llama 3.3 70B Instruct meta/llama-3.3-70b-instruct | $0.20 | $0.20 | |
Llama 4 Maverick meta/llama-4-maverick-17b-128e-instruct | $0.60 | $0.60 | |
Llama 3.1 8B Instruct meta/llama-3.1-8b-instruct | $0.05 | $0.05 | |
Mistral Large 3 mistralai/mistral-large-3-675b-instruct-2512 | $0.90 | $2.70 | |
Mistral Medium 3.5 mistralai/mistral-medium-3.5-128b | $0.40 | $0.40 | |
Qwen3 Coder 480B qwen/qwen3-coder-480b-a35b-instruct | $0.20 | $0.80 | |
Llama 3.2 90B Vision meta/llama-3.2-90b-vision-instruct | $0.35 | $0.40 | |
Llama 3.2 11B Vision meta/llama-3.2-11b-vision-instruct | $0.06 | $0.06 | |
Phi-4 Multimodal microsoft/phi-4-multimodal-instruct | $0.08 | $0.16 |
Embeddings are billed per 1M input tokens and image models per image — see the full catalog.
Still deciding?
Start on the free plan — no card required — and upgrade only when you need more.