◦ PRICING

Credit-based pricing.

1 credit = $0.001. Pay for what you use. Small models cost less, large models cost more.

FREE
$0forever

10,000 credits / mo · ~$10 value

For experimentation and prototyping.

  • ✓ 10 requests / min
  • ✓ 1 API key
  • ✓ All 30+ models
  • ✓ Community support
  • ✓ No credit card required
PROPOPULAR
$49/ month

500,000 credits / mo · ~$500 value

For production applications.

  • ✓ 100 requests / min
  • ✓ 5 API keys
  • ✓ Overage $1.00 / 1K credits
  • ✓ Email support
  • ✓ 30-day audit logs
  • ✓ Usage alerts + per-key breakdown
ENTERPRISEDEDICATED
Custom

Unlimited credits · negotiated rates

For regulated industries.

  • ✓ Custom rate limits
  • ✓ Unlimited API keys
  • ✓ Volume discount on credits
  • ✓ Priority model access
  • ✓ Dedicated support + SLA
  • ✓ 1-year audit logs + export
  • ✓ SOX / PIPEDA reporting
  • ✓ Dedicated model hosting

Credits per model

Smaller models = fewer credits. Pick the right model for your use case.

Chat · Reasoning · Code

MODELCATEGORYCREDITS / 1M INCREDITS / 1M OUT
GPT-OSS 20BLLM60230
Qwen 3 Coder 30BCode90330
GPT-OSS 120BLLM110590
Qwen 3 32BLLM110310
Mistral Small 3.2 24BLLM130390
Llama 3.1 8B InstructLLM140140
Mistral 7B Instruct v0.3LLM140140
Mistral Nemo 12BLLM180180
Mixtral 8x7B InstructLLM880880
Llama 3.3 70B InstructLLM930930
DeepSeek R1 Distill 70BReasoning930930

Embedding

MODELCREDITS / 1M TOKENS
BGE-M310
BGE Base EN v1.510
BGE Multilingual Gemma210
Qwen 3 Embedding 8B150

Vision · Speech · Image · Safety

MODELCATEGORYCREDITS
Qwen 2.5 VL 72BVision1,260 / M tokens
Whisper Large V3SpeechIncluded
Whisper Large V3 TurboSpeechIncluded
Stable Diffusion XL BaseImageIncluded
Qwen 3 Guard 8BSafetyIncluded
Qwen 3 Guard 0.6BSafetyIncluded

HOW CREDITS WORK

  1. 01

    Each API call costs credits based on the model and tokens used.

  2. 02

    Small models (Llama 8B, Mistral 7B) cost ~140 credits per million tokens.

  3. 03

    Large models (GPT-OSS 120B, Llama 70B) cost ~900 credits per million tokens.

  4. 04

    Embedding and safety models are ultra-cheap or included at no additional cost.

  5. 05

    Your dashboard shows real-time credit balance, burn rate, and per-key breakdown.