Catalog

16 models, one API key

Every model is served over an OpenAI-compatible endpoint and priced per token. Filter by capability, copy the model id, and start calling Speka in minutes.

DeepSeek logo

DeepSeek V4 Pro

DeepSeek
reasoning

State-of-the-art open reasoning model with transparent chain-of-thought. Excels at math, logic and multi-step problem solving.

deepseek-ai/deepseek-v4-pro
Input / 1M
$0.55
Output / 1M
$2.19
Open model
NVIDIA logo

Nemotron Super 49B

NVIDIA
reasoning

NVIDIA's reasoning-tuned Nemotron model — strong math, logic and agentic tool-use at an efficient size.

nvidia/llama-3.3-nemotron-super-49b-v1.5
Input / 1M
$0.35
Output / 1M
$0.40
Open model
NVIDIA logo

Nemotron Nano 9B

NVIDIA
reasoning

Compact reasoning model that punches above its weight on math and coding benchmarks. Fast and cheap.

nvidia/nvidia-nemotron-nano-9b-v2
Input / 1M
$0.10
Output / 1M
$0.10
Open model
Meta logo

Llama 3.3 70B Instruct

Meta
chat

Meta's flagship 70B instruct model — 405B-class quality at a fraction of the cost. Great default for production chat.

meta/llama-3.3-70b-instruct
Input / 1M
$0.20
Output / 1M
$0.20
Open model
Meta logo

Llama 4 Maverick

Meta
chat

Meta's latest flagship mixture-of-experts model. Frontier quality for demanding reasoning and generation.

meta/llama-4-maverick-17b-128e-instruct
Input / 1M
$0.60
Output / 1M
$0.60
Open model
Meta logo

Llama 3.1 8B Instruct

Meta
chat

Fast, cheap and capable. Ideal for high-volume classification, routing and lightweight chat.

meta/llama-3.1-8b-instruct
Input / 1M
$0.05
Output / 1M
$0.05
Open model
Mistral AI logo

Mistral Large 3

Mistral AI
chat

Mistral's top-tier model with excellent reasoning, function calling and 80+ language support.

mistralai/mistral-large-3-675b-instruct-2512
Input / 1M
$0.90
Output / 1M
$2.70
Open model
Mistral AI logo

Mistral Medium 3.5

Mistral AI
chat

Balanced mid-size Mistral model with strong multilingual and coding ability at efficient cost.

mistralai/mistral-medium-3.5-128b
Input / 1M
$0.40
Output / 1M
$0.40
Open model
Qwen logo

Qwen3 Coder 480B

Qwen
code

Best-in-class open code model. Excellent at generation, completion, refactoring and bug fixing.

qwen/qwen3-coder-480b-a35b-instruct
Input / 1M
$0.20
Output / 1M
$0.80
Open model
Meta logo

Llama 3.2 90B Vision

Meta
vision

Large multimodal model for image understanding, document Q&A, charts and visual reasoning.

meta/llama-3.2-90b-vision-instruct
Input / 1M
$0.35
Output / 1M
$0.40
Open model
Meta logo

Llama 3.2 11B Vision

Meta
vision

Lightweight vision-language model for fast image captioning, OCR and visual Q&A.

meta/llama-3.2-11b-vision-instruct
Input / 1M
$0.06
Output / 1M
$0.06
Open model
Microsoft logo

Phi-4 Multimodal

Microsoft
vision

Microsoft's compact multimodal model handling text, image and audio understanding.

microsoft/phi-4-multimodal-instruct
Input / 1M
$0.08
Output / 1M
$0.16
Open model
NVIDIA logo

NV-EmbedQA E5 v5

NVIDIA
embedding

Robust retrieval embeddings tuned for question answering and RAG pipelines.

nvidia/nv-embedqa-e5-v5
Input / 1M
$0.01
Open model
NVIDIA logo

NV-Embed v1

NVIDIA
embedding

High-accuracy general-purpose text embeddings for semantic search and clustering.

nvidia/nv-embed-v1
Input / 1M
$0.016
Open model
Black Forest Labs logo

FLUX.1 [dev]

Black Forest Labs
image

High-fidelity text-to-image generation with excellent prompt adherence and typography.

black-forest-labs/flux.1-dev
Per image
$0.04
Open model
Stability AI logo

Stable Diffusion 3.5 Large

Stability AI
image

Stability's flagship image model for photorealistic and artistic generation.

stabilityai/stable-diffusion-3.5-large
Per image
$0.05
Open model