All Providers
295 Models
75 Families
2M Max Context
$0–$30 Input Cost/MTok
$0–$180 Output Cost/MTok
Capabilities
Reasoning
Tool Calling
Attachments
Open Weights
Setup
Set the following environment variable to use Vercel AI Gateway:
AI_GATEWAY_API_KEY Models (295)
| Model | Model ID | Input Cost | Output Cost | Context | Capabilities |
|---|---|---|---|---|---|
| Llama-4-Scout-17B-16E-Instruct-FP8 llama | meta/llama-4-scout | $0/MTok | $0/MTok | 128K | Tools Open |
| Llama-3.3-70B-Instruct llama | meta/llama-3.3-70b | $0/MTok | $0/MTok | 128K | Tools Open |
| Llama-4-Maverick-17B-128E-Instruct-FP8 llama | meta/llama-4-maverick | $0/MTok | $0/MTok | 128K | Tools Open |
| Nova Micro nova-micro | amazon/nova-micro | $0.04/MTok | $0.14/MTok | 128K | Tools |
| Ministral 3B (latest) ministral | mistral/ministral-3b | $0.04/MTok | $0.04/MTok | 128K | Tools Open |
| Trinity Mini trinity | arcee-ai/trinity-mini | $0.04/MTok | $0.15/MTok | 131.1K | |
| GPT OSS 20B gpt-oss | openai/gpt-oss-20b | $0.05/MTok | $0.20/MTok | 131.1K | Reasoning Tools Open |
| GPT-5 Nano gpt-nano | openai/gpt-5-nano | $0.05/MTok | $0.40/MTok | 400K | Reasoning Tools |
| Nemotron 3 Nano 30B A3B nemotron | nvidia/nemotron-3-nano-30b-a3b | $0.05/MTok | $0.24/MTok | 262.1K | Reasoning |
| GLM 4.7 FlashX glm-flash | zai/glm-4.7-flashx | $0.06/MTok | $0.40/MTok | 200K | Reasoning Tools Open |
| Nvidia Nemotron Nano 9B V2 nemotron | nvidia/nemotron-nano-9b-v2 | $0.06/MTok | $0.23/MTok | 131.1K | Reasoning Tools |
| Nova Lite nova-lite | amazon/nova-lite | $0.06/MTok | $0.24/MTok | 300K | Tools |
| GLM 4.7 Flash glm | zai/glm-4.7-flash | $0.07/MTok | $0.40/MTok | 200K | Reasoning Tools |
| gpt-oss-safeguard-20b gpt-oss | openai/gpt-oss-safeguard-20b | $0.07/MTok | $0.30/MTok | 131.1K | Reasoning Tools |
| StepFun 3.5 Flash step | stepfun/step-3.5-flash | $0.09/MTok | $0.30/MTok | 262.1K | Reasoning Tools |
| Devstral Small 1.1 devstral | mistral/devstral-small | $0.10/MTok | $0.30/MTok | 128K | Tools |
| Devstral Small 2 devstral | mistral/devstral-small-2 | $0.10/MTok | $0.30/MTok | 256K | Tools |
| Mistral Small (latest) mistral-small | mistral/mistral-small | $0.10/MTok | $0.30/MTok | 32K | Tools Open |
| Ministral 8B (latest) ministral | mistral/ministral-8b | $0.10/MTok | $0.10/MTok | 128K | Tools Open |
| Gemini 2.5 Flash Lite gemini-flash-lite | google/gemini-2.5-flash-lite | $0.10/MTok | $0.40/MTok | 1.0M | Reasoning Tools |
| GPT OSS 120B gpt-oss | openai/gpt-oss-120b | $0.10/MTok | $0.50/MTok | 131.1K | Reasoning Tools Open |
| GPT-4.1 nano gpt-nano | openai/gpt-4.1-nano | $0.10/MTok | $0.40/MTok | 1.0M | Tools |
| MiMo V2 Flash mimo | xiaomi/mimo-v2-flash | $0.10/MTok | $0.30/MTok | 262.1K | Reasoning Tools |
| Qwen 3.5 Flash qwen | alibaba/qwen3.5-flash | $0.10/MTok | $0.40/MTok | 1M | Reasoning Tools |
| Llama 3.2 1B Instruct llama | meta/llama-3.2-1b | $0.10/MTok | $0.10/MTok | 128K | |
| Qwen3-30B-A3B qwen | alibaba/qwen-3-30b | $0.12/MTok | $0.50/MTok | 41.0K | Reasoning Tools |
| Qwen3-14B qwen | alibaba/qwen-3-14b | $0.12/MTok | $0.24/MTok | 41.0K | Reasoning Tools |
| Gemma 4 31B IT gemma | google/gemma-4-31b-it | $0.14/MTok | $0.40/MTok | 262.1K | Tools |
| MiMo M2.5 mimo-v2.5 | xiaomi/mimo-v2.5 | $0.14/MTok | $0.28/MTok | 1.1M | Reasoning Tools |
| DeepSeek V4 Flash deepseek | deepseek/deepseek-v4-flash | $0.14/MTok | $0.28/MTok | 1M | Reasoning Tools Open |
| Mistral Nemo mistral-nemo | mistral/mistral-nemo | $0.15/MTok | $0.15/MTok | 128K | Tools |
| Pixtral 12B pixtral | mistral/pixtral-12b | $0.15/MTok | $0.15/MTok | 128K | Tools Open |
| Gemma 4 26B A4B IT gemma | google/gemma-4-26b-a4b-it | $0.15/MTok | $0.60/MTok | 262.1K | Reasoning Tools |
| GPT 4o Mini Search Preview gpt-mini | openai/gpt-4o-mini-search-preview | $0.15/MTok | $0.60/MTok | 128K | |
| GPT-4o mini gpt-mini | openai/gpt-4o-mini | $0.15/MTok | $0.60/MTok | 128K | Tools |
| NVIDIA Nemotron 3 Super 120B A12B nemotron | nvidia/nemotron-3-super-120b-a12b | $0.15/MTok | $0.65/MTok | 256K | Reasoning |
| Qwen3 Next 80B A3B Instruct qwen | alibaba/qwen3-next-80b-a3b-instruct | $0.15/MTok | $1.20/MTok | 131.1K | Tools Open |
| Qwen3 Next 80B A3B Thinking qwen | alibaba/qwen3-next-80b-a3b-thinking | $0.15/MTok | $1.20/MTok | 131.1K | Reasoning Tools Open |
| Qwen 3 Coder 30B A3B Instruct qwen | alibaba/qwen3-coder-30b-a3b | $0.15/MTok | $0.60/MTok | 262.1K | Reasoning Tools |
| Llama 3.2 3B Instruct llama | meta/llama-3.2-3b | $0.15/MTok | $0.15/MTok | 128K | |
| Qwen 3.32B qwen | alibaba/qwen-3-32b | $0.16/MTok | $0.64/MTok | 128K | Reasoning Tools |
| Llama 3.2 11B Vision Instruct llama | meta/llama-3.2-11b | $0.16/MTok | $0.16/MTok | 128K | Tools |
| Grok 4.1 Fast Reasoning grok | xai/grok-4.1-fast-reasoning | $0.20/MTok | $0.50/MTok | 1M | Reasoning Tools |
| Grok 4.1 Fast Non-Reasoning grok | xai/grok-4.1-fast-non-reasoning | $0.20/MTok | $0.50/MTok | 1M | Tools |
| Ministral 14B ministral | mistral/ministral-14b | $0.20/MTok | $0.20/MTok | 256K | |
| GPT 5.4 Nano gpt | openai/gpt-5.4-nano | $0.20/MTok | $1.25/MTok | 400K | Reasoning Tools |
| GLM 4.5 Air glm-air | zai/glm-4.5-air | $0.20/MTok | $1.10/MTok | 128K | Reasoning Tools Open |
| Nvidia Nemotron Nano 12B V2 VL nemotron | nvidia/nemotron-nano-12b-v2-vl | $0.20/MTok | $0.60/MTok | 131.1K | Reasoning Tools |
| Step 3.7 Flash step | stepfun/step-3.7-flash | $0.20/MTok | $1.15/MTok | 256K | Reasoning Tools |
| Qwen3 235B A22B Instruct 2507 qwen | alibaba/qwen-3-235b | $0.22/MTok | $0.88/MTok | 262.1K | Reasoning Tools |
| Llama 3.1 8B Instruct llama | meta/llama-3.1-8b | $0.22/MTok | $0.22/MTok | 128K | Tools |
| Gemini 3.1 Flash Lite gemini | google/gemini-3.1-flash-lite | $0.25/MTok | $1.50/MTok | 1M | Reasoning Tools |
| Gemini 3.1 Flash Lite Preview gemini | google/gemini-3.1-flash-lite-preview | $0.25/MTok | $1.50/MTok | 1M | Reasoning Tools |
| GPT-5.1 Codex mini gpt | openai/gpt-5.1-codex-mini | $0.25/MTok | $2/MTok | 400K | Reasoning Tools |
| GPT-5 Mini gpt-mini | openai/gpt-5-mini | $0.25/MTok | $2/MTok | 400K | Reasoning Tools |
| Seed 1.6 seed | bytedance/seed-1.6 | $0.25/MTok | $2/MTok | 256K | Reasoning Tools |
| Seed 1.8 seed | bytedance/seed-1.8 | $0.25/MTok | $2/MTok | 256K | Reasoning Tools |
| Mercury Coder Small Beta mercury | inception/mercury-coder-small | $0.25/MTok | $1/MTok | 32K | Tools |
| Mercury 2 mercury | inception/mercury-2 | $0.25/MTok | $0.75/MTok | 128K | Reasoning Tools |
| Claude Haiku 3 claude-haiku | anthropic/claude-3-haiku | $0.25/MTok | $1.25/MTok | 200K | Tools |
| Trinity Large Preview trinity | arcee-ai/trinity-large-preview | $0.25/MTok | $1/MTok | 131K | Tools |
| Trinity Large Thinking trinity | arcee-ai/trinity-large-thinking | $0.25/MTok | $0.90/MTok | 262.1K | Reasoning Tools Open |
| DeepSeek V3.1 Terminus deepseek | deepseek/deepseek-v3.1-terminus | $0.27/MTok | $1/MTok | 131.1K | Reasoning Tools Open |
| DeepSeek V3 0324 deepseek | deepseek/deepseek-v3 | $0.27/MTok | $1.12/MTok | 163.8K | Tools |
| DeepSeek V3.2 deepseek | deepseek/deepseek-v3.2 | $0.28/MTok | $0.42/MTok | 128K | |
| Codestral (latest) codestral | mistral/codestral | $0.30/MTok | $0.90/MTok | 256K | Tools Open |
| Nano Banana (Gemini 2.5 Flash Image) gemini-flash | google/gemini-2.5-flash-image | $0.30/MTok | $2.50/MTok | 32.8K | |
| Gemini 2.5 Flash gemini-flash | google/gemini-2.5-flash | $0.30/MTok | $2.50/MTok | 1.0M | Reasoning Tools |
| GLM-4.6V glm | zai/glm-4.6v | $0.30/MTok | $0.90/MTok | 128K | Reasoning Tools |
| Nova 2 Lite nova | amazon/nova-2-lite | $0.30/MTok | $2.50/MTok | 1M | Reasoning |
| MiniMax M2.5 minimax | minimax/minimax-m2.5 | $0.30/MTok | $1.20/MTok | 204.8K | Reasoning Tools |
| MiniMax M2.1 minimax | minimax/minimax-m2.1 | $0.30/MTok | $1.20/MTok | 204.8K | Reasoning Tools |
| MiniMax M3 minimax-m3 | minimax/minimax-m3 | $0.30/MTok | $1.20/MTok | 1M | Reasoning Tools Open |
| MiniMax M2 minimax | minimax/minimax-m2 | $0.30/MTok | $1.20/MTok | 205K | Reasoning Tools Open |
| Minimax M2.7 minimax | minimax/minimax-m2.7 | $0.30/MTok | $1.20/MTok | 204.8K | Reasoning Tools Open |
| MiniMax M2.1 Lightning minimax | minimax/minimax-m2.1-lightning | $0.30/MTok | $2.40/MTok | 204.8K | Reasoning Tools |
| KAT-Coder-Pro V1 kat-coder | kwaipilot/kat-coder-pro-v1 | $0.30/MTok | $1.20/MTok | 256K | Reasoning |
| Kat Coder Pro V2 kat-coder | kwaipilot/kat-coder-pro-v2 | $0.30/MTok | $1.20/MTok | 256K | Reasoning Tools |
| Mistral Medium 3.1 mistral-medium | mistral/mistral-medium | $0.40/MTok | $2/MTok | 128K | Tools |
| Devstral 2 devstral | mistral/devstral-2 | $0.40/MTok | $2/MTok | 256K | Tools |
| GPT-4.1 mini gpt-mini | openai/gpt-4.1-mini | $0.40/MTok | $1.60/MTok | 1.0M | Tools |
| Qwen3 VL Thinking qwen | alibaba/qwen3-vl-thinking | $0.40/MTok | $4/MTok | 131.1K | Reasoning Tools Open |
| Qwen 3.7 Plus qwen3.7-plus | alibaba/qwen3.7-plus | $0.40/MTok | $1.60/MTok | 1M | Reasoning Tools |
| Qwen3 VL Instruct qwen | alibaba/qwen3-vl-instruct | $0.40/MTok | $1.60/MTok | 131.1K | Tools Open |
| Qwen 3.5 Plus qwen | alibaba/qwen3.5-plus | $0.40/MTok | $2.40/MTok | 1M | Reasoning Tools |
| Qwen3 235B A22B Thinking 2507 qwen | alibaba/qwen3-235b-a22b-thinking | $0.40/MTok | $4/MTok | 131.1K | Reasoning Tools |
| Qwen3 VL 235B A22B Instruct qwen | alibaba/qwen3-vl-235b-a22b-instruct | $0.40/MTok | $1.60/MTok | 131.1K | |
| MiMo V2.5 Pro mimo-v2.5-pro | xiaomi/mimo-v2.5-pro | $0.43/MTok | $0.87/MTok | 1.1M | Reasoning Tools |
| DeepSeek V4 Pro deepseek | deepseek/deepseek-v4-pro | $0.43/MTok | $0.87/MTok | 1M | Reasoning Tools Open |
| Kimi K2 Thinking kimi-thinking | moonshotai/kimi-k2-thinking | $0.47/MTok | $2/MTok | 216.1K | Reasoning Tools |
| Mistral Large 3 mistral-large | mistral/mistral-large-3 | $0.50/MTok | $1.50/MTok | 256K | |
| Magistral Small magistral-small | mistral/magistral-small | $0.50/MTok | $1.50/MTok | 128K | Reasoning Tools Open |
| Gemini 3.1 Flash Image (Nano Banana 2) gemini | google/gemini-3.1-flash-image | $0.50/MTok | $3/MTok | 131.1K | Reasoning |
| Gemini 3 Flash gemini-flash | google/gemini-3-flash | $0.50/MTok | $3/MTok | 1M | Reasoning Tools |
| Gemini 3.1 Flash Image Preview (Nano Banana 2) gemini | google/gemini-3.1-flash-image-preview | $0.50/MTok | $3/MTok | 131.1K | Reasoning |
| GPT-3.5 Turbo gpt | openai/gpt-3.5-turbo | $0.50/MTok | $1.50/MTok | 16.4K | |
| Qwen3 Coder Next qwen | alibaba/qwen3-coder-next | $0.50/MTok | $1.20/MTok | 256K | Reasoning Tools |
| Qwen 3.6 Plus qwen | alibaba/qwen3.6-plus | $0.50/MTok | $3/MTok | 1M | Reasoning Tools |
| Kimi K2 Instruct kimi-k2 | moonshotai/kimi-k2 | $0.57/MTok | $2.30/MTok | 131.1K | Tools |
| Kimi K2.5 kimi-k2 | moonshotai/kimi-k2.5 | $0.60/MTok | $3/MTok | 262.1K | Reasoning Tools Open |
| GPT-Realtime mini gpt | openai/gpt-realtime-mini | $0.60/MTok | $2.40/MTok | — | |
| GLM 4.7 glm | zai/glm-4.7 | $0.60/MTok | $2.20/MTok | 200K | Reasoning Tools |
| GLM 4.5V glm | zai/glm-4.5v | $0.60/MTok | $1.80/MTok | 66K | Reasoning Tools Open |
| GLM 4.5 glm | zai/glm-4.5 | $0.60/MTok | $2.20/MTok | 128K | Reasoning Tools Open |
| GLM 4.6 glm | zai/glm-4.6 | $0.60/MTok | $2.20/MTok | 200K | Reasoning Tools Open |
| Nemotron 3 Ultra nemotron | nvidia/nemotron-3-ultra-550b-a55b | $0.60/MTok | $2.40/MTok | 1M | Reasoning Tools |
| Qwen 3.6 27B qwen3.6 | alibaba/qwen3.6-27b | $0.60/MTok | $3.60/MTok | 256K | Reasoning Tools |
| DeepSeek-V3.1 deepseek | deepseek/deepseek-v3.1 | $0.60/MTok | $1.70/MTok | 128K | Reasoning Tools |
| MiniMax M2.7 High Speed minimax | minimax/minimax-m2.7-highspeed | $0.60/MTok | $2.40/MTok | 204.8K | Reasoning Tools Open |
| MiniMax M2.5 High Speed minimax | minimax/minimax-m2.5-highspeed | $0.60/MTok | $2.40/MTok | 204.8K | Reasoning Tools |
| DeepSeek V3.2 Thinking deepseek-thinking | deepseek/deepseek-v3.2-thinking | $0.62/MTok | $1.85/MTok | 128K | Reasoning Tools |
| Llama 3.2 90B Vision Instruct llama | meta/llama-3.2-90b | $0.72/MTok | $0.72/MTok | 128K | Tools |
| Llama 3.1 70B Instruct llama | meta/llama-3.1-70b | $0.72/MTok | $0.72/MTok | 128K | Tools |
| GPT 5.4 Mini gpt | openai/gpt-5.4-mini | $0.75/MTok | $4.50/MTok | 400K | Reasoning Tools |
| Morph v3 Fast morph | morph/morph-v3-fast | $0.80/MTok | $1.20/MTok | 16K | |
| Claude 3.5 Haiku claude-haiku | anthropic/claude-3.5-haiku | $0.80/MTok | $4/MTok | 200K | Tools |
| Nova Pro nova-pro | amazon/nova-pro | $0.80/MTok | $3.20/MTok | 300K | Tools |
| Morph v3 Large morph | morph/morph-v3-large | $0.90/MTok | $1.90/MTok | 32K | |
| Kimi K2.7 Code kimi-k2 | moonshotai/kimi-k2.7-code | $0.95/MTok | $4/MTok | 256K | Reasoning Tools |
| Kimi K2.6 kimi-k2 | moonshotai/kimi-k2.6 | $0.95/MTok | $4/MTok | 262K | Reasoning Tools Open |
| GLM-5 glm | zai/glm-5 | $0.95/MTok | $3.15/MTok | 202.8K | Reasoning Tools Open |
| Grok Build 0.1 grok-build | xai/grok-build-0.1 | $1/MTok | $2/MTok | 256K | Reasoning Tools |
| MiMo V2 Pro mimo | xiaomi/mimo-v2-pro | $1/MTok | $3/MTok | 1M | Reasoning Tools |
| Claude Haiku 4.5 claude-haiku | anthropic/claude-haiku-4.5 | $1/MTok | $5/MTok | 200K | Reasoning Tools |
| Qwen3 Coder Plus qwen | alibaba/qwen3-coder-plus | $1/MTok | $5/MTok | 1M | Tools Open |
| o3-mini o-mini | openai/o3-mini | $1.10/MTok | $4.40/MTok | 200K | Reasoning Tools |
| o4-mini o-mini | openai/o4-mini | $1.10/MTok | $4.40/MTok | 200K | Reasoning Tools |
| GLM 5V Turbo glm | zai/glm-5v-turbo | $1.20/MTok | $4/MTok | 200K | Reasoning Tools |
| GLM 5 Turbo glm | zai/glm-5-turbo | $1.20/MTok | $4/MTok | 202.8K | Reasoning Tools |
| Qwen3 Max Preview qwen | alibaba/qwen3-max-preview | $1.20/MTok | $6/MTok | 262.1K | Tools |
| Qwen3 Max qwen | alibaba/qwen3-max | $1.20/MTok | $6/MTok | 262.1K | Tools |
| Qwen 3 Max Thinking qwen | alibaba/qwen3-max-thinking | $1.20/MTok | $6/MTok | 256K | Reasoning Tools Open |
| Grok 4.20 Beta Non-Reasoning grok | xai/grok-4.20-non-reasoning-beta | $1.25/MTok | $2.50/MTok | 2M | Tools |
| Grok 4.3 grok | xai/grok-4.3 | $1.25/MTok | $2.50/MTok | 1M | Reasoning Tools |
| Grok 4.20 Multi Agent Beta grok | xai/grok-4.20-multi-agent-beta | $1.25/MTok | $2.50/MTok | 2M | Reasoning Tools |
| Grok 4.20 Reasoning grok | xai/grok-4.20-reasoning | $1.25/MTok | $2.50/MTok | 2M | Reasoning Tools |
| Grok 4.20 Beta Reasoning grok | xai/grok-4.20-reasoning-beta | $1.25/MTok | $2.50/MTok | 2M | Reasoning Tools |
| Grok 4.20 Non-Reasoning grok | xai/grok-4.20-non-reasoning | $1.25/MTok | $2.50/MTok | 2M | Tools |
| Grok 4.20 Multi-Agent grok | xai/grok-4.20-multi-agent | $1.25/MTok | $2.50/MTok | 2M | Reasoning Tools |
| Gemini 2.5 Pro gemini-pro | google/gemini-2.5-pro | $1.25/MTok | $10/MTok | 1.0M | Reasoning Tools |
| GPT-5 Chat gpt | openai/gpt-5-chat | $1.25/MTok | $10/MTok | 128K | Reasoning Tools |
| GPT 5.1 Thinking gpt | openai/gpt-5.1-thinking | $1.25/MTok | $10/MTok | 400K | Reasoning Tools |
| GPT-5.1-Codex gpt | openai/gpt-5.1-codex | $1.25/MTok | $10/MTok | 400K | Reasoning Tools |
| GPT 5.1 Codex Max gpt | openai/gpt-5.1-codex-max | $1.25/MTok | $10/MTok | 400K | Reasoning Tools |
| GPT-4o mini Transcribe o-mini | openai/gpt-4o-mini-transcribe | $1.25/MTok | $5/MTok | — | |
| GPT-5.1 Instant gpt | openai/gpt-5.1-instant | $1.25/MTok | $10/MTok | 128K | Tools |
| GPT-5-Codex gpt-codex | openai/gpt-5-codex | $1.25/MTok | $10/MTok | 400K | Reasoning Tools |
| GPT-5 gpt | openai/gpt-5 | $1.25/MTok | $10/MTok | 400K | Reasoning Tools |
| Qwen 3.7 Max qwen | alibaba/qwen3.7-max | $1.25/MTok | $3.75/MTok | 991K | Reasoning Tools |
| GLM 5.1 glm | zai/glm-5.1 | $1.30/MTok | $4.30/MTok | 202K | Reasoning Tools |
| Qwen 3.6 Max Preview qwen | alibaba/qwen-3.6-max-preview | $1.30/MTok | $7.80/MTok | 240K | Reasoning Tools Open |
| DeepSeek-R1 deepseek-thinking | deepseek/deepseek-r1 | $1.35/MTok | $5.40/MTok | 128K | Reasoning Tools |
| Mistral Medium Latest mistral-medium | mistral/mistral-medium-3.5 | $1.50/MTok | $7.50/MTok | 256K | Reasoning Tools |
| Gemini 3.5 Flash gemini | google/gemini-3.5-flash | $1.50/MTok | $9/MTok | 1M | Reasoning Tools |
| GPT-3.5 Turbo Instruct gpt | openai/gpt-3.5-turbo-instruct | $1.50/MTok | $2/MTok | 8.2K | |
| GLM 5.2 glm | zai/glm-5.2 | $1.50/MTok | $4.50/MTok | 1M | Reasoning Tools Open |
| Interfaze Beta | interfaze/interfaze-beta | $1.50/MTok | $3.50/MTok | 1M | Reasoning |
| Qwen3 Coder 480B A35B Instruct qwen | alibaba/qwen3-coder | $1.50/MTok | $7.50/MTok | 262.1K | Reasoning Tools |
| GPT-5.2 Chat gpt | openai/gpt-5.2-chat | $1.75/MTok | $14/MTok | 128K | Tools |
| GPT-5.3 Chat gpt | openai/gpt-5.3-chat | $1.75/MTok | $14/MTok | 128K | Tools |
| GPT-5.2 gpt | openai/gpt-5.2 | $1.75/MTok | $14/MTok | 400K | Reasoning Tools |
| GPT 5.3 Codex gpt | openai/gpt-5.3-codex | $1.75/MTok | $14/MTok | 400K | Reasoning Tools |
| GPT-5.2-Codex gpt-codex | openai/gpt-5.2-codex | $1.75/MTok | $14/MTok | 400K | Reasoning Tools |
| Kimi K2.7 Code High Speed kimi-k2 | moonshotai/kimi-k2.7-code-highspeed | $1.90/MTok | $8/MTok | 262.1K | Reasoning Tools |
| Pixtral Large (latest) pixtral | mistral/pixtral-large | $2/MTok | $6/MTok | 128K | Tools Open |
| Magistral Medium (latest) magistral-medium | mistral/magistral-medium | $2/MTok | $5/MTok | 128K | Reasoning Tools Open |
| Gemini 3.1 Pro Preview gemini | google/gemini-3.1-pro-preview | $2/MTok | $12/MTok | 1M | Reasoning Tools |
| Gemini 3 Pro Preview gemini-pro | google/gemini-3-pro-preview | $2/MTok | $12/MTok | 1M | Reasoning Tools |
| Nano Banana Pro (Gemini 3 Pro Image) gemini-pro | google/gemini-3-pro-image | $2/MTok | $12/MTok | 65.5K | |
| GPT Image 1 Mini gpt-image | openai/gpt-image-1-mini | $2/MTok | $8/MTok | — | |
| GPT-4.1 gpt | openai/gpt-4.1 | $2/MTok | $8/MTok | 1.0M | Tools |
| o3 o | openai/o3 | $2/MTok | $8/MTok | 200K | Reasoning Tools |
| GPT-4o Transcribe gpt | openai/gpt-4o-transcribe | $2.50/MTok | $10/MTok | — | |
| GPT 5.4 gpt | openai/gpt-5.4 | $2.50/MTok | $15/MTok | 1.1M | Reasoning Tools |
| GPT-4o gpt | openai/gpt-4o | $2.50/MTok | $10/MTok | 128K | Tools |
| Command A command | cohere/command-a | $2.50/MTok | $10/MTok | 256K | Tools |
| GLM 5.2 Fast glm | zai/glm-5.2-fast | $3/MTok | $10.25/MTok | 1M | Reasoning Tools |
| Claude Sonnet 4.6 claude-sonnet | anthropic/claude-sonnet-4.6 | $3/MTok | $15/MTok | 1M | Reasoning Tools |
| Claude Sonnet 4 claude-sonnet | anthropic/claude-sonnet-4 | $3/MTok | $15/MTok | 200K | Reasoning Tools |
| Claude Sonnet 4.5 claude-sonnet | anthropic/claude-sonnet-4.5 | $3/MTok | $15/MTok | 200K | Reasoning Tools |
| GPT-Realtime-1.5 gpt | openai/gpt-realtime-1.5 | $4/MTok | $16/MTok | — | |
| gpt-realtime-2 gpt | openai/gpt-realtime-2 | $4/MTok | $24/MTok | — | |
| GPT Image 1.5 gpt-image | openai/gpt-image-1.5 | $5/MTok | $32/MTok | — | |
| GPT Image 1 gpt-image | openai/gpt-image-1 | $5/MTok | $40/MTok | — | |
| GPT Image 2 gpt-image | openai/gpt-image-2 | $5/MTok | $30/MTok | — | |
| GPT 5.5 gpt | openai/gpt-5.5 | $5/MTok | $30/MTok | 1M | Reasoning Tools |
| Fugu Ultra aura | sakana/fugu-ultra | $5/MTok | $30/MTok | 1M | Reasoning Tools |
| Claude Opus 4.7 claude-opus | anthropic/claude-opus-4.7 | $5/MTok | $25/MTok | 1M | Reasoning Tools |
| Claude Opus 4.8 claude-opus | anthropic/claude-opus-4.8 | $5/MTok | $25/MTok | 1M | Reasoning Tools |
| Claude Opus 4.5 claude-opus | anthropic/claude-opus-4.5 | $5/MTok | $25/MTok | 200K | Reasoning Tools |
| Claude Opus 4.6 claude-opus | anthropic/claude-opus-4.6 | $5/MTok | $25/MTok | 1M | Reasoning Tools |
| o3-deep-research o | openai/o3-deep-research | $10/MTok | $40/MTok | 200K | Reasoning Tools |
| GPT-4 Turbo gpt | openai/gpt-4-turbo | $10/MTok | $30/MTok | 128K | Tools |
| GPT-5 pro gpt | openai/gpt-5-pro | $15/MTok | $120/MTok | 400K | Reasoning Tools |
| o1 o | openai/o1 | $15/MTok | $60/MTok | 200K | Reasoning Tools |
| Claude Opus 4 claude-opus | anthropic/claude-opus-4 | $15/MTok | $75/MTok | 200K | Reasoning Tools |
| Claude Opus 4.1 claude-opus | anthropic/claude-opus-4.1 | $15/MTok | $75/MTok | 200K | Reasoning Tools |
| o3 Pro o-pro | openai/o3-pro | $20/MTok | $80/MTok | 200K | Reasoning Tools |
| GPT 5.2 gpt | openai/gpt-5.2-pro | $21/MTok | $168/MTok | 400K | Reasoning Tools |
| GPT 5.4 Pro gpt | openai/gpt-5.4-pro | $30/MTok | $180/MTok | 1.1M | Reasoning Tools |
| GPT 5.5 Pro gpt | openai/gpt-5.5-pro | $30/MTok | $180/MTok | 1M | Reasoning Tools |
| Grok Imagine Video 1.5 grok | xai/grok-imagine-video-1.5 | —/MTok | —/MTok | — | |
| Grok TTS grok | xai/grok-tts | —/MTok | —/MTok | — | |
| Grok Voice Think Fast 1.0 grok | xai/grok-voice-think-fast-1.0 | —/MTok | —/MTok | — | |
| Grok Imagine grok | xai/grok-imagine-video | —/MTok | —/MTok | — | |
| Grok STT grok | xai/grok-stt | —/MTok | —/MTok | — | |
| Grok Imagine Video 1.5 Preview grok | xai/grok-imagine-video-1.5-preview | —/MTok | —/MTok | — | |
| Grok Imagine Image grok | xai/grok-imagine-image | —/MTok | —/MTok | — | |
| Kling v3.0 Motion Control ling | klingai/kling-v3.0-motion-control | —/MTok | —/MTok | — | |
| Kling v2.6 Image-to-Video ling | klingai/kling-v2.6-i2v | —/MTok | —/MTok | — | |
| Kling v2.5 Turbo Text-to-Video ling | klingai/kling-v2.5-turbo-t2v | —/MTok | —/MTok | — | |
| Kling v3.0 Image-to-Video ling | klingai/kling-v3.0-i2v | —/MTok | —/MTok | — | |
| Kling v2.5 Turbo Image-to-Video ling | klingai/kling-v2.5-turbo-i2v | —/MTok | —/MTok | — | |
| Kling v3.0 Text-to-Video ling | klingai/kling-v3.0-t2v | —/MTok | —/MTok | — | |
| Kling v2.6 Motion Control ling | klingai/kling-v2.6-motion-control | —/MTok | —/MTok | — | |
| Kling v2.6 Text-to-Video ling | klingai/kling-v2.6-t2v | —/MTok | —/MTok | — | |
| voyage-4-lite voyage | voyage/voyage-4-lite | —/MTok | —/MTok | 32K | |
| voyage-law-2 voyage | voyage/voyage-law-2 | —/MTok | —/MTok | 8.2K | |
| voyage-4 voyage | voyage/voyage-4 | —/MTok | —/MTok | 32K | |
| voyage-code-3 voyage | voyage/voyage-code-3 | —/MTok | —/MTok | 8.2K | |
| voyage-4-large voyage | voyage/voyage-4-large | —/MTok | —/MTok | 32K | |
| Voyage Rerank 2.5 voyage | voyage/rerank-2.5 | —/MTok | —/MTok | 32K | |
| Voyage Rerank 2.5 Lite voyage | voyage/rerank-2.5-lite | —/MTok | —/MTok | 32K | |
| voyage-code-2 voyage | voyage/voyage-code-2 | —/MTok | —/MTok | 8.2K | |
| voyage-3.5-lite voyage | voyage/voyage-3.5-lite | —/MTok | —/MTok | 8.2K | |
| voyage-3.5 voyage | voyage/voyage-3.5 | —/MTok | —/MTok | 8.2K | |
| voyage-3-large voyage | voyage/voyage-3-large | —/MTok | —/MTok | 8.2K | |
| voyage-finance-2 voyage | voyage/voyage-finance-2 | —/MTok | —/MTok | 8.2K | |
| Codestral Embed codestral-embed | mistral/codestral-embed | —/MTok | —/MTok | 8.2K | |
| Mistral Embed mistral-embed | mistral/mistral-embed | —/MTok | —/MTok | 8.2K | |
| Gemini Embedding 2 gemini-embedding | google/gemini-embedding-2 | —/MTok | —/MTok | — | |
| Text Multilingual Embedding 002 text-embedding | google/text-multilingual-embedding-002 | —/MTok | —/MTok | 8.2K | |
| Veo 3.1 veo | google/veo-3.1-generate-001 | —/MTok | —/MTok | — | |
| Veo 3.0 veo | google/veo-3.0-generate-001 | —/MTok | —/MTok | — | |
| Veo 3.0 Fast Generate veo | google/veo-3.0-fast-generate-001 | —/MTok | —/MTok | — | |
| Text Embedding 005 text-embedding | google/text-embedding-005 | —/MTok | —/MTok | 8.2K | |
| Gemini Embedding 001 gemini-embedding | google/gemini-embedding-001 | —/MTok | —/MTok | 8.2K | |
| Imagen 4 Fast imagen | google/imagen-4.0-fast-generate-001 | —/MTok | —/MTok | 480 | |
| Imagen 4 Ultra imagen | google/imagen-4.0-ultra-generate-001 | —/MTok | —/MTok | 480 | |
| Veo 3.1 Fast Generate veo | google/veo-3.1-fast-generate-001 | —/MTok | —/MTok | — | |
| Imagen 4 imagen | google/imagen-4.0-generate-001 | —/MTok | —/MTok | 480 | |
| Flux Schnell flux | prodia/flux-fast-schnell | —/MTok | —/MTok | 512 | |
| text-embedding-3-large text-embedding | openai/text-embedding-3-large | —/MTok | —/MTok | 8.2K | |
| text-embedding-ada-002 text-embedding | openai/text-embedding-ada-002 | —/MTok | —/MTok | 8.2K | |
| text-embedding-3-small text-embedding | openai/text-embedding-3-small | —/MTok | —/MTok | 8.2K | |
| TTS-1 o | openai/tts-1 | —/MTok | —/MTok | — | |
| Whisper whisper | openai/whisper-1 | —/MTok | —/MTok | — | |
| TTS-1 HD o | openai/tts-1-hd | —/MTok | —/MTok | — | |
| GLM-4.6V-Flash glm | zai/glm-4.6v-flash | —/MTok | —/MTok | 128K | Reasoning Tools |
| Seedream 5.0 Lite seed | bytedance/seedream-5.0-lite | —/MTok | —/MTok | — | |
| Seedance 2.0 Fast seed | bytedance/seedance-2.0-fast | —/MTok | —/MTok | — | |
| Seedance v1.0 Pro seed | bytedance/seedance-v1.0-pro | —/MTok | —/MTok | — | |
| Seedance 2.0 seed | bytedance/seedance-2.0 | —/MTok | —/MTok | — | |
| Seedance v1.5 Pro seed | bytedance/seedance-v1.5-pro | —/MTok | —/MTok | — | |
| Seedance v1.0 Pro Fast seed | bytedance/seedance-v1.0-pro-fast | —/MTok | —/MTok | — | |
| Seedream 4.0 seed | bytedance/seedream-4.0 | —/MTok | —/MTok | — | |
| Seedream 4.5 seed | bytedance/seedream-4.5 | —/MTok | —/MTok | — | |
| Arrow 1.1 o | quiverai/arrow-1.1 | —/MTok | —/MTok | 131.1K | |
| Cohere Rerank 3.5 o | cohere/rerank-v3.5 | —/MTok | —/MTok | 4.1K | |
| Cohere Rerank 4 Fast o | cohere/rerank-v4-fast | —/MTok | —/MTok | 32K | |
| Embed v4.0 cohere-embed | cohere/embed-v4.0 | —/MTok | —/MTok | 128K | |
| Cohere Rerank 4 Pro o | cohere/rerank-v4-pro | —/MTok | —/MTok | 32K | |
| FLUX.1 Kontext Max flux | bfl/flux-kontext-max | —/MTok | —/MTok | 512 | |
| FLUX.2 [flex] flux | bfl/flux-2-flex | —/MTok | —/MTok | — | |
| FLUX1.1 [pro] Ultra flux | bfl/flux-pro-1.1-ultra | —/MTok | —/MTok | 512 | |
| FLUX.2 [max] flux | bfl/flux-2-max | —/MTok | —/MTok | 67.3K | |
| FLUX1.1 [pro] flux | bfl/flux-pro-1.1 | —/MTok | —/MTok | 512 | |
| FLUX.1 Fill [pro] flux | bfl/flux-pro-1.0-fill | —/MTok | —/MTok | 512 | |
| FLUX.2 [klein] 4B flux | bfl/flux-2-klein-4b | —/MTok | —/MTok | — | |
| FLUX.2 [klein] 9B flux | bfl/flux-2-klein-9b | —/MTok | —/MTok | — | |
| FLUX.1 Kontext Pro flux | bfl/flux-kontext-pro | —/MTok | —/MTok | 512 | |
| FLUX.2 [pro] flux | bfl/flux-2-pro | —/MTok | —/MTok | 67.3K | |
| Recraft V4.1 Pro recraft | recraft/recraft-v4.1-pro | —/MTok | —/MTok | — | |
| Recraft V4.1 recraft | recraft/recraft-v4.1 | —/MTok | —/MTok | — | |
| Recraft V4 recraft | recraft/recraft-v4 | —/MTok | —/MTok | — | |
| Recraft V4 Pro recraft | recraft/recraft-v4-pro | —/MTok | —/MTok | — | |
| Recraft V2 recraft | recraft/recraft-v2 | —/MTok | —/MTok | 512 | |
| Recraft V3 recraft | recraft/recraft-v3 | —/MTok | —/MTok | 512 | |
| Recraft V4.1 Utility Pro recraft | recraft/recraft-v4.1-utility-pro | —/MTok | —/MTok | — | |
| Recraft V4.1 Utility recraft | recraft/recraft-v4.1-utility | —/MTok | —/MTok | — | |
| Sonar Reasoning Pro sonar-reasoning | perplexity/sonar-reasoning-pro | —/MTok | —/MTok | 127K | Reasoning |
| Sonar sonar | perplexity/sonar | —/MTok | —/MTok | 127K | Tools |
| Sonar Pro sonar-pro | perplexity/sonar-pro | —/MTok | —/MTok | 200K | Tools |
| Titan Text Embeddings V2 titan-embed | amazon/titan-embed-text-v2 | —/MTok | —/MTok | 8.2K | |
| Wan v2.6 Reference-to-Video o | alibaba/wan-v2.6-r2v | —/MTok | —/MTok | — | |
| Qwen3 Embedding 0.6B qwen | alibaba/qwen3-embedding-0.6b | —/MTok | —/MTok | 32.8K | |
| Qwen3 Embedding 8B qwen | alibaba/qwen3-embedding-8b | —/MTok | —/MTok | 32.8K | |
| Wan v2.5 Text-to-Video Preview o | alibaba/wan-v2.5-t2v-preview | —/MTok | —/MTok | — | |
| Qwen3 Embedding 4B qwen | alibaba/qwen3-embedding-4b | —/MTok | —/MTok | 32.8K | |
| Wan v2.6 Text-to-Video o | alibaba/wan-v2.6-t2v | —/MTok | —/MTok | — | |
| Wan v2.6 Image-to-Video Flash o | alibaba/wan-v2.6-i2v-flash | —/MTok | —/MTok | — | |
| Wan v2.6 Reference-to-Video Flash o | alibaba/wan-v2.6-r2v-flash | —/MTok | —/MTok | — | |
| Wan v2.6 Image-to-Video o | alibaba/wan-v2.6-i2v | —/MTok | —/MTok | — | |
| LongCat Flash Thinking 2601 longcat | meituan/longcat-flash-thinking-2601 | —/MTok | —/MTok | 32.8K | Reasoning |
| LongCat Flash Chat longcat | meituan/longcat-flash-chat | —/MTok | —/MTok | 128K | Tools |