27 Models
11 Families
1.0M Max Context
$0.03–$1.30 Input Cost/MTok
$0.14–$3.50 Output Cost/MTok
Capabilities
Reasoning
Tool Calling
Attachments
Open Weights
Setup
Set the following environment variable to use Deep Infra:
DEEPINFRA_API_KEY
Models (27)
| Model | Family | Model ID | Input Cost | Output Cost | Context (tokens) | Capabilities |
|---|---|---|---|---|---|---|
| GPT OSS 20B | gpt-oss | openai/gpt-oss-20b | $0.03/MTok | $0.14/MTok | 131.1K | Reasoning, Tools, Open |
| GPT OSS 120B | gpt-oss | openai/gpt-oss-120b | $0.04/MTok | $0.19/MTok | 131.1K | Reasoning, Tools, Open |
| GLM-4.7-Flash | glm-flash | zai-org/GLM-4.7-Flash | $0.06/MTok | $0.40/MTok | 202.8K | Reasoning, Tools, Open |
| Gemma 4 26B A4B IT | gemma | google/gemma-4-26B-A4B-it | $0.07/MTok | $0.34/MTok | 262.1K | Reasoning, Tools, Open |
| Llama 4 Scout 17B | llama | meta-llama/Llama-4-Scout-17B-16E-Instruct | $0.10/MTok | $0.30/MTok | 327.7K | Tools, Open |
| Llama 3.3 70B Turbo | llama | meta-llama/Llama-3.3-70B-Instruct-Turbo | $0.10/MTok | $0.32/MTok | 131.1K | Tools, Open |
| DeepSeek V4 Flash | deepseek-flash | deepseek-ai/DeepSeek-V4-Flash | $0.10/MTok | $0.20/MTok | 1.0M | Reasoning, Tools, Open |
| Gemma 4 31B IT | gemma | google/gemma-4-31B-it | $0.13/MTok | $0.38/MTok | 262.1K | Reasoning, Tools, Open |
| Qwen 3.5 35B A3B | qwen | Qwen/Qwen3.5-35B-A3B | $0.14/MTok | $1/MTok | 262.1K | Reasoning, Tools, Open |
| Llama 4 Maverick 17B FP8 | llama | meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | $0.15/MTok | $0.60/MTok | 1.0M | Open |
| Qwen3.6 35B A3B | qwen | Qwen/Qwen3.6-35B-A3B | $0.15/MTok | $0.95/MTok | 262.1K | Reasoning, Tools, Open |
| MiniMax M2.5 | minimax | MiniMaxAI/MiniMax-M2.5 | $0.15/MTok | $1.15/MTok | 196.6K | Reasoning, Tools, Open |
| DeepSeek-V3.2 | | deepseek-ai/DeepSeek-V3.2 | $0.26/MTok | $0.38/MTok | 163.8K | Reasoning, Tools |
| Qwen3 Coder 480B A35B Instruct Turbo | qwen | Qwen/Qwen3-Coder-480B-A35B-Instruct-Turbo | $0.30/MTok | $1/MTok | 262.1K | Tools, Open |
| MiMo-V2.5 | mimo | XiaomiMiMo/MiMo-V2.5 | $0.40/MTok | $2/MTok | 262.1K | Reasoning, Tools, Open |
| GLM-4.7 | glm | zai-org/GLM-4.7 | $0.40/MTok | $1.75/MTok | 202.8K | Reasoning, Tools, Open |
| GLM-4.6 | glm | zai-org/GLM-4.6 | $0.43/MTok | $1.74/MTok | 202.8K | Reasoning, Tools, Open |
| Kimi K2.5 | kimi-k2 | moonshotai/Kimi-K2.5 | $0.45/MTok | $2.25/MTok | 262.1K | Reasoning, Tools, Open |
| Qwen 3.5 397B A17B | qwen | Qwen/Qwen3.5-397B-A17B | $0.45/MTok | $3/MTok | 262.1K | Reasoning, Tools, Open |
| DeepSeek-R1-0528 | | deepseek-ai/DeepSeek-R1-0528 | $0.50/MTok | $2.15/MTok | 163.8K | Reasoning, Tools |
| GLM-5 | glm | zai-org/GLM-5 | $0.60/MTok | $2.08/MTok | 202.8K | Reasoning, Tools, Open |
| Kimi K2.7 Code | kimi-k2 | moonshotai/Kimi-K2.7-Code | $0.74/MTok | $3.50/MTok | 262.1K | Reasoning, Tools, Open |
| Kimi K2.6 | kimi-k2 | moonshotai/Kimi-K2.6 | $0.75/MTok | $3.50/MTok | 262.1K | Reasoning, Tools, Open |
| GLM-5.2 | glm | zai-org/GLM-5.2 | $0.95/MTok | $3/MTok | 1.0M | Reasoning, Tools, Open |
| MiMo-V2.5-Pro | mimo | XiaomiMiMo/MiMo-V2.5-Pro | $1/MTok | $3/MTok | 1.0M | Reasoning, Tools, Open |
| GLM-5.1 | glm | zai-org/GLM-5.1 | $1.05/MTok | $3.50/MTok | 202.8K | Reasoning, Tools, Open |
| DeepSeek V4 Pro | deepseek-thinking | deepseek-ai/DeepSeek-V4-Pro | $1.30/MTok | $2.60/MTok | 1.0M | Reasoning, Tools, Open |
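
The per-MTok prices in the table can be turned into a rough per-request estimate. The sketch below is illustrative only: it assumes cost is simply tokens times the listed rate, and the names `PRICES`, `estimate_cost`, and `get_api_key` are hypothetical helpers, not part of any Deep Infra SDK. Only three rows are copied from the table; it makes no network calls and only checks that the `DEEPINFRA_API_KEY` variable from the Setup section is present.

```python
import os

# USD per million tokens (input, output), copied from three rows of the table.
# Hypothetical lookup for illustration; extend it with the rows you need.
PRICES = {
    "openai/gpt-oss-20b": (0.03, 0.14),
    "zai-org/GLM-5.2": (0.95, 3.00),
    "deepseek-ai/DeepSeek-V4-Pro": (1.30, 2.60),
}


def estimate_cost(model_id: str, input_tokens: int, output_tokens: int) -> float:
    """Return an estimated cost in USD, assuming linear per-token pricing."""
    input_price, output_price = PRICES[model_id]
    total = input_tokens * input_price + output_tokens * output_price
    return total / 1_000_000


def get_api_key() -> str:
    """Read DEEPINFRA_API_KEY from the environment, failing loudly if unset."""
    key = os.environ.get("DEEPINFRA_API_KEY")
    if not key:
        raise RuntimeError("DEEPINFRA_API_KEY is not set")
    return key


if __name__ == "__main__":
    cost = estimate_cost("deepseek-ai/DeepSeek-V4-Pro", 200_000, 4_000)
    print(f"Estimated cost: ${cost:.4f}")
```

Actual billing may differ from this estimate (for example through caching or rounding), so treat the result as a rough guide rather than a quote.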