All Models
Qwen/Qwen3-VL-30B-A3B-Instruct
Qwen3-VL-30B-A3B-Instruct is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Instruct variant optimizes instruction-following for general multimodal tasks. It excels in perception...
Available Providers (4)
| Provider | Model ID | Input Cost | Output Cost | Context | Max Output | Docs |
|---|---|---|---|---|---|---|
| | qwen/qwen3-vl-30b-a3b-instruct | $0.13/MTok | $0.52/MTok | 131.1K | 32.8K | |
| | qwen/qwen3-vl-30b-a3b-instruct | $0.20/MTok | $0.70/MTok | 131.1K | 32.8K | |
| | Qwen/Qwen3-VL-30B-A3B-Instruct | $0.29/MTok | $1.00/MTok | 262K | 262K | |
| | Qwen/Qwen3-VL-30B-A3B-Instruct | $0.29/MTok | $1.00/MTok | 262K | 262K |
Capabilities
Reasoning
Tool Calling
Attachments
Open Weights
Structured Output