All Models

Qwen/Qwen3-VL-30B-A3B-Instruct

qwen Tool Calling Attachments Open Weights Structured Output

Qwen3-VL-30B-A3B-Instruct is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Instruct variant optimizes instruction-following for general multimodal tasks. It excels in perception...

Providers 4
Released Oct 5, 2025
Input Modalities text, image, video
Output Modalities text
Tarsk Use coding

Available Providers (4)

Provider Model ID Input Cost Output Cost Context Max Output Docs
Kilo Gateway qwen/qwen3-vl-30b-a3b-instruct $0.13/MTok $0.52/MTok 131.1K 32.8K
NovitaAI qwen/qwen3-vl-30b-a3b-instruct $0.20/MTok $0.70/MTok 131.1K 32.8K
SiliconFlow (China) Qwen/Qwen3-VL-30B-A3B-Instruct $0.29/MTok $1.00/MTok 262K 262K
SiliconFlow Qwen/Qwen3-VL-30B-A3B-Instruct $0.29/MTok $1.00/MTok 262K 262K

Capabilities

Reasoning
Tool Calling
Attachments
Open Weights
Structured Output