qwen/qwen3-vl-30b-a3b-instruct

Tool Calling Attachments Open Weights Structured Output

Qwen3-VL-30B-A3B-Instruct is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Instruct variant optimizes instruction-following for general multimodal tasks. It excels in perception...

Providers 3

Released Oct 5, 2025

Input Modalities text, video, image

Output Modalities text

Tarsk Use coding

Benchmarks

Intelligence Index

72.3

Math Index

Available Providers (3)

Provider	Model ID	Input Cost	Output Cost	Context	Max Output
NovitaAI	`qwen/qwen3-vl-30b-a3b-instruct`	$0.20/MTok	$0.70/MTok	131.1K	32.8K
SiliconFlow (China)	`Qwen/Qwen3-VL-30B-A3B-Instruct`	$0.29/MTok	$1/MTok	262K	262K
SiliconFlow	`Qwen/Qwen3-VL-30B-A3B-Instruct`	$0.29/MTok	$1/MTok	262K	262K

Capabilities

Reasoning

Tool Calling

Attachments

Open Weights

Structured Output