GPT Audio

Tool Calling, Attachments, Structured Output

The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder that produces more natural-sounding voices and keeps each voice more consistent. Audio is priced...

Providers: 2
Released: Jan 19, 2026
Input Modalities: audio, text
Output Modalities: audio, text
Tarsk Use: coding

Available Providers (2)

Provider      Model ID          Input Cost   Output Cost  Context  Max Output
Kilo Gateway  openai/gpt-audio  $2.50/MTok   $10/MTok     128K     16.4K
OpenRouter    openai/gpt-audio  $2.50/MTok   $10/MTok     128K     16.4K
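
The listed rates translate into a per-request cost as follows. This is a minimal sketch, assuming a request is billed purely at the per-MTok input and output prices shown above; the function name `estimate_cost_usd` is hypothetical, and audio tokens may be billed at different rates that the truncated description does not state.

```python
# Listed prices from the provider table, in USD per million tokens (MTok).
# Assumption: all tokens are billed at these rates; audio-specific pricing
# is not given in the source and is not modeled here.
INPUT_USD_PER_MTOK = 2.50
OUTPUT_USD_PER_MTOK = 10.00


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request at the listed per-MTok rates."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 20,000-token prompt with a 2,000-token reply.
    print(f"${estimate_cost_usd(20_000, 2_000):.4f}")  # prints $0.0700
```

Both providers list the same rates, so the estimate applies to either one under the assumption above.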

Capabilities

Reasoning: No
Tool Calling: Yes
Attachments: Yes
Open Weights: No
Structured Output: Yes