All Models

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

Reasoning Tool Calling

Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context. It’s post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and...

Providers 1
Released Mar 16, 2025
Input Modalities text
Output Modalities text
Tarsk Use coding

Available Providers (1)

Provider Model ID Input Cost Output Cost Context Max Output Docs
Kilo Gateway nvidia/llama-3.3-nemotron-super-49b-v1.5 $0.10/MTok $0.40/MTok 131.1K 26.2K

Capabilities

Reasoning
Tool Calling
Attachments
Open Weights
Structured Output