NVIDIA: Llama 3.1 Nemotron Ultra 253B v1

Reasoning, Open Weights

Llama-3.1-Nemotron-Ultra-253B-v1 is a large language model (LLM) optimized for advanced reasoning, human-interactive chat, retrieval-augmented generation (RAG), and tool-calling tasks. Derived from Meta’s Llama-3.1-405B-Instruct, it has been significantly customized using Neural...

Providers: 1
Released: Apr 8, 2025
Input Modalities: text
Output Modalities: text

Available Providers (1)

Provider | Model ID | Input Cost | Output Cost | Context | Max Output
Kilo Gateway | nvidia/llama-3.1-nemotron-ultra-253b-v1 | $0.60/MTok | $1.80/MTok | 131.1K | 131.1K
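
The listed prices are per million tokens (MTok), so a per-request estimate is a simple weighted sum. The sketch below is illustrative only: the function and constant names are hypothetical, the rates are copied from the Kilo Gateway row above, and it assumes billing is strictly linear in token count with no minimums, caching discounts, or other adjustments.

```python
# Rates copied from the provider row above (USD per million tokens).
INPUT_USD_PER_MTOK = 0.60
OUTPUT_USD_PER_MTOK = 1.80

TOKENS_PER_MTOK = 1_000_000


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request, assuming purely linear per-token billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * INPUT_USD_PER_MTOK
        + output_tokens * OUTPUT_USD_PER_MTOK
    )
    return total / TOKENS_PER_MTOK


if __name__ == "__main__":
    # Example: a 10,000-token prompt with a 2,000-token completion.
    print(f"${estimate_cost_usd(10_000, 2_000):.4f}")
```

Because output tokens cost three times as much as input tokens here, long completions (for example, extended reasoning traces) tend to dominate the bill even when prompts are large.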

Capabilities

Reasoning
Tool Calling
Attachments
Open Weights
Structured Output