NVIDIA

Llama 3.1 Nemotron Ultra 253B v1 (Reasoning)

Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) is an NVIDIA model in the Meta Llama 3 family, Meta's widely deployed open-weight LLM series for assistants, retrieval, coding, and self-hosted applications. This profile compares this specific size and reasoning variant with newer open and proprietary models in the local snapshot.


Release: Apr 7, 2025

Overall: 9.1

Coding: n/a

Blended Price: $0.900/M

* Cost is inverted: lower input, output, and blended prices rank higher.

Model Details

Runtime

Output Speed: 52.1 tok/s

First Token: 0.69s

Max Output: n/a
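
As a rough illustration of how the two runtime figures combine, the sketch below estimates end-to-end response time as time to first token plus steady-state generation time. The function name and the simple additive model are assumptions made for illustration, not part of the profile; real latency varies with provider, load, and prompt length.

```python
def estimated_response_seconds(output_tokens: int,
                               first_token_s: float = 0.69,
                               tokens_per_s: float = 52.1) -> float:
    """Approximate total time to receive a response.

    Assumes (hypothetically) a constant output speed after the first token;
    the defaults are the runtime figures listed in this profile.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return first_token_s + output_tokens / tokens_per_s


# A 1,000-token answer would take roughly 19.9 seconds under this model.
print(round(estimated_response_seconds(1000), 1))
```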

Pricing

Blended: $0.900/M
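
To make the per-million-token price concrete, the sketch below converts a token count into an approximate dollar cost. It assumes the blended rate can be applied uniformly to the combined input and output tokens; the profile does not state how input and output prices are weighted, so treat the result as a ballpark figure. The function name is hypothetical.

```python
def estimated_cost_usd(total_tokens: int,
                       blended_usd_per_million: float = 0.900) -> float:
    """Approximate cost of processing `total_tokens` tokens.

    Assumption: the blended price applies equally to every token,
    regardless of the input/output split.
    """
    if total_tokens < 0:
        raise ValueError("total_tokens must be non-negative")
    return total_tokens / 1_000_000 * blended_usd_per_million


# 250,000 tokens at $0.900 per million tokens is about $0.225.
print(f"${estimated_cost_usd(250_000):.3f}")
```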

All Benchmarks

Metric	Domain	Value	Rank
Artificial Analysis Intelligence Index	overall	9.1	#412
Artificial Analysis Math Index	math	63.7	#109
MMLU-Pro	reasoning	82.5%	#62
GPQA	reasoning	72.8%	#239
Humanity's Last Exam
