EBEasy BenchmarksLLM model index
Workspace
Overview
Benchmarks
Benchmarks list
Overall Index
Coding
Math
MMLU-Pro
Speed
Value
Models
All models
GPT-5.5 (xhigh)
GPT-5.5 (high)
Claude Opus 4.7 (Adaptive Reasoning, Max Effort)
Gemini 3.1 Pro Preview
GPT-5.4 (xhigh)
Artificial Analysis data
Back

NVIDIA

Llama 3.1 Nemotron Ultra 253B v1 (Reasoning)

Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) is a Meta Llama 3 family model, part of Meta's widely deployed open-weight LLM series for assistants, retrieval, coding, and self-hosted applications. The profile helps compare this specific size and instruction variant with newer open and proprietary models in the local snapshot.

Introducing Meta Llama 3

Operational Metrics

Output Speed41 tok/s
First Token0.71s
Blended Price$0.900/M

Model Metadata

Queryable facts extracted from the upstream model payload.

ReleaseApr 7, 2025
Context Windown/a
Modalitiesn/a
API fields: release_date

Strength: AIME

Rank #32 across 194 models.

74.7%

Strength: MMLU-Pro

Rank #67 across 345 models.

82.5%

Strength: MATH-500

Rank #41 across 201 models.

95.2%

Watch Area: Speed

Rank #252 across 293 models.

41 tok/s

Watch Area: Coding

Rank #278 across 410 models.

13.1

Strength Profile

Percentile score by analysis domain.

* Cost is inverted: lower input, output, and blended prices rank higher.

Benchmark Percentiles

Higher bars mean stronger relative placement.

All Benchmarks

MetricDomainValueRank
Artificial Analysis Intelligence Indexoverall15.0#317
Artificial Analysis Coding Indexcoding13.1#278
Artificial Analysis Math Indexmath63.7#109
MMLU-Proreasoning82.5%#67
GPQA
reasoning
72.8%
#174
Humanity's Last Examreasoning8.1%#186
LiveCodeBenchcoding64.1%#102
SciCodecoding, reasoning34.7%#194
MATH-500math95.2%#41
AIMEmath74.7%#32
Output Speedspeed41 tok/s#252
Time to First Tokenspeed0.71s#92
Blended Pricecost$0.900/M#170
Input Pricecost$0.600/M#188
Output Pricecost$1.80/M#146
Value Indexcost, overall16.7#204