NVIDIA
Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) is a Meta Llama 3 family model, part of Meta's widely deployed open-weight LLM series for assistants, retrieval, coding, and self-hosted applications. The profile helps compare this specific size and instruction variant with newer open and proprietary models in the local snapshot.
Introducing Meta Llama 3Queryable facts extracted from the upstream model payload.
Rank #32 across 194 models.
Rank #67 across 345 models.
Rank #41 across 201 models.
Rank #252 across 293 models.
Rank #278 across 410 models.
Percentile score by analysis domain.
* Cost is inverted: lower input, output, and blended prices rank higher.
Higher bars mean stronger relative placement.
| Metric | Domain | Value | Rank |
|---|---|---|---|
| Artificial Analysis Intelligence Index | overall | 15.0 | #317 |
| Artificial Analysis Coding Index | coding | 13.1 | #278 |
| Artificial Analysis Math Index | math | 63.7 | #109 |
| MMLU-Pro | reasoning | 82.5% | #67 |
| reasoning |
| 72.8% |
| #174 |
| Humanity's Last Exam | reasoning | 8.1% | #186 |
| LiveCodeBench | coding | 64.1% | #102 |
| SciCode | coding, reasoning | 34.7% | #194 |
| MATH-500 | math | 95.2% | #41 |
| AIME | math | 74.7% | #32 |
| Output Speed | speed | 41 tok/s | #252 |
| Time to First Token | speed | 0.71s | #92 |
| Blended Price | cost | $0.900/M | #170 |
| Input Price | cost | $0.600/M | #188 |
| Output Price | cost | $1.80/M | #146 |
| Value Index | cost, overall | 16.7 | #204 |