NVIDIA
Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) is a Meta Llama 3 family model, part of Meta's widely deployed open-weight LLM series for assistants, retrieval, coding, and self-hosted applications. The profile helps compare this specific size and instruction variant with newer open and proprietary models in the local snapshot.
Introducing Meta Llama 3Rank #324 across 526
Rank #300 across 436
Rank #200 across 357
Percentile score by analysis domain.
* Cost is inverted: lower input, output, and blended prices rank higher.
Higher bars mean stronger relative placement.
| Metric | Domain | Value | Rank |
|---|---|---|---|
| Artificial Analysis Intelligence Index | overall | 15.0 | #324 |
| Artificial Analysis Coding Index | coding | 13.1 | #300 |
| Artificial Analysis Math Index | math | 63.7 | #109 |
| MMLU-Pro | reasoning | 82.5% | #67 |
| reasoning |
| 72.8% |
| #193 |
| Humanity's Last Exam | reasoning | 8.1% | #203 |
| LiveCodeBench | coding | 64.1% | #102 |
| SciCode | coding, reasoning | 34.7% | #213 |
| MATH-500 | math | 95.2% | #41 |
| AIME | math | 74.7% | #32 |
| Output Speed | speed | 52.7 tok/s | #232 |
| Time to First Token | speed | 0.72s | #88 |
| Blended Price | cost | $0.900/M | #200 |
| Input Price | cost | $0.600/M | #216 |
| Output Price | cost | $1.80/M | #172 |
| Value Index | cost, overall | 16.7 | #228 |