NVIDIA
NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and tasks by first generating a reasoning trace and then concluding with a final response. The model's reasoning capabilities can be controlled via a system prompt. If the user prefers the model to provide its final answer without intermediate reasoning traces, it can be configured to do so.\
NVIDIA model catalogRank #366 across 526
Rank #365 across 436
Rank #18 across 357
Percentile score by analysis domain.
* Cost is inverted: lower input, output, and blended prices rank higher.
Higher bars mean stronger relative placement.
| Metric | Domain | Value | Rank |
|---|---|---|---|
| Artificial Analysis Intelligence Index | overall | 13.2 | #366 |
| Artificial Analysis Coding Index | coding | 7.5 | #365 |
| Artificial Analysis Math Index | math | 62.3 | #114 |
| MMLU-Pro | reasoning | 73.9% | #195 |
| reasoning |
| 55.7% |
| #328 |
| Humanity's Last Exam | reasoning | 4.0% | #439 |
| LiveCodeBench | coding | 70.1% | #69 |
| SciCode | coding, reasoning | 20.9% | #379 |
| Output Speed | speed | 133.6 tok/s | #93 |
| Time to First Token | speed | 0.72s | #90 |
| Blended Price | cost | $0.086/M | #18 |
| Input Price | cost | $0.050/M | #24 |
| Output Price | cost | $0.195/M | #24 |
| Value Index | cost, overall | 153.5 | #31 |