NVIDIA

NVIDIA Nemotron Nano 9B V2 (Non-reasoning)

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and tasks by first generating a reasoning trace and then concluding with a final response. The model's reasoning capabilities can be controlled via a system prompt. If the user prefers the model to provide its final answer without intermediate reasoning traces, it can be configured to do so.\

NVIDIA model catalog

TypeLanguage

ReleaseAug 18, 2025

Context128,000

Tagsreasoning, tool-use

7.4

n/a

$0.086/M

* Cost is inverted: lower input, output, and blended prices rank higher.

Model Details

Runtime

Output Speed176.5 tok/s

First Token1.17s

Max Output131,072

Pricing

Blended$0.086/M

Input$0.050/M

All Benchmarks

Metric	Domain	Value	Rank
Artificial Analysis Intelligence Index	overall	7.4	#461
Artificial Analysis Math Index	math	62.3	#114
MMLU-Pro	reasoning	73.9%	#171
GPQA	reasoning	55.7%	#378
Humanity's Last Exam

NVIDIA Nemotron Nano 9B V2 (Non-reasoning)

Model Snapshot

Overall

Coding

Blended Price

Strength Profile

Benchmark Percentiles

Model Details

Runtime

Pricing

All Benchmarks

Metadata