NVIDIA

NVIDIA Nemotron Nano 12B v2 VL (Non-reasoning)

NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model designed for video understanding and document intelligence. It introduces a hybrid Transformer-Mamba architecture, combining transformer-level accuracy with Mamba’s...

NVIDIA model catalog

ReleaseOct 28, 2025

Context128,000

4.6

n/a

$0.300/M

* Cost is inverted: lower input, output, and blended prices rank higher.

Model Details

Runtime

Output Speed156.1 tok/s

First Token2.32s

Max Output128,000

Pricing

Blended$0.300/M

Input$0.200/M

All Benchmarks

Metric	Domain	Value	Rank
Artificial Analysis Intelligence Index	overall	4.6	#520
Artificial Analysis Math Index	math	26.7	#193
MMLU-Pro	reasoning	64.9%	#223
GPQA	reasoning	43.9%	#439
Humanity's Last Exam

NVIDIA Nemotron Nano 12B v2 VL (Non-reasoning)

Model Snapshot

Overall

Coding

Blended Price

Strength Profile

Benchmark Percentiles

Model Details

Runtime

Pricing

All Benchmarks

Metadata