NVIDIA

Llama 3.1 Nemotron Instruct 70B

Llama 3.1 Nemotron Instruct 70B is a Meta Llama 3 family model, part of Meta's widely deployed open-weight LLM series for assistants, retrieval, coding, and self-hosted applications. The profile helps compare this specific size and instruction variant with newer open and proprietary models in the local snapshot.

Introducing Meta Llama 3

ReleaseOct 15, 2024

7.6

n/a

$1.20/M

* Cost is inverted: lower input, output, and blended prices rank higher.

Model Details

Runtime

Output Speed78.3 tok/s

First Token2.86s

Max Outputn/a

Pricing

Blended$1.20/M

All Benchmarks

Metric	Domain	Value	Rank
Artificial Analysis Intelligence Index	overall	7.6	#455
Artificial Analysis Math Index	math	11.0	#230
MMLU-Pro	reasoning	69.0%	#204
GPQA	reasoning	46.5%	#432
Humanity's Last Exam

Llama 3.1 Nemotron Instruct 70B

Model Snapshot

Overall

Coding

Blended Price

Strength Profile

Benchmark Percentiles

Model Details

Runtime

Pricing

All Benchmarks

Metadata