Easy Benchmarks: LLM model index

NVIDIA

Llama 3.1 Nemotron Instruct 70B

Llama 3.1 Nemotron Instruct 70B is NVIDIA's instruction-tuned variant of Meta's Llama 3.1 70B, part of the widely deployed Llama 3 family of open-weight LLMs used for assistants, retrieval, coding, and self-hosted applications. This profile compares this 70B instruction variant with newer open and proprietary models in the local snapshot.

Introducing Meta Llama 3

Operational Metrics

Output Speed: 36.4 tok/s
First Token: 0.42s
Blended Price: $1.20/M
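
As a rough illustration of how these three figures combine, the sketch below estimates the response time and cost of a single request. It assumes a simple linear model (time to first token, plus output tokens divided by output speed) and treats the blended price as a single per-token rate. The function names and the token counts in the usage note are hypothetical and do not come from the upstream data.

```python
# Figures copied from the Operational Metrics above.
OUTPUT_SPEED_TOK_PER_S = 36.4
FIRST_TOKEN_S = 0.42
BLENDED_PRICE_USD_PER_M = 1.20


def estimate_response_seconds(output_tokens: int) -> float:
    """Assumed linear model: wait for the first token, then stream at constant speed."""
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return FIRST_TOKEN_S + output_tokens / OUTPUT_SPEED_TOK_PER_S


def estimate_cost_usd(total_tokens: int) -> float:
    """Assumption: every token (input or output) is billed at the blended rate."""
    if total_tokens < 0:
        raise ValueError("total_tokens must be non-negative")
    return total_tokens / 1_000_000 * BLENDED_PRICE_USD_PER_M


if __name__ == "__main__":
    print(f"{estimate_response_seconds(500):.1f} s for a 500-token reply")
    print(f"${estimate_cost_usd(2_000):.4f} for 2,000 total tokens")
```

Under these assumptions a 500-token reply takes roughly 14 seconds, almost all of it streaming time, which is why the profile lists first-token latency as a strength and output speed as a watch area.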

Model Metadata

Queryable facts extracted from the upstream model payload.

Release: Oct 15, 2024
Context Window: n/a
Modalities: n/a
API fields: release_date

Strength: TTFT

Rank #22 across 293 models.

0.42s

Watch Area: Speed

Rank #260 across 293 models.

36.4 tok/s

Watch Area: Math

Rank #230 across 269 models.

11.0

Watch Area: LCB

Rank #283 across 343 models.

16.9%

Strength Profile

Percentile score by analysis domain.

* Cost is inverted: lower input, output, and blended prices rank higher.
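
The page does not state how ranks become percentile scores, so the following is only a sketch under an assumed linear mapping (rank 1 maps to 100, last place maps to 0). It also shows one way the cost inversion could work: ranking ascending so that lower prices come first. Both function names and the three-model price table in the usage note are hypothetical.

```python
def rank_to_percentile(rank: int, total: int) -> float:
    """Assumed linear mapping: rank 1 -> 100.0, rank == total -> 0.0."""
    if total < 2 or not 1 <= rank <= total:
        raise ValueError("need total >= 2 and 1 <= rank <= total")
    return 100.0 * (total - rank) / (total - 1)


def rank_values(values: dict[str, float], lower_is_better: bool = False) -> dict[str, int]:
    """Assign rank 1 to the best value; cost-style metrics pass lower_is_better=True."""
    ordered = sorted(values, key=lambda name: values[name], reverse=not lower_is_better)
    return {name: position + 1 for position, name in enumerate(ordered)}


if __name__ == "__main__":
    # Ranks taken from the cards above: TTFT #22 of 293, Speed #260 of 293.
    print(round(rank_to_percentile(22, 293), 1))
    print(round(rank_to_percentile(260, 293), 1))

    # Hypothetical prices (USD per million tokens) to illustrate the inversion.
    prices = {"model_a": 0.50, "model_b": 1.20, "model_c": 3.00}
    print(rank_values(prices, lower_is_better=True))
```

With this mapping, the TTFT rank lands near the 93rd percentile and the output speed rank near the 11th, consistent with the strength and watch-area labels on the cards.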

Benchmark Percentiles

Higher bars mean stronger relative placement.

All Benchmarks

| Metric | Domain | Value | Rank |
| --- | --- | --- | --- |
| Artificial Analysis Intelligence Index | overall | 13.4 | #355 |
| Artificial Analysis Coding Index | coding | 10.8 | #306 |
| Artificial Analysis Math Index | math | 11.0 | #230 |
| MMLU-Pro | reasoning | 69.0% | #231 |
| GPQA | reasoning | 46.5% | #359 |
| Humanity's Last Exam | reasoning | 4.6% | #344 |
| LiveCodeBench | coding | 16.9% | #283 |
| SciCode | coding, reasoning | 23.3% | #335 |
| MATH-500 | math | 73.3% | #133 |
| AIME | math | 24.7% | #94 |
| Output Speed | speed | 36.4 tok/s | #260 |
| Time to First Token | speed | 0.42s | #22 |
| Blended Price | cost | $1.20/M | #189 |
| Input Price | cost | $1.20/M | #225 |
| Output Price | cost | $1.20/M | #117 |
| Value Index | cost, overall | 11.2 | #235 |