Easy Benchmarks: LLM model index

Nous Research

Hermes 4 - Llama-3.1 70B (Non-reasoning)

Hermes 4 - Llama-3.1 70B (Non-reasoning) is a Nous Research language-model profile in the Easy Benchmarks snapshot. Use this page to compare its measured Artificial Analysis scores, output speed, time to first token, pricing, and relative ranking against other models in the local catalog.

Nous Research model releases

Operational Metrics

Output Speed: 83.5 tok/s
First Token: 0.59s
Blended Price: $0.198/M
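As a rough illustration of how the speed figures combine, the sketch below estimates the wall-clock time for one response from the listed time to first token and output speed. The function name and the simple additive model (wait for the first token, then stream at a constant rate) are assumptions made for illustration, not something the page specifies.

```python
def estimated_response_seconds(output_tokens: int,
                               ttft_s: float = 0.59,
                               tokens_per_s: float = 83.5) -> float:
    """Rough estimate: time to first token plus streaming time.

    Defaults are the values listed on this page. The additive model is a
    simplifying assumption; real latency varies with load and prompt length.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    if tokens_per_s <= 0:
        raise ValueError("tokens_per_s must be positive")
    return ttft_s + output_tokens / tokens_per_s


# A 500-token answer: 0.59 + 500 / 83.5, about 6.58 seconds.
print(f"{estimated_response_seconds(500):.2f} s")
```

Under these assumptions, the streaming term dominates for anything beyond a few dozen tokens, so output speed matters more than first-token latency for long answers.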

Model Metadata

Queryable facts extracted from the upstream model payload.

Release: Aug 27, 2025
Context Window: n/a
Modalities: n/a
API fields: release_date

Strength: Output $

Rank #57 across 325 models.

$0.400/M

Strength: Blended $

Rank #58 across 325 models.

$0.198/M

Strength: Input $

Rank #58 across 325 models.

$0.130/M
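The three prices above are consistent with a blended price computed as a 3:1 weighted average of input and output prices. The page does not state the weighting, so the 3:1 ratio in this sketch is an assumption that happens to reproduce the listed figure; the function name is hypothetical.

```python
def blended_price(input_per_m: float, output_per_m: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average of input and output price per million tokens.

    The 3:1 input:output weighting is an assumption, not stated on the page.
    """
    total_weight = input_weight + output_weight
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive number")
    return (input_weight * input_per_m + output_weight * output_per_m) / total_weight


# Listed prices: $0.130/M input, $0.400/M output.
# (3 * 0.130 + 1 * 0.400) / 4 = 0.1975, which matches the listed $0.198/M after rounding.
print(f"${blended_price(0.130, 0.400):.4f}/M")
```

A workload with a different input-to-output mix can be priced by passing other weights, for example `blended_price(0.130, 0.400, 1, 1)` for equal volumes.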

Watch Area: HLE

Rank #448 across 474 models.

3.6%

Watch Area: Math

Rank #229 across 269 models.

11.3

Watch Area: Coding

Rank #328 across 410 models.

9.2

Strength Profile

Percentile score by analysis domain.

* Cost is inverted: lower input, output, and blended prices rank higher.

Benchmark Percentiles

Higher bars mean stronger relative placement.
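The page does not give the formula behind these percentiles. The sketch below shows one plausible way to turn a rank into a percentile and to rank an inverted metric such as price. Both function names and the linear rank-to-percentile mapping are assumptions for illustration, and the price list in the last example is made up.

```python
def rank_to_percentile(rank: int, total: int) -> float:
    """Map rank 1..total to 100..0 linearly (assumed formula, not the site's)."""
    if not 1 <= rank <= total:
        raise ValueError("rank must be between 1 and total")
    if total == 1:
        return 100.0
    return 100.0 * (total - rank) / (total - 1)


def rank_of(value: float, values: list[float], lower_is_better: bool = False) -> int:
    """Rank = 1 + number of entries strictly better than `value`."""
    if lower_is_better:
        better = sum(1 for v in values if v < value)
    else:
        better = sum(1 for v in values if v > value)
    return 1 + better


# Output price: rank #57 across 325 models gives roughly the 82.7th percentile.
print(f"{rank_to_percentile(57, 325):.1f}")
# Humanity's Last Exam: rank #448 across 474 models gives roughly the 5.5th percentile.
print(f"{rank_to_percentile(448, 474):.1f}")

# Cost is inverted: with hypothetical prices, the second-cheapest model ranks #2.
hypothetical_prices = [0.10, 0.198, 0.50, 1.20]
print(rank_of(0.198, hypothetical_prices, lower_is_better=True))
```

The same `rank_of` helper covers time to first token, which is also a lower-is-better metric.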

All Benchmarks

| Metric | Domain | Value | Rank |
| --- | --- | --- | --- |
| Artificial Analysis Intelligence Index | overall | 12.6 | #372 |
| Artificial Analysis Coding Index | coding | 9.2 | #328 |
| Artificial Analysis Math Index | math | 11.3 | #229 |
| MMLU-Pro | reasoning | 66.4% | #246 |
| GPQA | reasoning | 49.1% | #344 |
| Humanity's Last Exam | reasoning | 3.6% | #448 |
| LiveCodeBench | coding | 26.9% | #244 |
| SciCode | coding, reasoning | 27.7% | #282 |
| Output Speed | speed | 83.5 tok/s | #154 |
| Time to First Token | speed | 0.59s | #63 |
| Blended Price | cost | $0.198/M | #58 |
| Input Price | cost | $0.130/M | #58 |
| Output Price | cost | $0.400/M | #57 |
| Value Index | cost, overall | 63.6 | #84 |