
Nous Research

Hermes 4 - Llama-3.1 405B (Reasoning)

Hermes 4 - Llama-3.1 405B (Reasoning) is a Nous Research language-model profile in the Easy Benchmarks snapshot. Use this page to compare its measured Artificial Analysis scores, output speed, time to first token, pricing, and relative ranking against other models in the local catalog.

Nous Research model releases

Operational Metrics

Output Speed: 34.9 tok/s
First Token: 0.89s
Blended Price: $1.50/M
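
The blended price shown here is consistent with a 3:1 weighting of input to output tokens applied to the $1.00/M input and $3.00/M output prices listed under All Benchmarks. The page does not state the formula, so the weighting and the helper name `blended_price` are assumptions. A minimal sketch:

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per million tokens.

    The 3:1 input:output default is an assumption; it is not stated on this page.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# Prices per million tokens, taken from the All Benchmarks table.
price = blended_price(1.00, 3.00)
print(f"${price:.2f}/M")
```

Under that assumed weighting, the result matches the $1.50/M figure reported above.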

Model Metadata

Queryable facts extracted from the upstream model payload.

Release: Aug 27, 2025
Context Window: n/a
Modalities: n/a
API fields: release_date

Strength: MMLU-Pro

Rank #62 across 345 models.

82.9%

Strength: LCB

Rank #80 across 343 models.

68.6%

Watch Area: Speed

Rank #265 across 293 models.

34.9 tok/s

Watch Area: Value

Rank #225 across 323 models.

12.4

Watch Area: Input $

Rank #218 across 325 models.

$1.00/M

Strength Profile

Percentile score by analysis domain.

* Cost is inverted: lower input, output, and blended prices rank higher.

Benchmark Percentiles

Higher bars mean stronger relative placement.
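
The page does not say how a rank is turned into a percentile bar. One plausible mapping, shown only as an illustration, scales rank linearly so that rank #1 maps to 100 and the last rank maps to 0. The function name `rank_to_percentile` and the formula itself are assumptions, not the site's documented method. Because ranks for cost metrics already place lower prices higher, no extra inversion is needed at this step.

```python
def rank_to_percentile(rank: int, total: int) -> float:
    """Map a 1-based rank to a 0-100 percentile (hypothetical linear mapping)."""
    if total < 1 or not 1 <= rank <= total:
        raise ValueError("rank must satisfy 1 <= rank <= total")
    if total == 1:
        return 100.0
    return 100.0 * (total - rank) / (total - 1)


# Rank and model counts taken from the cards on this page.
for label, rank, total in [("MMLU-Pro", 62, 345), ("Speed", 265, 293)]:
    print(f"{label}: {rank_to_percentile(rank, total):.1f}")
```

With this mapping, a strength such as MMLU-Pro (rank #62 of 345) lands in the upper part of the scale, while a watch area such as Speed (rank #265 of 293) lands near the bottom.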

All Benchmarks

Metric                                   Domain              Value        Rank
Artificial Analysis Intelligence Index   overall             18.6         #259
Artificial Analysis Coding Index         coding              16.0         #230
Artificial Analysis Math Index           math                69.7         #96
MMLU-Pro                                 reasoning           82.9%        #62
GPQA                                     reasoning           72.7%        #177
Humanity's Last Exam                     reasoning           10.3%        #146
LiveCodeBench                            coding              68.6%        #80
SciCode                                  coding, reasoning   25.2%        #315
Output Speed                             speed               34.9 tok/s   #265
Time to First Token                      speed               0.89s        #110
Blended Price                            cost                $1.50/M      #206
Input Price                              cost                $1.00/M      #218
Output Price                             cost                $3.00/M      #196
Value Index                              cost, overall       12.4         #225
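
The Value Index figure is numerically consistent with dividing the Intelligence Index by the blended price (18.6 / 1.50). The page does not define the metric, so that relationship and the helper name `value_index` are assumptions. A minimal sketch:

```python
def value_index(intelligence_index: float, blended_price_per_m: float) -> float:
    """Intelligence points per dollar per million tokens (assumed definition)."""
    if blended_price_per_m <= 0:
        raise ValueError("blended price must be positive")
    return intelligence_index / blended_price_per_m


# Values taken from the All Benchmarks table.
score = value_index(18.6, 1.50)
print(round(score, 1))
```

Under this reading, a model improves its Value Index either by scoring higher on the Intelligence Index or by lowering its blended price.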