Anthropic

Claude 4.1 Opus (Reasoning)

Claude 4.1 Opus (Reasoning) is the reasoning-enabled profile of Anthropic's Claude Opus, the high-capability end of the Claude lineup for difficult coding, agentic, research, and computer-use tasks. This page compares the exact benchmarked variant against other Claude and non-Claude models on capability, runtime, and price.

Operational Metrics

Output Speed: 35.8 tok/s
Time to First Token: 8.74s
Blended Price: $30.00/M
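
Taken together, the two latency figures give a rough end-to-end estimate: total time is approximately the time to first token plus the number of generated tokens divided by output speed. A minimal sketch using the numbers above (the helper name and the 500-token response length are illustrative, not from the page):

```python
def estimated_latency_s(ttft_s: float, tok_per_s: float, n_output_tokens: int) -> float:
    """Rough end-to-end latency: wait for the first token, then stream the rest."""
    return ttft_s + n_output_tokens / tok_per_s

# Figures from Operational Metrics above; 500 tokens is an illustrative length.
print(round(estimated_latency_s(8.74, 35.8, 500), 1))  # ~22.7 s
```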

Model Metadata

Queryable facts extracted from the upstream model payload.

Release Date: Aug 5, 2025
Context Window: n/a
Modalities: n/a
API fields: release_date
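
The fields above are pulled from whatever the upstream model payload exposes; for this model only release_date is populated, which is why Context Window and Modalities show n/a. A minimal sketch of that extraction, assuming a flat JSON payload (the payload shape and every field name other than release_date are assumptions):

```python
import json

# Hypothetical upstream payload: only release_date is present for this model.
payload = json.loads('{"name": "Claude 4.1 Opus (Reasoning)", "release_date": "2025-08-05"}')

def metadata_field(data: dict, key: str) -> str:
    """Return a payload field if present, otherwise the page's 'n/a' fallback."""
    value = data.get(key)
    return str(value) if value is not None else "n/a"

for key in ("release_date", "context_window", "modalities"):
    print(f"{key}: {metadata_field(payload, key)}")
```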

Strength: MMLU-Pro
88.0% (rank #7 across 345 models)

Strength: Overall
42.0 (rank #59 across 500 models)

Strength: Coding
36.5 (rank #68 across 410 models)

Watch Area: Output Price
$75.00/M (rank #322 across 325 models)

Watch Area: Blended Price
$30.00/M (rank #321 across 325 models)

Watch Area: Input Price
$15.00/M (rank #319 across 325 models)
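
The blended figure is derivable from the input and output prices above. A 3:1 input:output token weighting, the convention Artificial Analysis-style indexes commonly use, reproduces it exactly: (3 × $15.00 + 1 × $75.00) / 4 = $30.00 per million tokens. A sketch under that assumed weighting:

```python
def blended_price_per_m(input_usd: float, output_usd: float,
                        input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Token-weighted average price per million tokens. The 3:1 weighting is
    an assumption, but it reproduces the $30.00/M blended price shown above."""
    return (input_weight * input_usd + output_weight * output_usd) / (input_weight + output_weight)

print(blended_price_per_m(15.00, 75.00))  # 30.0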

Strength Profile

Percentile score by analysis domain.

* Cost is inverted: lower input, output, and blended prices rank higher.
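
Converting a rank into the percentile bar plotted here is a one-line mapping, with the caveat from the footnote: cost ranks are taken on ascending price, so the inversion is already baked into the rank. A sketch assuming a linear percentile formula (the page does not state the exact one it uses):

```python
def rank_to_percentile(rank: int, total: int) -> float:
    """Map rank #1..#total onto a 100..0 scale (assumed linear mapping)."""
    return 100.0 * (total - rank) / (total - 1)

print(round(rank_to_percentile(7, 345), 1))    # MMLU-Pro strength: ~98.3
print(round(rank_to_percentile(321, 325), 1))  # Blended Price watch area: ~1.2
```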

Benchmark Percentiles

Higher bars mean stronger relative placement.

All Benchmarks

Metric | Domain | Value | Rank
Artificial Analysis Intelligence Index | overall | 42.0 | #59
Artificial Analysis Coding Index | coding | 36.5 | #68
Artificial Analysis Math Index | math | 80.3 | #64
MMLU-Pro | reasoning | 88.0% | #7
GPQA | reasoning | 80.9% | #95
Humanity's Last Exam | reasoning | 11.9% | #125
LiveCodeBench | coding | 65.4% | #94
SciCode | coding, reasoning | 40.9% | #80
Output Speed | speed | 35.8 tok/s | #262
Time to First Token | speed | 8.74s | #256
Blended Price | cost | $30.00/M | #321
Input Price | cost | $15.00/M | #319
Output Price | cost | $75.00/M | #322
Value Index | cost, overall | 1.4 | #312