
Anthropic

Claude Opus 4.5 (Reasoning)

Claude Opus 4.5 (Reasoning) is the reasoning-enabled profile of Anthropic's Claude Opus line, representing the high-capability end of the Claude lineup for difficult coding, agentic, research, and computer-use tasks. This page compares this exact benchmarked variant against other Claude and non-Claude models on capability, speed, and price.


Operational Metrics

Output Speed: 57 tok/s
Time to First Token: 9.84 s
Blended Price: $10.00/M tokens
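
The blended price shown here is consistent with a 3:1 input-to-output weighted average of the per-direction prices listed further down (input $5.00/M, output $25.00/M). A minimal sketch of that calculation follows; the 3:1 weighting is an assumption inferred from the figures on this page, not something the source states:

```python
def blended_price(input_usd_per_m: float, output_usd_per_m: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average of per-million-token prices.

    ASSUMPTION: the 3:1 input:output weighting reproduces the $10.00/M
    blended figure on this page but is not confirmed upstream.
    """
    total = input_weight + output_weight
    return (input_weight * input_usd_per_m
            + output_weight * output_usd_per_m) / total

# (3 * $5.00 + 1 * $25.00) / 4 = $10.00 per 1M tokens
print(blended_price(5.00, 25.00))  # 10.0
```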

Model Metadata

Queryable facts extracted from the upstream model payload.

Release: Nov 24, 2025
Context Window: n/a
Modalities: n/a
API fields: release_date
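
Only release_date is populated in the payload for this model; the context window and modalities render as "n/a". A small sketch of how a consumer might read that payload defensively (the payload shape beyond the release_date key is a hypothetical assumption):

```python
from datetime import date, datetime
from typing import Optional

def read_release_date(payload: dict) -> Optional[date]:
    """Parse 'release_date' from the upstream model payload.

    Missing or empty fields come back as None, which the page renders
    as "n/a". The ISO date format is an assumption about the payload.
    """
    raw = payload.get("release_date")
    if not raw:
        return None
    return datetime.strptime(raw, "%Y-%m-%d").date()

print(read_release_date({"release_date": "2025-11-24"}))  # 2025-11-24
print(read_release_date({}))                              # None -> "n/a"
```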

Strength: MMLU-Pro

89.5%, rank #2 across 345 models.

Strength: LiveCodeBench (LCB)

87.1%, rank #8 across 343 models.

Strength: Coding

47.8, rank #15 across 410 models.

Watch Area: Blended Price

$10.00/M, rank #304 across 325 models.

Watch Area: Output Price

$25.00/M, rank #304 across 325 models.

Watch Area: Input Price

$5.00/M, rank #303 across 325 models.

Strength Profile

[Chart: percentile score by analysis domain.]

* Cost is inverted: lower input, output, and blended prices rank higher.
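
With only a rank and a field size, the percentile placement in the chart can be recovered directly; the footnote means price ranks are already sorted so that rank #1 is the cheapest model. A sketch of that conversion, assuming a linear rank-to-percentile mapping (the site's exact formula is not stated):

```python
def percentile(rank: int, n_models: int) -> float:
    """Map a 1-based rank among n_models to a 0-100 percentile,
    where rank #1 scores 100 and rank n_models scores 0."""
    return 100.0 * (n_models - rank) / (n_models - 1)

# Capability: a strong score means a low rank number, hence a high bar.
print(round(percentile(2, 345), 1))    # MMLU-Pro, rank #2 of 345 -> 99.7

# Cost: ranks are inverted per the footnote (rank #1 = cheapest), so an
# expensive model like this one lands near the bottom of the chart.
print(round(percentile(304, 325), 1))  # Blended price, rank #304 of 325 -> 6.5
```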

Benchmark Percentiles

[Chart: benchmark percentiles; higher bars mean stronger relative placement.]

All Benchmarks

| Metric | Domain | Value | Rank |
| --- | --- | --- | --- |
| Artificial Analysis Intelligence Index | overall | 49.7 | #22 |
| Artificial Analysis Coding Index | coding | 47.8 | #15 |
| Artificial Analysis Math Index | math | 91.3 | #20 |
| MMLU-Pro | reasoning | 89.5% | #2 |
| GPQA | reasoning | 86.6% | #38 |
| Humanity's Last Exam | reasoning | 28.4% | #28 |
| LiveCodeBench | coding | 87.1% | #8 |
| SciCode | coding, reasoning | 49.5% | #23 |
| Output Speed | speed | 57 tok/s | #201 |
| Time to First Token | speed | 9.84 s | #259 |
| Blended Price | cost | $10.00/M | #304 |
| Input Price | cost | $5.00/M | #303 |
| Output Price | cost | $25.00/M | #304 |
| Value Index | cost, overall | 5.0 | #290 |
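
The Value Index formula is not given on this page. One construction consistent with the figures above is intelligence per blended dollar: 49.7 / $10.00/M rounds to 5.0. A sketch under that assumption only; the real definition may differ:

```python
def value_index(intelligence_index: float, blended_usd_per_m: float) -> float:
    """Capability per blended dollar.

    ASSUMPTION: dividing the Artificial Analysis Intelligence Index by
    the blended price happens to reproduce the 5.0 shown in the table
    (49.7 / 10.00 = 4.97), but the actual Value Index formula is not
    documented on this page.
    """
    return intelligence_index / blended_usd_per_m

print(round(value_index(49.7, 10.00), 1))  # 5.0
```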