
Anthropic

Claude 4 Opus (Reasoning)

Claude 4 Opus (Reasoning) is the reasoning-enabled profile of Anthropic's Claude Opus line, representing the high-capability end of the Claude lineup for difficult coding, agentic, research, and computer-use tasks. This page compares the exact benchmarked variant against other Claude and non-Claude models on capability, runtime, and price.


Operational Metrics

Output Speed: 36.8 tok/s
Time to First Token: 7.03s
Blended Price: $30.00/M tokens
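
Two of these figures can be cross-checked with simple arithmetic. The blended price matches a 3:1 input:output token weighting of the per-direction prices listed further down, and TTFT plus output speed give a rough end-to-end latency estimate. A minimal sketch, assuming that weighting (a common convention, not confirmed on this page):

```python
# Rough sanity checks on the operational metrics above.
# Assumption: blended price uses a 3:1 input:output token weighting
# (a common convention; this page does not state the formula).

input_price = 15.00   # $/M tokens
output_price = 75.00  # $/M tokens
blended = (3 * input_price + 1 * output_price) / 4
print(f"blended price: ${blended:.2f}/M")  # $30.00/M, matching the page

# Rough wall-clock estimate for a 1,000-token completion:
ttft = 7.03   # seconds to first token
speed = 36.8  # output tokens per second
total = ttft + 1000 / speed
print(f"~{total:.1f}s for 1,000 output tokens")  # ~34.2s
```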

Model Metadata

Queryable facts extracted from the upstream model payload.

Release Date: May 22, 2025
Context Window: n/a
Modalities: n/a
API fields: release_date
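
A minimal sketch of reading the queryable field out of a model payload. The payload shape is an assumption for illustration; only `release_date` is documented above, which is why the other fields fall back to "n/a":

```python
import json

# Hypothetical payload shape; only `release_date` is documented as a
# queryable API field on this page.
payload = json.loads('{"release_date": "2025-05-22"}')

release = payload.get("release_date", "n/a")
context_window = payload.get("context_window", "n/a")  # absent upstream, hence "n/a" above
print(release, context_window)  # 2025-05-22 n/a
```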

Strength: MMLU-Pro
87.3% (rank #11 of 345 models)

Strength: MATH-500
98.2% (rank #14 of 201 models)

Strength: AIME
75.7% (rank #30 of 194 models)

Watch Area: Output Price
$75.00/M (rank #320 of 325 models)

Watch Area: Blended Price
$30.00/M (rank #319 of 325 models)

Watch Area: Input Price
$15.00/M (rank #317 of 325 models)

Strength Profile

[Chart: percentile score by analysis domain. Cost is inverted: lower input, output, and blended prices rank higher.]
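
The inversion note implies a rank-to-percentile conversion in which, for cost metrics, cheaper models earn the better rank. A sketch of one plausible conversion, assuming percentile is simply the share of models ranked below this one; the formula is an assumption, not the site's published method:

```python
# Illustrative rank-to-percentile conversion. Assumptions: rank #1 is
# best on every metric, and cost ranks are assigned cheapest-first
# (the "inversion" the note describes). Not the site's exact formula.

def percentile(rank: int, total: int) -> float:
    """Share of models ranked below this one, as a percentage."""
    return 100 * (total - rank) / total

print(f"MMLU-Pro (rank 11/345):   {percentile(11, 345):.1f}")   # ~96.8
print(f"Blended $ (rank 319/325): {percentile(319, 325):.1f}")  # ~1.8
```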

Benchmark Percentiles

[Chart: per-benchmark percentile placement; higher bars mean stronger relative placement.]

All Benchmarks

| Metric | Domain | Value | Rank |
| --- | --- | --- | --- |
| Artificial Analysis Intelligence Index | overall | 39.0 | #80 |
| Artificial Analysis Coding Index | coding | 34.0 | #88 |
| Artificial Analysis Math Index | math | 73.3 | #85 |
| MMLU-Pro | reasoning | 87.3% | #11 |
| GPQA | reasoning | 79.6% | #104 |
| Humanity's Last Exam | reasoning | 11.7% | #126 |
| LiveCodeBench | coding | 63.6% | #105 |
| SciCode | coding, reasoning | 39.8% | #101 |
| MATH-500 | math | 98.2% | #14 |
| AIME | math | 75.7% | #30 |
| Output Speed | speed | 36.8 tok/s | #258 |
| Time to First Token | speed | 7.03s | #247 |
| Blended Price | cost | $30.00/M | #319 |
| Input Price | cost | $15.00/M | #317 |
| Output Price | cost | $75.00/M | #320 |
| Value Index | cost, overall | 1.3 | #314 |
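
One relationship worth noting: for this model, the published Value Index equals the Intelligence Index divided by the blended price. The check below treats that as an observed coincidence of the two figures on this page, not a documented definition:

```python
# Observed relationship on this page (not a documented definition):
# Value Index ~= Intelligence Index / blended price ($/M tokens).
intelligence_index = 39.0
blended_price = 30.00  # $/M tokens
print(round(intelligence_index / blended_price, 1))  # 1.3, the published Value Index
```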