OpenAI

o3

o3 is one of OpenAI's reasoning-focused models, built for harder multi-step tasks where deliberate problem solving matters more than simple chat completion. The benchmark snapshot highlights how that reasoning emphasis translates into scores, latency, and value versus general-purpose models.

Introducing o3 and o4-mini

Output Speed72.7 tok/s

First Token8.55s

Blended Price$3.50/M

ReleaseApr 16, 2025

Context Windown/a

Modalitiesn/a

API fields: release_date

99.2%

90.3%

80.8%

8.55s

$2.00/M

$3.50/M

* Cost is inverted: lower input, output, and blended prices rank higher.

All Benchmarks

Metric	Domain	Value	Rank
Artificial Analysis Intelligence Index	overall	38.4	#87
Artificial Analysis Coding Index	coding	38.4	#57
Artificial Analysis Math Index	math	88.3	#35
MMLU-Pro	reasoning	85.3%	#29

Back

OpenAI

o3

Introducing o3 and o4-mini

Output Speed72.7 tok/s

First Token8.55s

Blended Price$3.50/M

ReleaseApr 16, 2025

Context Windown/a

Modalitiesn/a

API fields: release_date

99.2%

90.3%

80.8%

8.55s

$2.00/M

$3.50/M

* Cost is inverted: lower input, output, and blended prices rank higher.

All Benchmarks

Metric	Domain	Value	Rank
Artificial Analysis Intelligence Index	overall	38.4	#87
Artificial Analysis Coding Index	coding	38.4	#57
Artificial Analysis Math Index	math	88.3	#35
MMLU-Pro	reasoning	85.3%	#29

o3

Operational Metrics

Model Metadata

Strength: MATH-500

Strength: AIME

Strength: LCB

Watch Area: TTFT

Watch Area: Input $

Watch Area: Blended $

Strength Profile

Benchmark Percentiles

All Benchmarks

o3

Operational Metrics

Model Metadata

Strength: MATH-500

Strength: AIME

Strength: LCB

Watch Area: TTFT

Watch Area: Input $

Watch Area: Blended $

Strength Profile

Benchmark Percentiles

All Benchmarks