OpenAI
OpenAI's o4-mini delivers fast, cost-efficient reasoning with exceptional performance for its size, particularly excelling in math (best-performing on AIME benchmarks), coding, and visual tasks.
Introducing o3 and o4-miniRank #122 across 526
Rank #162 across 436
Rank #251 across 357
Percentile score by analysis domain.
* Cost is inverted: lower input, output, and blended prices rank higher.
Higher bars mean stronger relative placement.
| Metric | Domain | Value | Rank |
|---|---|---|---|
| Artificial Analysis Intelligence Index | overall | 33.1 | #122 |
| Artificial Analysis Coding Index | coding | 25.6 | #162 |
| Artificial Analysis Math Index | math | 90.7 | #24 |
| MMLU-Pro | reasoning | 83.2% | #58 |
| reasoning |
| 78.4% |
| #132 |
| Humanity's Last Exam | reasoning | 17.5% | #95 |
| LiveCodeBench | coding | 85.9% | #12 |
| SciCode | coding, reasoning | 46.5% | #41 |
| MATH-500 | math | 98.9% | #7 |
| AIME | math | 94.0% | #3 |
| Output Speed | speed | 151 tok/s | #73 |
| Time to First Token | speed | 17.00s | #278 |
| Blended Price | cost | $1.93/M | #251 |
| Input Price | cost | $1.10/M | #247 |
| Output Price | cost | $4.40/M | #249 |
| Value Index | cost, overall | 17.2 | #226 |