OpenAI
GPT 4.1 is OpenAI's flagship model for complex tasks. It is well suited for problem solving across domains.
OpenAI model releasesRank #180 across 526
Rank #202 across 436
Rank #293 across 357
Percentile score by analysis domain.
* Cost is inverted: lower input, output, and blended prices rank higher.
Higher bars mean stronger relative placement.
| Metric | Domain | Value | Rank |
|---|---|---|---|
| Artificial Analysis Intelligence Index | overall | 26.3 | #180 |
| Artificial Analysis Coding Index | coding | 21.8 | #202 |
| Artificial Analysis Math Index | math | 34.7 | #175 |
| MMLU-Pro | reasoning | 80.6% | #104 |
| reasoning |
| 66.6% |
| #252 |
| Humanity's Last Exam | reasoning | 4.6% | #363 |
| LiveCodeBench | coding | 45.7% | #165 |
| SciCode | coding, reasoning | 38.1% | #150 |
| MATH-500 | math | 91.3% | #67 |
| AIME | math | 43.7% | #67 |
| Output Speed | speed | 128.3 tok/s | #102 |
| Time to First Token | speed | 0.56s | #62 |
| Blended Price | cost | $3.50/M | #293 |
| Input Price | cost | $2.00/M | #293 |
| Output Price | cost | $8.00/M | #276 |
| Value Index | cost, overall | 7.5 | #291 |