Microsoft
Phi-4 is a Microsoft language-model profile in the Easy Benchmarks snapshot. Use this page to compare its measured Artificial Analysis scores, output speed, time to first token, pricing, and relative ranking against other models in the local catalog.
Azure AI Foundry modelsQueryable facts extracted from the upstream model payload.
Rank #48 across 293 models.
Rank #56 across 325 models.
Rank #61 across 325 models.
Rank #250 across 293 models.
Rank #402 across 474 models.
Rank #412 across 500 models.
Percentile score by analysis domain.
* Cost is inverted: lower input, output, and blended prices rank higher.
Higher bars mean stronger relative placement.
| Metric | Domain | Value | Rank |
|---|---|---|---|
| Artificial Analysis Intelligence Index | overall | 10.4 | #412 |
| Artificial Analysis Coding Index | coding | 11.2 | #296 |
| Artificial Analysis Math Index | math | 18.0 | #214 |
| MMLU-Pro | reasoning | 71.4% | #210 |
| reasoning |
| 57.5% |
| #294 |
| Humanity's Last Exam | reasoning | 4.1% | #402 |
| LiveCodeBench | coding | 23.1% | #263 |
| SciCode | coding, reasoning | 26.0% | #307 |
| MATH-500 | math | 81.0% | #106 |
| AIME | math | 14.3% | #114 |
| Output Speed | speed | 41.6 tok/s | #250 |
| Time to First Token | speed | 0.51s | #48 |
| Blended Price | cost | $0.219/M | #61 |
| Input Price | cost | $0.125/M | #56 |
| Output Price | cost | $0.500/M | #75 |
| Value Index | cost, overall | 47.5 | #101 |