
Microsoft

Phi-4

Phi-4 is a Microsoft language model profiled in the Easy Benchmarks snapshot. This page reports its measured Artificial Analysis scores, output speed, time to first token (TTFT), and pricing, along with its rank on each metric relative to the other models in the local catalog.

Azure AI Foundry models

Operational Metrics

Output Speed: 41.6 tok/s
First Token: 0.51s
Blended Price: $0.219/M
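
As a rough illustration of how the two speed figures combine, the sketch below estimates total response time as time to first token plus generation time at the listed output speed. The function name and the assumption that throughput stays constant for the whole response are illustrative and not something this page states.

```python
def estimated_response_seconds(
    output_tokens: int,
    ttft_s: float = 0.51,        # First Token value listed above
    tokens_per_s: float = 41.6,  # Output Speed value listed above
) -> float:
    """Estimate wall-clock seconds to receive a full response.

    Assumes (hypothetically) a constant output speed after the first token.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    if tokens_per_s <= 0:
        raise ValueError("tokens_per_s must be positive")
    return ttft_s + output_tokens / tokens_per_s


if __name__ == "__main__":
    for n in (100, 500, 1000):
        print(f"{n} tokens: about {estimated_response_seconds(n):.1f}s")
```

Real latency also depends on prompt length, provider load, and network conditions, so treat the result as a first-order estimate only.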

Model Metadata

Queryable facts extracted from the upstream model payload.

Release: Dec 12, 2024
Context Window: n/a
Modalities: n/a
API fields: release_date

Strength: TTFT

Rank #48 across 293 models.

0.51s

Strength: Input $

Rank #56 across 325 models.

$0.125/M

Strength: Blended $

Rank #61 across 325 models.

$0.219/M

Watch Area: Speed

Rank #250 across 293 models.

41.6 tok/s

Watch Area: HLE

Rank #402 across 474 models.

4.1%

Watch Area: Overall

Rank #412 across 500 models.

10.4

Strength Profile

Percentile score by analysis domain.

* Cost is inverted: lower input, output, and blended prices rank higher.

Benchmark Percentiles

Higher bars mean stronger relative placement.
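
The page does not state how a rank is turned into a percentile. One common convention, shown here as a sketch under that assumption, maps rank 1 to 100 and the last rank to 0. Because cost ranks are already inverted (cheaper models get better ranks), the same mapping applies to them unchanged. The function name `rank_percentile` is hypothetical.

```python
def rank_percentile(rank: int, total: int) -> float:
    """Map a 1-based rank among `total` models to a 0-100 percentile.

    Assumed convention: rank 1 -> 100.0, rank == total -> 0.0.
    """
    if total < 1 or not 1 <= rank <= total:
        raise ValueError("rank must satisfy 1 <= rank <= total")
    if total == 1:
        return 100.0
    return 100.0 * (total - rank) / (total - 1)


if __name__ == "__main__":
    # Ranks taken from the strength and watch-area cards on this page.
    cards = {
        "TTFT": (48, 293),
        "Input $": (56, 325),
        "Blended $": (61, 325),
        "Speed": (250, 293),
        "HLE": (402, 474),
        "Overall": (412, 500),
    }
    for name, (rank, total) in cards.items():
        print(f"{name}: {rank_percentile(rank, total):.1f}")
```

Under this convention the TTFT rank (#48 of 293) lands near the 84th percentile, while the output speed rank (#250 of 293) lands near the 15th, which is consistent with the strength and watch-area labels above.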

All Benchmarks

| Metric | Domain | Value | Rank |
| --- | --- | --- | --- |
| Artificial Analysis Intelligence Index | overall | 10.4 | #412 |
| Artificial Analysis Coding Index | coding | 11.2 | #296 |
| Artificial Analysis Math Index | math | 18.0 | #214 |
| MMLU-Pro | reasoning | 71.4% | #210 |
| GPQA | reasoning | 57.5% | #294 |
| Humanity's Last Exam | reasoning | 4.1% | #402 |
| LiveCodeBench | coding | 23.1% | #263 |
| SciCode | coding, reasoning | 26.0% | #307 |
| MATH-500 | math | 81.0% | #106 |
| AIME | math | 14.3% | #114 |
| Output Speed | speed | 41.6 tok/s | #250 |
| Time to First Token | speed | 0.51s | #48 |
| Blended Price | cost | $0.219/M | #61 |
| Input Price | cost | $0.125/M | #56 |
| Output Price | cost | $0.500/M | #75 |
| Value Index | cost, overall | 47.5 | #101 |
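
The page does not say how the blended price is derived from the input and output prices. The sketch below assumes a 3:1 input-to-output token weighting, which happens to reproduce the listed $0.219/M from the listed $0.125/M input and $0.500/M output prices. The function name and default weights are assumptions for illustration.

```python
def blended_price(
    input_per_m: float,
    output_per_m: float,
    input_weight: float = 3.0,   # assumed weighting, not stated on the page
    output_weight: float = 1.0,  # assumed weighting, not stated on the page
) -> float:
    """Weighted average price per million tokens (USD)."""
    total_weight = input_weight + output_weight
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive number")
    return (input_weight * input_per_m + output_weight * output_per_m) / total_weight


if __name__ == "__main__":
    # Input and output prices from the table above.
    price = blended_price(0.125, 0.500)
    print(f"Blended price: ${price:.3f}/M")
```

Workloads with a different mix of input and output tokens can pass their own weights to get a figure closer to their actual cost.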