Alibaba
Qwen3 4B (Reasoning) is a Alibaba language-model profile in the Easy Benchmarks snapshot. Use this page to compare its measured Artificial Analysis scores, output speed, time to first token, pricing, and relative ranking against other models in the local catalog.
Qwen model releasesRank #346 across 526
No rank
Rank #123 across 357
Percentile score by analysis domain.
* Cost is inverted: lower input, output, and blended prices rank higher.
Higher bars mean stronger relative placement.
Streaming speed is not measured for this model yet.
| Metric | Domain | Value | Rank |
|---|---|---|---|
| Artificial Analysis Intelligence Index | overall | 14.2 | #346 |
| Artificial Analysis Math Index | math | 22.3 | #203 |
| MMLU-Pro | reasoning | 69.6% | #225 |
| GPQA | reasoning | 52.2% | #345 |
| Humanity's Last Exam |
| reasoning |
| 5.1% |
| #316 |
| LiveCodeBench | coding | 46.5% | #163 |
| SciCode | coding, reasoning | 3.5% | #479 |
| MATH-500 | math | 93.3% | #54 |
| AIME | math | 65.7% | #49 |
| Blended Price | cost | $0.398/M | #123 |
| Input Price | cost | $0.110/M | #66 |
| Output Price | cost | $1.26/M | #158 |
| Value Index | cost, overall | 35.7 | #151 |