Alibaba
Qwen3 4B (Non-reasoning) is a Alibaba language-model profile in the Easy Benchmarks snapshot. Use this page to compare its measured Artificial Analysis scores, output speed, time to first token, pricing, and relative ranking against other models in the local catalog.
Qwen model releasesQueryable facts extracted from the upstream model payload.
Rank #54 across 325 models.
Rank #56 across 325 models.
Rank #69 across 325 models.
Rank #442 across 474 models.
Rank #396 across 478 models.
Rank #390 across 472 models.
Percentile score by analysis domain.
* Cost is inverted: lower input, output, and blended prices rank higher.
Higher bars mean stronger relative placement.
| Metric | Domain | Value | Rank |
|---|---|---|---|
| Artificial Analysis Intelligence Index | overall | 12.5 | #376 |
| MMLU-Pro | reasoning | 58.6% | #269 |
| GPQA | reasoning | 39.8% | #396 |
| Humanity's Last Exam | reasoning | 3.7% | #442 |
| LiveCodeBench |
| coding |
| 23.3% |
| #261 |
| SciCode | coding, reasoning | 16.7% | #390 |
| MATH-500 | math | 84.3% | #99 |
| AIME | math | 21.3% | #102 |
| Output Speed | speed | 104.4 tok/s | #114 |
| Time to First Token | speed | 0.95s | #123 |
| Blended Price | cost | $0.188/M | #56 |
| Input Price | cost | $0.110/M | #54 |
| Output Price | cost | $0.420/M | #69 |
| Value Index | cost, overall | 66.5 | #81 |