
Alibaba

Qwen3 4B (Reasoning)

Qwen3 4B (Reasoning) is an Alibaba language-model profile in the Easy Benchmarks snapshot. Use this page to compare its measured Artificial Analysis scores, output speed, time to first token, pricing, and relative ranking against other models in the local catalog.


Operational Metrics

Output Speed: 101.8 tok/s
First Token: 1.07s
Blended Price: $0.398/M
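
As a rough illustration of how the two speed figures combine, the sketch below estimates wall-clock time for a streamed response as time to first token plus output tokens divided by output speed. The function name and the 500-token example are hypothetical, and the model assumes a constant streaming rate; it also ignores any reasoning tokens the model may emit before the visible answer.

```python
def estimated_response_seconds(output_tokens: int,
                               ttft_s: float = 1.07,
                               tokens_per_s: float = 101.8) -> float:
    """Rough estimate: time to first token plus streaming time.

    Defaults are the figures listed on this page. A constant streaming
    rate is an assumption, not something the page states.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return ttft_s + output_tokens / tokens_per_s


# Hypothetical 500-token reply.
print(f"{estimated_response_seconds(500):.2f} s")  # prints "5.98 s"
```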

Model Metadata

Queryable facts extracted from the upstream model payload.

Release: Apr 28, 2025
Context Window: n/a
Modalities: n/a
API fields: release_date

Strength: Input Price

Rank #55 across 325 models.

$0.110/M

Strength: AIME

Rank #49 across 194 models.

65.7%

Watch Area: SciCode

Rank #455 across 472 models.

3.5%

Watch Area: Math

Rank #203 across 269 models.

22.3

Watch Area: Overall

Rank #340 across 500 models.

14.2

Strength Profile

Percentile score by analysis domain.

* Cost is inverted: lower input, output, and blended prices rank higher.

Benchmark Percentiles

Higher bars mean stronger relative placement.
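
The page does not say how ranks are turned into percentiles. One plausible convention, sketched below, maps rank 1 to 100 and the last rank to 0. The function name and the formula are assumptions for illustration only. Because the cost ranks already place lower prices higher, no extra inversion is applied here.

```python
def rank_to_percentile(rank: int, total: int) -> float:
    """Map a 1-based rank among `total` models to a 0-100 percentile.

    Assumed convention (not confirmed by the page):
    rank 1 -> 100.0, rank == total -> 0.0.
    """
    if total < 2 or not 1 <= rank <= total:
        raise ValueError("need total >= 2 and 1 <= rank <= total")
    return 100.0 * (total - rank) / (total - 1)


# Rank and model-count pairs taken from the highlight cards on this page.
highlights = {
    "Input Price": (55, 325),
    "AIME": (49, 194),
    "SciCode": (455, 472),
    "Math": (203, 269),
    "Overall": (340, 500),
}

for name, (rank, total) in highlights.items():
    print(f"{name}: {rank_to_percentile(rank, total):.1f}")
```

Under this convention the two strengths land well above the midpoint and the three watch areas below it, which matches how the page labels them.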

All Benchmarks

| Metric | Domain | Value | Rank |
| --- | --- | --- | --- |
| Artificial Analysis Intelligence Index | overall | 14.2 | #340 |
| Artificial Analysis Math Index | math | 22.3 | #203 |
| MMLU-Pro | reasoning | 69.6% | #225 |
| GPQA | reasoning | 52.2% | #324 |
| Humanity's Last Exam | reasoning | 5.1% | #293 |
| LiveCodeBench | coding | 46.5% | #163 |
| SciCode | coding, reasoning | 3.5% | #455 |
| MATH-500 | math | 93.3% | #54 |
| AIME | math | 65.7% | #49 |
| Output Speed | speed | 101.8 tok/s | #117 |
| Time to First Token | speed | 1.07s | #144 |
| Blended Price | cost | $0.398/M | #103 |
| Input Price | cost | $0.110/M | #55 |
| Output Price | cost | $1.26/M | #131 |
| Value Index | cost, overall | 35.7 | #127 |
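
The page does not state how the blended price is derived from the input and output prices. A minimal sketch, assuming a 3:1 input-to-output token weighting (an assumption, though it does reproduce the listed figure), is shown below. The function name and parameters are hypothetical.

```python
def blended_price(input_per_m: float, output_per_m: float,
                  input_weight: float = 3.0,
                  output_weight: float = 1.0) -> float:
    """Weighted average price per million tokens.

    The 3:1 default weighting is an assumption, not stated on this page.
    """
    total_weight = input_weight + output_weight
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    return (input_weight * input_per_m
            + output_weight * output_per_m) / total_weight


# Input and output prices as listed on this page (USD per million tokens).
price = blended_price(0.110, 1.26)
print(f"${price:.4f}/M")  # prints "$0.3975/M", in line with the listed $0.398/M
```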