EBEasy BenchmarksLLM model index
Workspace
Overview
Benchmarks
Benchmarks list
Overall Index
Coding
Math
MMLU-Pro
Speed
Value
Models
All models
GPT-5.5 (xhigh)
GPT-5.5 (high)
Claude Opus 4.7 (Adaptive Reasoning, Max Effort)
Gemini 3.1 Pro Preview
GPT-5.4 (xhigh)
Artificial Analysis data
Back

Alibaba

Qwen3 4B 2507 (Reasoning)

Qwen3 4B 2507 (Reasoning) is a Alibaba language-model profile in the Easy Benchmarks snapshot. Use this page to compare its measured Artificial Analysis scores, output speed, time to first token, pricing, and relative ranking against other models in the local catalog.

Qwen model releases

Operational Metrics

Output Speedn/a
First Tokenn/a
Blended Price-

Model Metadata

Queryable facts extracted from the upstream model payload.

ReleaseAug 6, 2025
Context Windown/a
Modalitiesn/a
API fields: release_date

Strength: Math

Rank #57 across 269 models.

82.7

Watch Area: Coding

Rank #325 across 410 models.

9.5

Watch Area: SciCode

Rank #311 across 472 models.

25.6%

Strength Profile

Percentile score by analysis domain.

Benchmark Percentiles

Higher bars mean stronger relative placement.

All Benchmarks

MetricDomainValueRank
Artificial Analysis Intelligence Indexoverall18.2#266
Artificial Analysis Coding Indexcoding9.5#325
Artificial Analysis Math Indexmath82.7#57
MMLU-Proreasoning74.3%#187
GPQA
reasoning
66.7%
#231
Humanity's Last Examreasoning5.9%#244
LiveCodeBenchcoding64.1%#103
SciCodecoding, reasoning25.6%#311