EBEasy BenchmarksLLM model index
Workspace
Overview
Benchmarks
Benchmarks list
Overall Index
Coding
Math
MMLU-Pro
Speed
Value
Models
All models
GPT-5.5 (xhigh)
GPT-5.5 (high)
Claude Opus 4.7 (Adaptive Reasoning, Max Effort)
Gemini 3.1 Pro Preview
GPT-5.4 (xhigh)
Artificial Analysis data
Back

Alibaba

Qwen3 14B (Reasoning)

Qwen3 14B (Reasoning) is a Alibaba language-model profile in the Easy Benchmarks snapshot. Use this page to compare its measured Artificial Analysis scores, output speed, time to first token, pricing, and relative ranking against other models in the local catalog.

Qwen model releases

Operational Metrics

Output Speed63 tok/s
First Token1.03s
Blended Price$1.31/M

Model Metadata

Queryable facts extracted from the upstream model payload.

ReleaseApr 28, 2025
Context Windown/a
Modalitiesn/a
API fields: release_date

Strength: AIME

Rank #29 across 194 models.

76.3%

Strength: MATH-500

Rank #34 across 201 models.

96.1%

Watch Area: HLE

Rank #379 across 474 models.

4.3%

Watch Area: Value

Rank #226 across 323 models.

12.3

Watch Area: Coding

Rank #279 across 410 models.

13.1

Strength Profile

Percentile score by analysis domain.

* Cost is inverted: lower input, output, and blended prices rank higher.

Benchmark Percentiles

Higher bars mean stronger relative placement.

All Benchmarks

MetricDomainValueRank
Artificial Analysis Intelligence Indexoverall16.2#292
Artificial Analysis Coding Indexcoding13.1#279
Artificial Analysis Math Indexmath55.7#131
MMLU-Proreasoning77.4%#150
GPQA
reasoning
60.4%
#272
Humanity's Last Examreasoning4.3%#379
LiveCodeBenchcoding52.3%#142
SciCodecoding, reasoning31.6%#229
MATH-500math96.1%#34
AIMEmath76.3%#29
Output Speedspeed63 tok/s#187
Time to First Tokenspeed1.03s#137
Blended Pricecost$1.31/M#197
Input Pricecost$0.350/M#149
Output Pricecost$4.20/M#216
Value Indexcost, overall12.3#226