EBEasy BenchmarksLLM model index
Workspace
Overview
Benchmarks
Benchmarks list
Overall Index
Coding
Math
MMLU-Pro
Speed
Value
Models
All models
GPT-5.5 (xhigh)
GPT-5.5 (high)
Claude Opus 4.7 (Adaptive Reasoning, Max Effort)
Gemini 3.1 Pro Preview
GPT-5.4 (xhigh)
Artificial Analysis data
Back

Alibaba

Qwen3 Max Thinking (Preview)

Qwen3 Max Thinking (Preview) is a Alibaba language-model profile in the Easy Benchmarks snapshot. Use this page to compare its measured Artificial Analysis scores, output speed, time to first token, pricing, and relative ranking against other models in the local catalog.

Qwen model releases

Operational Metrics

Output Speed40.8 tok/s
First Token1.78s
Blended Price$2.40/M

Model Metadata

Queryable facts extracted from the upstream model payload.

ReleaseNov 3, 2025
Context Windown/a
Modalitiesn/a
API fields: release_date

Strength: MMLU-Pro

Rank #69 across 345 models.

82.4%

Strength: Math

Rank #58 across 269 models.

82.3

Strength: Overall

Rank #124 across 500 models.

32.5

Watch Area: Speed

Rank #254 across 293 models.

40.8 tok/s

Watch Area: TTFT

Rank #221 across 293 models.

1.78s

Watch Area: Output $

Rank #240 across 325 models.

$6.00/M

Strength Profile

Percentile score by analysis domain.

* Cost is inverted: lower input, output, and blended prices rank higher.

Benchmark Percentiles

Higher bars mean stronger relative placement.

All Benchmarks

MetricDomainValueRank
Artificial Analysis Intelligence Indexoverall32.5#124
Artificial Analysis Coding Indexcoding24.5#156
Artificial Analysis Math Indexmath82.3#58
MMLU-Proreasoning82.4%#69
GPQA
reasoning
77.6%
#127
Humanity's Last Examreasoning12.0%#124
LiveCodeBenchcoding53.5%#138
SciCodecoding, reasoning38.7%#125
Output Speedspeed40.8 tok/s#254
Time to First Tokenspeed1.78s#221
Blended Pricecost$2.40/M#233
Input Pricecost$1.20/M#229
Output Pricecost$6.00/M#240
Value Indexcost, overall13.5#217