EBEasy BenchmarksLLM model index
Workspace
Overview
Benchmarks
Benchmarks list
Overall Index
Coding
Math
MMLU-Pro
Speed
Value
Models
All models
GPT-5.5 (xhigh)
GPT-5.5 (high)
Claude Opus 4.7 (Adaptive Reasoning, Max Effort)
Gemini 3.1 Pro Preview
GPT-5.4 (xhigh)
Artificial Analysis data
Back

Alibaba

Qwen3 235B A22B (Reasoning)

Qwen3 235B A22B (Reasoning) is a Alibaba language-model profile in the Easy Benchmarks snapshot. Use this page to compare its measured Artificial Analysis scores, output speed, time to first token, pricing, and relative ranking against other models in the local catalog.

Qwen model releases

Operational Metrics

Output Speed61.4 tok/s
First Token1.31s
Blended Price$2.63/M

Model Metadata

Queryable facts extracted from the upstream model payload.

ReleaseApr 28, 2025
Context Windown/a
Modalitiesn/a
API fields: release_date

Strength: AIME

Rank #19 across 194 models.

84.0%

Strength: MMLU-Pro

Rank #65 across 345 models.

82.8%

Strength: SciCode

Rank #99 across 472 models.

39.9%

Watch Area: Value

Rank #263 across 323 models.

7.5

Watch Area: Output $

Rank #252 across 325 models.

$8.40/M

Watch Area: Blended $

Rank #234 across 325 models.

$2.63/M

Strength Profile

Percentile score by analysis domain.

* Cost is inverted: lower input, output, and blended prices rank higher.

Benchmark Percentiles

Higher bars mean stronger relative placement.

All Benchmarks

MetricDomainValueRank
Artificial Analysis Intelligence Indexoverall19.8#239
Artificial Analysis Coding Indexcoding17.4#217
Artificial Analysis Math Indexmath82.0#61
MMLU-Proreasoning82.8%#65
GPQA
reasoning
70.0%
#200
Humanity's Last Examreasoning11.7%#127
LiveCodeBenchcoding62.2%#110
SciCodecoding, reasoning39.9%#99
MATH-500math93.0%#59
AIMEmath84.0%#19
Output Speedspeed61.4 tok/s#189
Time to First Tokenspeed1.31s#184
Blended Pricecost$2.63/M#234
Input Pricecost$0.700/M#197
Output Pricecost$8.40/M#252
Value Indexcost, overall7.5#263