EBEasy BenchmarksLLM model index
Workspace
Overview
Benchmarks
Benchmarks list
Overall Index
Coding
Math
MMLU-Pro
Speed
Value
Models
All models
GPT-5.5 (xhigh)
GPT-5.5 (high)
Claude Opus 4.7 (Adaptive Reasoning, Max Effort)
Gemini 3.1 Pro Preview
GPT-5.4 (xhigh)
Artificial Analysis data
Back

Alibaba

Qwen3 VL 235B A22B (Reasoning)

Qwen3 VL 235B A22B (Reasoning) is a Alibaba language-model profile in the Easy Benchmarks snapshot. Use this page to compare its measured Artificial Analysis scores, output speed, time to first token, pricing, and relative ranking against other models in the local catalog.

Qwen model releases

Operational Metrics

Output Speed46.2 tok/s
First Token1.21s
Blended Price$2.63/M

Model Metadata

Queryable facts extracted from the upstream model payload.

ReleaseSep 23, 2025
Context Windown/a
Modalitiesn/a
API fields: release_date

Strength: Math

Rank #36 across 269 models.

88.3

Strength: MMLU-Pro

Rank #54 across 345 models.

83.6%

Strength: SciCode

Rank #100 across 472 models.

39.9%

Watch Area: Speed

Rank #236 across 293 models.

46.2 tok/s

Watch Area: Output $

Rank #255 across 325 models.

$8.40/M

Watch Area: Value

Rank #241 across 323 models.

10.5

Strength Profile

Percentile score by analysis domain.

* Cost is inverted: lower input, output, and blended prices rank higher.

Benchmark Percentiles

Higher bars mean stronger relative placement.

All Benchmarks

MetricDomainValueRank
Artificial Analysis Intelligence Indexoverall27.6#164
Artificial Analysis Coding Indexcoding20.9#188
Artificial Analysis Math Indexmath88.3#36
MMLU-Proreasoning83.6%#54
GPQA
reasoning
77.2%
#131
Humanity's Last Examreasoning10.1%#150
LiveCodeBenchcoding64.6%#99
SciCodecoding, reasoning39.9%#100
Output Speedspeed46.2 tok/s#236
Time to First Tokenspeed1.21s#171
Blended Pricecost$2.63/M#237
Input Pricecost$0.700/M#202
Output Pricecost$8.40/M#255
Value Indexcost, overall10.5#241