EBEasy BenchmarksLLM model index
Workspace
Overview
Benchmarks
Benchmarks list
Overall Index
Coding
Math
MMLU-Pro
Speed
Value
Models
All models
GPT-5.5 (xhigh)
GPT-5.5 (high)
Claude Opus 4.7 (Adaptive Reasoning, Max Effort)
Gemini 3.1 Pro Preview
GPT-5.4 (xhigh)
Artificial Analysis data
Back

Alibaba

Qwen3 235B A22B 2507 Instruct

Qwen3 235B A22B 2507 Instruct is a Alibaba language-model profile in the Easy Benchmarks snapshot. Use this page to compare its measured Artificial Analysis scores, output speed, time to first token, pricing, and relative ranking against other models in the local catalog.

Qwen model releases

Operational Metrics

Output Speed64.7 tok/s
First Token1.10s
Blended Price$1.23/M

Model Metadata

Queryable facts extracted from the upstream model payload.

ReleaseJul 21, 2025
Context Windown/a
Modalitiesn/a
API fields: release_date

Strength: MATH-500

Rank #19 across 201 models.

98.0%

Strength: AIME

Rank #36 across 194 models.

71.7%

Strength: MMLU-Pro

Rank #66 across 345 models.

82.8%

Strength Profile

Percentile score by analysis domain.

* Cost is inverted: lower input, output, and blended prices rank higher.

Benchmark Percentiles

Higher bars mean stronger relative placement.

All Benchmarks

MetricDomainValueRank
Artificial Analysis Intelligence Indexoverall25.0#189
Artificial Analysis Coding Indexcoding22.1#177
Artificial Analysis Math Indexmath71.7#92
MMLU-Proreasoning82.8%#66
GPQA
reasoning
75.3%
#153
Humanity's Last Examreasoning10.6%#142
LiveCodeBenchcoding52.4%#141
SciCodecoding, reasoning36.0%#173
MATH-500math98.0%#19
AIMEmath71.7%#36
Output Speedspeed64.7 tok/s#183
Time to First Tokenspeed1.10s#152
Blended Pricecost$1.23/M#191
Input Pricecost$0.700/M#199
Output Pricecost$2.80/M#188
Value Indexcost, overall20.4#190