Alibaba
The Qwen3.6 native vision-language Plus series models demonstrate exceptional performance on par with the current state-of-the-art models, with a significant improvement in overall results compared to the 3.5 series. The models have been markedly enhanced in code-related capabilities such as agentic coding, front-end programming, and Vibe coding, as well as in multi-modal general object recognition, OCR, and object localization.
Qwen model releasesRank #17 across 526
Rank #45 across 436
Rank #214 across 357
Percentile score by analysis domain.
* Cost is inverted: lower input, output, and blended prices rank higher.
Higher bars mean stronger relative placement.
| Metric | Domain | Value | Rank |
|---|---|---|---|
| Artificial Analysis Intelligence Index | overall | 50.0 | #17 |
| Artificial Analysis Coding Index | coding | 42.9 | #45 |
| GPQA | reasoning | 88.2% | #34 |
| Humanity's Last Exam | reasoning | 25.7% | #51 |
| SciCode |
| coding, reasoning |
| 40.7% |
| #97 |
| Output Speed | speed | 52.8 tok/s | #230 |
| Time to First Token | speed | 1.89s | #223 |
| Blended Price | cost | $1.13/M | #214 |
| Input Price | cost | $0.500/M | #198 |
| Output Price | cost | $3.00/M | #231 |
| Value Index | cost, overall | 44.4 | #133 |