Xiaomi
Xiaomi MiMo-V2-Flash is a Mixture-of-Experts (MoE) model developed in-house by Xiaomi and designed for extreme inference efficiency, with 309B total parameters (15B active). By combining a hybrid attention architecture with multi-layer MTP (multi-token prediction) inference acceleration, it ranks among the top two open-source models worldwide on multiple agent benchmarks.
Rank #67 across 526
Rank #107 across 436
Rank #46 across 357
Chart: percentile score by analysis domain. Higher bars mean stronger relative placement. Cost is inverted: lower input, output, and blended prices rank higher.
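To make the percentile idea concrete, here is a minimal sketch that turns a "Rank #r across n" line into a percentile. The formula is an assumption, since the page does not state how its percentile scores are computed; `percentile_from_rank` is a hypothetical helper name, and it treats rank 1 as the 100th percentile.

```python
def percentile_from_rank(rank: int, total: int) -> float:
    """Percentage of ranked models placed at or below this one, itself included.

    Assumed formula (not confirmed by the source): rank 1 maps to 100.0,
    and the last rank maps to 100 / total.
    """
    if not 1 <= rank <= total:
        raise ValueError("rank must be between 1 and total")
    return 100.0 * (total - rank + 1) / total


# The three placements listed on this page, as (rank, number of ranked models).
placements = [(67, 526), (107, 436), (46, 357)]

for rank, total in placements:
    pct = percentile_from_rank(rank, total)
    print(f"Rank #{rank} across {total}: percentile {pct:.1f}")
```

Under this convention a higher percentile means a stronger placement, which matches the "higher bars" reading of the chart. For cost metrics the rank is already inverted, so no extra flip is needed.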
| Metric | Domain | Value | Rank |
|---|---|---|---|
| Artificial Analysis Intelligence Index | overall | 41.5 | #67 |
| Artificial Analysis Coding Index | coding | 33.5 | #107 |
| GPQA | reasoning | 83.5% | #83 |
| Humanity's Last Exam | reasoning | 20.0% | #78 |
| SciCode | coding, reasoning | 38.3% | #147 |
| Output Speed | speed | 124.9 tok/s | #107 |
| Time to First Token | speed | 1.40s | #191 |
| Blended Price | cost | $0.150/M | #46 |
| Input Price | cost | $0.100/M | #50 |
| Output Price | cost | $0.300/M | #53 |
| Value Index | cost, overall | 276.7 | #9 |
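The cost rows in the table can be cross-checked with a little arithmetic. The sketch below assumes the blended price is a 3:1 input-to-output weighted average and that the Value Index is the Intelligence Index divided by the blended price. Neither formula is stated on this page; they are assumptions that happen to reproduce the listed figures. The function names are hypothetical.

```python
# Figures copied from the table above (prices in USD per million tokens).
INPUT_PRICE = 0.100
OUTPUT_PRICE = 0.300
INTELLIGENCE_INDEX = 41.5


def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average of input and output prices.

    The 3:1 default weighting is an assumption, not taken from the source.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


def value_index(score: float, price_per_million: float) -> float:
    """Benchmark score per dollar of blended price (assumed definition)."""
    return score / price_per_million


blended = blended_price(INPUT_PRICE, OUTPUT_PRICE)
value = value_index(INTELLIGENCE_INDEX, blended)

print(f"Blended price: ${blended:.3f}/M")
print(f"Value index:   {value:.1f}")
```

With these assumptions the results round to the table's $0.150/M blended price and 276.7 Value Index. This suggests the definitions are consistent with the page, but it does not confirm them.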