Kimi
Kimi K2 Thinking is an advanced open-source thinking model by Moonshot AI. It can execute 200 to 300 sequential tool calls without human intervention, reasoning coherently across hundreds of steps to solve complex problems. Built as a thinking agent, it reasons step by step while using tools, achieving state-of-the-art performance on Humanity's Last Exam (HLE), BrowseComp, and other benchmarks, with major gains in reasoning, agentic search, coding, writing, and general capabilities.
Rank #70 across 526 (Artificial Analysis Intelligence Index)
Rank #95 across 436 (Artificial Analysis Coding Index)
Rank #209 across 357 (Blended Price)
Figure: percentile score by analysis domain. Higher bars mean stronger relative placement. Cost is inverted: lower input, output, and blended prices rank higher.
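The page does not state how a rank is turned into a percentile score. As a rough sketch, one common convention treats the percentile as the share of ranked entries placed at or below the model, with rank 1 as the best. The function name `rank_to_percentile` and the formula are assumptions for illustration, not the site's documented method.

```python
def rank_to_percentile(rank: int, total: int) -> float:
    """Percentage of ranked entries at or below `rank` (rank 1 is best).

    Assumed convention; the source page does not give its formula.
    """
    if not 1 <= rank <= total:
        raise ValueError("rank must be between 1 and total")
    return 100.0 * (total - rank + 1) / total


# Rank and field-size pairs quoted on the page.
quoted_ranks = [(70, 526), (95, 436), (209, 357)]

percentiles = [rank_to_percentile(rank, total) for rank, total in quoted_ranks]

for (rank, total), pct in zip(quoted_ranks, percentiles):
    print(f"rank #{rank} of {total}: {pct:.1f}th percentile")
```

Under this convention a lower rank number always yields a higher percentile, which is consistent with the note that higher bars mean stronger relative placement.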
| Metric | Domain | Value | Rank |
|---|---|---|---|
| Artificial Analysis Intelligence Index | overall | 40.9 | #70 |
| Artificial Analysis Coding Index | coding | 34.8 | #95 |
| Artificial Analysis Math Index | math | 94.7 | #11 |
| MMLU-Pro | reasoning | 84.8% | #36 |
| | reasoning | 83.8% | #80 |
| Humanity's Last Exam | reasoning | 22.3% | #68 |
| LiveCodeBench | coding | 85.3% | #14 |
| SciCode | coding, reasoning | 42.4% | #76 |
| Output Speed | speed | 131.1 tok/s | #98 |
| Time to First Token | speed | 0.86s | #111 |
| Blended Price | cost | $1.08/M | #209 |
| Input Price | cost | $0.600/M | #214 |
| Output Price | cost | $2.50/M | #207 |
| Value Index | cost, overall | 38.0 | #142 |
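The blended price in the table can be related to the input and output prices with a short calculation. The sketch below assumes a 3:1 input-to-output token weighting, which this page does not state. Under that assumption the listed input and output prices give about $1.075 per million tokens, in line with the listed $1.08/M. The function name and its parameters are illustrative.

```python
def blended_price(
    input_price: float,
    output_price: float,
    input_weight: float = 3.0,   # assumed weighting, not stated in the source
    output_weight: float = 1.0,  # assumed weighting, not stated in the source
) -> float:
    """Weighted average price per million tokens."""
    total_weight = input_weight + output_weight
    weighted_sum = input_price * input_weight + output_price * output_weight
    return weighted_sum / total_weight


# Input and output prices from the table, in USD per million tokens.
price = blended_price(0.600, 2.50)
print(f"Blended price: ${price:.3f}/M")
```

A different traffic mix changes the figure: output-heavy workloads sit closer to the $2.50/M output price, so the weights are worth adjusting to match actual usage.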