Mistral
Devstral is an agentic LLM for software engineering tasks, built through a collaboration between Mistral AI and All Hands AI 🙌. Devstral excels at using tools to explore codebases, editing multiple files, and powering software engineering agents.
Overall: rank #317 across 526
Coding: rank #309 across 436
Cost: rank #42 across 357
[Chart: percentile score by analysis domain. Higher bars mean stronger relative placement. Cost is inverted: lower input, output, and blended prices rank higher.]
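The page does not say how a rank is turned into a percentile score. As a rough sketch, assuming the common convention that the percentile is the share of ranked models placed at or below a given rank, the headline ranks could be converted as follows. The helper name `percentile_from_rank` is hypothetical, and the site's actual formula may differ.

```python
def percentile_from_rank(rank: int, total: int) -> float:
    """Percentage of the field ranked at or below `rank` (rank 1 is best).

    Assumed convention for illustration; not confirmed by the source page.
    """
    if not 1 <= rank <= total:
        raise ValueError("rank must be between 1 and total")
    return 100.0 * (total - rank + 1) / total


# Headline ranks quoted above: (rank, number of models ranked).
headline_ranks = [(317, 526), (309, 436), (42, 357)]

for rank, total in headline_ranks:
    score = percentile_from_rank(rank, total)
    print(f"#{rank} of {total}: percentile score {score:.1f}")
```

Under this convention the best-ranked model scores 100 and the last-ranked model scores just above 0, which matches the note that higher bars mean stronger placement.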
| Metric | Domain | Value | Rank |
|---|---|---|---|
| Artificial Analysis Intelligence Index | overall | 15.2 | #317 |
| Artificial Analysis Coding Index | coding | 12.1 | #309 |
| Artificial Analysis Math Index | math | 29.3 | #187 |
| MMLU-Pro | reasoning | 62.2% | #263 |
| | reasoning | 41.4% | #408 |
| Humanity's Last Exam | reasoning | 3.7% | #459 |
| LiveCodeBench | coding | 25.4% | #251 |
| SciCode | coding, reasoning | 24.3% | #346 |
| MATH-500 | math | 63.5% | #157 |
| AIME | math | 0.3% | #180 |
| Output Speed | speed | 42.3 tok/s | #270 |
| Time to First Token | speed | 0.51s | #39 |
| Blended Price | cost | $0.150/M | #42 |
| Input Price | cost | $0.100/M | #39 |
| Output Price | cost | $0.300/M | #49 |
| Value Index | cost, overall | 101.3 | #54 |
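
The blended price in the table is consistent with a 3:1 weighting of input to output tokens: (3 × $0.100 + 1 × $0.300) / 4 = $0.150 per million tokens. The page does not state the weighting, so the 3:1 ratio here is an assumption, and `blended_price` is a hypothetical helper name used only for illustration.

```python
def blended_price(
    input_price: float,
    output_price: float,
    input_weight: float = 3.0,   # assumed 3:1 input:output ratio
    output_weight: float = 1.0,
) -> float:
    """Weighted average price per million tokens."""
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# Input and output prices from the table, in USD per million tokens.
price = blended_price(0.100, 0.300)
print(f"${price:.3f}/M")  # prints "$0.150/M"
```

A workload that is heavier on output tokens would pay more than the blended figure suggests; passing different weights to the same function gives an estimate for other mixes.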