DeepSeek
DeepSeek-V3.1-Terminus delivers more stable & reliable outputs across benchmarks compared to the previous version and addresses user feedback (i.e. language consistency and agent upgrades).
DeepSeek release notes
Overall: Rank #116 across 526
Coding: Rank #105 across 436
Cost: Rank #248 across 357
Percentile score by analysis domain.
* Cost is inverted: lower input, output, and blended prices rank higher.
Higher bars mean stronger relative placement.
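The page does not say how a rank becomes a percentile score, so the following is only a minimal sketch under one common convention: the percentile is the share of ranked models placed at or below the given rank. The function name `rank_to_percentile` and the mapping of the three headline ranks to the overall, coding, and cost domains are assumptions (the ranks match those rows in the table). Because cost is already ranked with the cheapest model first, the same formula applies to it without further inversion.

```python
def rank_to_percentile(rank: int, total: int) -> float:
    """Percentage of ranked models placed at or below `rank` (rank 1 is best).

    Assumed convention, not confirmed by the source page.
    """
    if not 1 <= rank <= total:
        raise ValueError("rank must be between 1 and total")
    return 100.0 * (total - rank + 1) / total


# Headline placements from the page; the domain labels are an assumption.
placements = {
    "overall": (116, 526),
    "coding": (105, 436),
    "cost": (248, 357),  # cost ranks cheaper models higher
}

for domain, (rank, total) in placements.items():
    print(f"{domain}: {rank_to_percentile(rank, total):.1f}")
```

Under this convention a higher value means stronger relative placement, which is consistent with the chart note above. Other conventions, such as excluding the model itself from the count, shift the result slightly.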
Streaming speed is not measured for this model yet.
| Metric | Domain | Value | Rank |
|---|---|---|---|
| Artificial Analysis Intelligence Index | overall | 33.9 | #116 |
| Artificial Analysis Coding Index | coding | 33.7 | #105 |
| Artificial Analysis Math Index | math | 89.7 | #27 |
| MMLU-Pro | reasoning | 85.1% | #31 |
| | reasoning | 79.2% | #122 |
| Humanity's Last Exam | reasoning | 15.2% | #107 |
| LiveCodeBench | coding | 79.8% | #29 |
| SciCode | coding, reasoning | 40.6% | #98 |
| Blended Price | cost | $1.91/M | #248 |
| Input Price | cost | $1.64/M | #282 |
| Output Price | cost | $2.75/M | #219 |
| Value Index | cost, overall | 17.7 | #223 |
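The table does not state how the blended price is derived from the input and output prices. As a rough check, the sketch below assumes a 3:1 input-to-output token weighting. That ratio is an assumption, and the function name `blended_price` is hypothetical.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per million tokens.

    The default 3:1 input:output weighting is an assumption,
    not something the source table confirms.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# Prices taken from the table, in dollars per million tokens.
blend = blended_price(1.64, 2.75)
print(f"blended: ${blend:.4f}/M")

# Intelligence Index divided by the blended price, for comparison
# with the listed Value Index (the source does not give its formula).
print(f"index per dollar: {33.9 / blend:.1f}")
```

With these inputs the weighted average lands within a cent of the listed $1.91/M, and the Intelligence Index divided by that price rounds to the listed Value Index of 17.7. Both agreements are suggestive rather than confirmation of the formulas used.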