DeepSeek
DeepSeek-V3.1-Terminus delivers more stable and reliable outputs across benchmarks than the previous version and addresses user feedback (i.e., language consistency and agent upgrades).
DeepSeek release notes
Rank #162 across 526
Rank #117 across 436
Rank #130 across 357
Percentile score by analysis domain.
* Cost is inverted: lower input, output, and blended prices rank higher.
Higher bars mean stronger relative placement.
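The page does not say how a rank becomes a percentile score, so the following is only a minimal sketch under an assumed linear mapping (rank 1 maps to 100, last place maps to 0). The function name `percentile_from_rank` is hypothetical. Because the cost ranks are already inverted (lower price ranks higher), the same mapping would apply to them unchanged.

```python
def percentile_from_rank(rank: int, total: int) -> float:
    """Map a 1-based rank to a 0-100 percentile.

    ASSUMPTION: a linear mapping where rank 1 -> 100 and rank == total -> 0.
    The site's actual formula is not stated on the page.
    """
    if total < 1 or not 1 <= rank <= total:
        raise ValueError("rank must satisfy 1 <= rank <= total")
    if total == 1:
        return 100.0
    return 100.0 * (total - rank) / (total - 1)


# (rank, field size) pairs exactly as listed above
placements = [(162, 526), (117, 436), (130, 357)]

for rank, total in placements:
    score = percentile_from_rank(rank, total)
    print(f"Rank #{rank} across {total}: percentile {score:.1f}")
```

A higher result corresponds to a taller bar, matching the note that higher bars mean stronger relative placement.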
Streaming speed is not measured for this model yet.
| Metric | Domain | Value | Rank |
|---|---|---|---|
| Artificial Analysis Intelligence Index | overall | 28.5 | #162 |
| Artificial Analysis Coding Index | coding | 31.9 | #117 |
| Artificial Analysis Math Index | math | 53.7 | #134 |
| MMLU-Pro | reasoning | 83.6% | #51 |
| | reasoning | 75.1% | #173 |
| Humanity's Last Exam | reasoning | 8.4% | #196 |
| LiveCodeBench | coding | 52.9% | #139 |
| SciCode | coding, reasoning | 32.1% | #245 |
| Blended Price | cost | $0.453/M | #130 |
| Input Price | cost | $0.270/M | #135 |
| Output Price | cost | $1.00/M | #134 |
| Value Index | cost, overall | 62.9 | #104 |
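The blended price in the table is consistent with a 3:1 weighting of input to output tokens. That ratio is an assumption here, since the page does not state it, and `blended_price` is a hypothetical helper. A minimal sketch of the arithmetic:

```python
def blended_price(
    input_price: float,
    output_price: float,
    input_weight: float = 3.0,   # ASSUMPTION: 3 parts input tokens
    output_weight: float = 1.0,  # ASSUMPTION: 1 part output tokens
) -> float:
    """Weighted average price per million tokens (USD)."""
    total_weight = input_weight + output_weight
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive number")
    weighted_sum = input_weight * input_price + output_weight * output_price
    return weighted_sum / total_weight


# Input and output prices from the table, in USD per million tokens
blended = blended_price(0.270, 1.00)
print(f"Blended price: ${blended:.4f}/M")
```

With the listed prices this works out to (3 × 0.270 + 1.00) / 4 = 0.4525, which agrees with the table's $0.453/M after rounding to three decimals.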