Inception
A diffusion-based reasoning LLM that generates text via parallel refinement (not token-by-token), delivering real-time latency with ~1k tokens/sec plus 128K context and built-in tool/JSON support.
Inception model updatesRank #126 across 526
Rank #127 across 436
Rank #118 across 357
Percentile score by analysis domain.
* Cost is inverted: lower input, output, and blended prices rank higher.
Higher bars mean stronger relative placement.
| Metric | Domain | Value | Rank |
|---|---|---|---|
| Artificial Analysis Intelligence Index | overall | 32.8 | #126 |
| Artificial Analysis Coding Index | coding | 30.6 | #127 |
| GPQA | reasoning | 77.0% | #148 |
| Humanity's Last Exam | reasoning | 15.5% | #105 |
| SciCode |
| coding, reasoning |
| 38.7% |
| #139 |
| Output Speed | speed | 758.7 tok/s | #1 |
| Time to First Token | speed | 2.83s | #242 |
| Blended Price | cost | $0.375/M | #118 |
| Input Price | cost | $0.250/M | #130 |
| Output Price | cost | $0.750/M | #113 |
| Value Index | cost, overall | 87.5 | #70 |