xAI
Grok 4 is an xAI Grok model profile, part of xAI's assistant and reasoning model family for real-time, coding, and general-purpose AI workflows. The benchmark snapshot highlights how this Grok variant compares with competing frontier and open-weight models on capability, speed, latency, and cost.
xAI model releasesQueryable facts extracted from the upstream model payload.
Rank #2 across 194 models.
Rank #6 across 201 models.
Rank #15 across 345 models.
Rank #302 across 325 models.
Rank #300 across 325 models.
Rank #300 across 325 models.
Percentile score by analysis domain.
* Cost is inverted: lower input, output, and blended prices rank higher.
Higher bars mean stronger relative placement.
| Metric | Domain | Value | Rank |
|---|---|---|---|
| Artificial Analysis Intelligence Index | overall | 41.5 | #65 |
| Artificial Analysis Coding Index | coding | 40.5 | #44 |
| Artificial Analysis Math Index | math | 92.7 | #16 |
| MMLU-Pro | reasoning | 86.6% | #15 |
| reasoning |
| 87.7% |
| #27 |
| Humanity's Last Exam | reasoning | 23.9% | #51 |
| LiveCodeBench | coding | 81.9% | #23 |
| SciCode | coding, reasoning | 45.7% | #38 |
| MATH-500 | math | 99.0% | #6 |
| AIME | math | 94.3% | #2 |
| Output Speed | speed | 50.3 tok/s | #226 |
| Time to First Token | speed | 7.89s | #250 |
| Blended Price | cost | $6.00/M | #300 |
| Input Price | cost | $3.00/M | #300 |
| Output Price | cost | $15.00/M | #302 |
| Value Index | cost, overall | 6.9 | #270 |