EBEasy BenchmarksLLM model index
Workspace
Overview
Benchmarks
Benchmarks list
Overall Index
Coding
Math
MMLU-Pro
Speed
Value
Models
All models
GPT-5.5 (xhigh)
GPT-5.5 (high)
Claude Opus 4.7 (Adaptive Reasoning, Max Effort)
Gemini 3.1 Pro Preview
GPT-5.4 (xhigh)
Artificial Analysis data
Back

xAI

Grok 4.20 0309 v2 (Reasoning)

Grok 4.20 0309 v2 (Reasoning) is an xAI Grok model profile, part of xAI's assistant and reasoning model family for real-time, coding, and general-purpose AI workflows. The benchmark snapshot highlights how this Grok variant compares with competing frontier and open-weight models on capability, speed, latency, and cost.

xAI model releases

Operational Metrics

Output Speed89.3 tok/s
First Token24.38s
Blended Price$3.00/M

Model Metadata

Queryable facts extracted from the upstream model payload.

ReleaseApr 7, 2026
Context Windown/a
Modalitiesn/a
API fields: release_date

Strength: GPQA

Rank #8 across 478 models.

91.1%

Strength: HLE

Rank #19 across 474 models.

32.2%

Strength: Overall

Rank #24 across 500 models.

49.3

Watch Area: TTFT

Rank #280 across 293 models.

24.38s

Watch Area: Input $

Rank #268 across 325 models.

$2.00/M

Watch Area: Blended $

Rank #245 across 325 models.

$3.00/M

Strength Profile

Percentile score by analysis domain.

* Cost is inverted: lower input, output, and blended prices rank higher.

Benchmark Percentiles

Higher bars mean stronger relative placement.

All Benchmarks

MetricDomainValueRank
Artificial Analysis Intelligence Indexoverall49.3#24
Artificial Analysis Coding Indexcoding40.5#45
GPQAreasoning91.1%#8
Humanity's Last Examreasoning32.2%#19
SciCode
coding, reasoning
45.6%
#39
Output Speedspeed89.3 tok/s#138
Time to First Tokenspeed24.38s#280
Blended Pricecost$3.00/M#245
Input Pricecost$2.00/M#268
Output Pricecost$6.00/M#233
Value Indexcost, overall16.4#206