grok-imagine-video Text-to-Video

xAI | Text to Video

grok-imagine-video

State-of-the-art video generation across quality, cost, and latency. Grok Imagine is xAI's most powerful video-audio generative model yet. Bring an image to life, start from a simple text prompt, or refine a complex cinematic sequence.

Rank: #5

Elo: 1,236

95% CI: -8/8

Appearances: 6,338

Release: Jan 2026

Gateway Type: video

Price: $0.05/s
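
As a rough illustration of what the listed $0.05/s price implies, the sketch below estimates the cost of a batch of clips. It assumes billing is strictly linear in seconds of generated video, with no minimum charge, rounding step, or resolution surcharge; none of that is stated on this page, and `estimate_cost` is a hypothetical helper, not part of any xAI API.

```python
PRICE_PER_SECOND_USD = 0.05  # listed price on this page: $0.05/s


def estimate_cost(duration_seconds: float, clips: int = 1) -> float:
    """Estimate cost in USD, assuming purely linear per-second billing."""
    if duration_seconds < 0 or clips < 0:
        raise ValueError("duration_seconds and clips must be non-negative")
    # Round to cents for display; real invoices may round differently.
    return round(PRICE_PER_SECOND_USD * duration_seconds * clips, 2)


if __name__ == "__main__":
    print(estimate_cost(8))            # one 8-second clip
    print(estimate_cost(6, clips=10))  # ten 6-second clips
```

Under these assumptions, one 8-second clip comes to $0.40 and ten 6-second clips come to $3.00.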

Catalog

Category	Elo	95% CI	Appearances
Cartoon and anime	1,337	-24/24	837
Fantasy	1,325	-29/29	555
Action	1,318	-22/22	952
Sci Fi	1,310	-20/20	1,251
Sports	1,305	-24/24	772
Long prompt	1,290	-22/22	1,028
Fashion	1,290	-26/26	716
Multi-scene	1,280	-29/29	563
3D animation
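
To give the Elo figures above some intuition, the sketch below converts a rating gap into an expected head-to-head win rate using the classic Elo logistic formula. The base-10, 400-point scale is an assumption; this page does not say which rating model or scale the leaderboard uses, and the opponent rating in the example is hypothetical.

```python
def expected_win_rate(elo_a: float, elo_b: float, scale: float = 400.0) -> float:
    """Expected probability that A is preferred over B under classic Elo.

    Assumes the standard base-10 logistic with a 400-point scale, which
    may differ from the leaderboard's actual rating model.
    """
    return 1.0 / (1.0 + 10 ** ((elo_b - elo_a) / scale))


if __name__ == "__main__":
    # A 1,337 rating (the Cartoon and anime score above) against a
    # hypothetical opponent rated 1,236 in the same category.
    print(f"{expected_win_rate(1337, 1236):.2f}")  # roughly 0.64
    # Equal ratings give an even split.
    print(f"{expected_win_rate(1236, 1236):.2f}")  # 0.50
```

On this scale a gap of about 100 points corresponds to being preferred in a little under two thirds of comparisons, so differences smaller than the reported confidence intervals should not be read as meaningful.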
