Image-to-Video
State-of-the-art video generation across quality, cost, and latency. Grok Imagine is x.AI's most powerful video-audio generative model yet. Bring an image to life, start from a simple text prompt, or even refine a complex cinematic sequence.
1,329
Jan 2026
video
$0.05/s
Catalog
Category rows come directly from Artificial Analysis when the endpoint exposes category-level Elo scores.
| Category | Elo | 95% CI | Appearances |
|---|---|---|---|
| Abstract | 1,424 | -37/37 | 498 |
| Cartoon and anime | 1,411 | -37/37 | 535 |
| Fantasy | 1,405 | -34/34 | 629 |
| 3D animation | 1,403 | -34/34 | 615 |
| Action | 1,390 | -26/26 | 905 |
| Sci Fi | 1,368 | -55/55 | 181 |
| Food | 1,359 | -26/26 | 1,075 |
| Buildings | 1,357 | -14/14 | 3,157 |
| Moving camera |
| 1,356 |
| -14/14 |
| 3,274 |
| Screens | 1,355 | -34/34 | 551 |
| Transport | 1,353 | -19/19 | 1,708 |
| Indoor | 1,338 | -16/16 | 2,799 |