Text-to-Video
A video generation model that supports multi-shot storytelling. It excels in semantic understanding and instruction following, producing smooth, detailed, and cinematic 1080P HD videos.
1,082
Jun 2025
video
$0.024/s
Catalog
Category rows come directly from Artificial Analysis when the endpoint exposes category-level Elo scores.
| Category | Elo | 95% CI | Appearances |
|---|---|---|---|
| Text | 1,137 | -29/29 | 289 |
| Indoor | 1,123 | -18/18 | 778 |
| Buildings | 1,120 | -14/14 | 1,170 |
| People | 1,120 | -12/12 | 1,783 |
| Sports | 1,119 | -26/26 | 366 |
| Screens | 1,111 | -33/33 | 215 |
| Specific location or era | 1,111 | -23/23 | 466 |
| Multi-scene | 1,109 | -31/31 | 267 |
| Cartoon and anime |
| 1,103 |
| -25/25 |
| 391 |
| Moving camera | 1,100 | -23/23 | 481 |
| Fashion | 1,096 | -28/28 | 341 |
| Long prompt | 1,090 | -25/25 | 455 |