Text-to-Video
Veo 3.1 Fast is a specialized, high-speed variant of Google DeepMind’s Veo 3.1 text-to-video model, optimized for rapid generation of 8-second, high-fidelity videos. It is designed to create cinematic, 1080p, or 720p content with improved prompt adherence and native audio, making it ideal for creating quick, high-quality video clips, social media content, and ad creatives.
1,213
Jan 2026
video
$0.10/s
Catalog
Category rows come directly from Artificial Analysis when the endpoint exposes category-level Elo scores.
| Category | Elo | 95% CI | Appearances |
|---|---|---|---|
| 3D animation | 1,315 | -45/45 | 196 |
| Sports | 1,276 | -26/26 | 616 |
| Fashion | 1,258 | -28/28 | 490 |
| Text | 1,254 | -29/29 | 500 |
| Sci Fi | 1,254 | -21/21 | 925 |
| Transport | 1,252 | -16/16 | 1,577 |
| People | 1,249 | -12/12 | 2,814 |
| Cartoon and anime | 1,247 | -25/25 | 601 |
| Photorealistic |
| 1,246 |
| -40/40 |
| 236 |
| Specific location or era | 1,245 | -22/22 | 845 |
| Fantasy | 1,241 | -32/32 | 418 |
| Indoor | 1,238 | -18/18 | 1,265 |