Text-to-Video
Veo 3.1 Fast is a specialized, high-speed variant of Google DeepMind’s Veo 3.1 text-to-video model, optimized for rapid generation of 8-second, high-fidelity videos. It is designed to create cinematic, 1080p, or 720p content with improved prompt adherence and native audio, making it ideal for creating quick, high-quality video clips, social media content, and ad creatives.
1,220
Oct 2025
video
$0.10/s
Catalog
Category rows come directly from Artificial Analysis when the endpoint exposes category-level Elo scores.
| Category | Elo | 95% CI | Appearances |
|---|---|---|---|
| Text | 1,288 | -29/29 | 453 |
| Specific location or era | 1,268 | -22/22 | 819 |
| Sports | 1,265 | -25/25 | 546 |
| Long prompt | 1,255 | -23/23 | 675 |
| Indoor | 1,254 | -17/17 | 1,244 |
| People | 1,253 | -11/11 | 2,697 |
| Buildings | 1,251 | -14/14 | 2,042 |
| Sci Fi | 1,250 | -20/20 | 860 |
| Transport |
| 1,246 |
| -15/15 |
| 1,574 |
| Weather and effects | 1,238 | -11/11 | 2,938 |
| Fashion | 1,238 | -27/27 | 492 |
| Photorealistic | 1,228 | -39/39 | 195 |