Text-to-Video
Kling 2.6 introduces a groundbreaking "Native Audio" capability, enabling the generation of complete videos in a single go, including natural voice, action sound effects, and environmental ambient sounds, providing an immersive "what you see if what you hear" experience.
1,201
Jan 2026
video
$0.042/s
Catalog
Category rows come directly from Artificial Analysis when the endpoint exposes category-level Elo scores.
| Category | Elo | 95% CI | Appearances |
|---|---|---|---|
| Fantasy | 1,290 | -30/30 | 455 |
| Action | 1,274 | -23/23 | 724 |
| 3D animation | 1,254 | -43/43 | 158 |
| Sports | 1,247 | -24/24 | 650 |
| Moving camera | 1,241 | -22/22 | 904 |
| Specific location or era | 1,240 | -21/21 | 895 |
| People | 1,233 | -11/11 | 3,104 |
| Fashion | 1,233 | -26/26 | 574 |
| Photorealistic |
| 1,233 |
| -37/37 |
| 232 |
| Physics | 1,232 | -13/13 | 2,346 |
| Long prompt | 1,232 | -22/22 | 812 |
| Transport | 1,224 | -15/15 | 1,637 |