Features of Kling Video O1
1. Unified Multimodal Input - The Kling Omini model accepts text, images, and video as input simultaneously. This allows for complex, multi-layered scene creation that was previously impossible with single-input models.
2. Director-Level Control - Users gain precise control over camera movements, lighting, and character expressions. This feature, often called "Director Mode," ensures your final video matches your exact creative vision.
3. Extended Video Length - Kling Omini 1 can generate longer, more coherent video sequences, up to two minutes in length. This is a significant advantage for creating full narrative scenes or detailed product demonstrations.
4. Native Audio Integration - The model generates video with native, synchronized audio. This eliminates the need for external audio editing, streamlining your production workflow.
5. Unprecedented Consistency - The Kling Omini video generator maintains subject and style consistency across long clips. This solves the common problem of flickering or character drift in AI-generated videos.
6. High-Fidelity 1080p Output - Every video is rendered in stunning 1080p resolution. This ensures professional quality for all your projects, from social media to broadcast.
Input Media Supported
- Images: You can submit up to 7 images, each with a minimum resolution of 300 pixels, a maximum file size of 10 MB, in formats such as JPG, JPEG, or PNG.
- Videos: You may upload a single video, lasting between 3 to 10 seconds, with a maximum file size of 200 MB and a resolution of up to 2K.
- Elements: You can upload or generate multiple images (up to 4) from varying angles to compose an element. This provides the model with richer reference information.
Note: When a video is present, you can upload up to 4 images/elements combined. Without a video, you can upload up to 7 images/elements.














