Kling O1 API
Kling O1 is a video generation model with image-to-video, video editing, and fast video editing variants. It supports 3-20 second videos and accepts reference images for style-guided generation.
Kling O1 API for unified video generation and editing
Build modern video workflows with Kling O1. Use one API to create new clips from prompts, refine existing footage, and keep characters and scenes consistent across outputs for marketing, social content, and commerce.

What can you build with the Kling O1 API?
Prompt-to-video storytelling
Turn short creative briefs into videos with Kling O1 and keep the same look across multiple outputs. This is useful for social campaigns, brand series, or episodic content where consistency matters more than one-off experimentation.

Reference-driven edits
Use Kling O1 to refine or rework existing footage with instruction-based edits. Keep the core subject intact while adjusting style, lighting, or scene details so teams can iterate fast without a full re-shoot.

Commercial content at scale
Kling O1 is positioned for production teams in advertising, e-commerce, and social media. Use it to generate variations, keep brand tone consistent, and deliver content at the pace required by modern channels.

Why teams choose Kling O1
Kling O1 emphasizes unified creation and editing with consistency across characters and scenes, which reduces rework and keeps creative pipelines predictable.
Unified multimodal workflow
Text, image, video, and subject inputs live in one model.
Consistency-first outputs
Maintain recognizable characters and scenes across clips.
Production-friendly focus
Built for film, social, ads, and commerce workflows.
How to integrate the Kling O1 API
A simple flow from input to production-ready video.
Choose inputs and mode
Select text, image, video, or subject inputs based on your workflow and desired output type.
Submit a generation task
Send your request with instructions and any references, then track the task until results are ready.
Review and iterate
Download results, compare variations, and reuse the same structure for fast iteration.
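The "track the task until results are ready" step can be sketched as a small polling loop. This is a minimal illustration, not the service's client: the status-query call is not documented in this section, so it is passed in as a caller-supplied function (`fetch_status`), and the status names `completed`, `failed`, and `cancelled` are assumptions.

```python
import time
from typing import Callable, Dict

# Hypothetical terminal status names; the real API's values may differ.
TERMINAL_STATUSES = {"completed", "failed", "cancelled"}


def wait_for_task(
    fetch_status: Callable[[str], Dict],
    task_id: str,
    interval_s: float = 5.0,
    timeout_s: float = 600.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict:
    """Poll fetch_status(task_id) until the task reaches a terminal status.

    fetch_status is supplied by the caller and should return the task as a
    dict with a "status" key (an assumed response shape).
    """
    deadline = time.monotonic() + timeout_s
    while True:
        task = fetch_status(task_id)
        if task.get("status") in TERMINAL_STATUSES:
            return task
        if time.monotonic() >= deadline:
            raise TimeoutError(f"task {task_id} did not finish within {timeout_s}s")
        sleep(interval_s)
```

Injecting `fetch_status` and `sleep` keeps the loop independent of any particular HTTP client and makes it easy to exercise with a stub before wiring it to the real status endpoint.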
Core capabilities of the Kling O1 API
Unified video creation and editing in one model
Unified multimodal engine
Kling O1 is introduced as a unified multimodal model that combines generation and editing in a single system. This allows teams to keep one integration while handling both new clip creation and edits across the same workflow.
Text, image, video, and subject inputs
Kling O1 is described as supporting text, image, video, and subject inputs. More input types give creators more ways to steer the output, which helps when results must stay consistent across clips.
Consistency for characters and scenes
Kling O1 is positioned to address the consistency challenge in AI video generation. This helps teams keep character identity, props, and scene details aligned across multiple clips.
Generation plus editing workflows
Instead of switching tools, Kling O1 brings generation and editing tasks into one engine. This is useful for marketing teams that need to create, then refine, without breaking continuity.
Commercial content use cases
The model is described as suitable for film, television, social media, advertising, and e-commerce workflows. That makes it a practical choice for teams building content at scale.
Multimodal visual language
Kling O1 is built on a multimodal visual language framework. This helps it interpret intent across text and visual references so outputs align more closely with creative direction.
API Reference
Authentication
All APIs require Bearer Token authentication.
Authorization: Bearer YOUR_API_KEY
Endpoint: /v1/videos/generations
Edit Video (Fast)
The Kling O1 Video Edit Fast model (kling-o1-video-edit-fast) transforms and edits videos with AI-powered style transfer.
Processing is asynchronous: the request returns a task ID, which you use to query the task status.
Generated video links are valid for 24 hours, so download and store the results promptly.
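As a rough sketch of what a submission might look like, the snippet below builds an authenticated request for the `/v1/videos/generations` path using only the Python standard library. Several details are assumptions rather than documented facts: the host `https://api.example.com` is a placeholder, the `POST` method and JSON body encoding are inferred from common practice, and `build_edit_request` is a hypothetical helper name.

```python
import json
import urllib.request

# Assumption: placeholder host; substitute the provider's real base URL.
BASE_URL = "https://api.example.com"


def build_edit_request(api_key, prompt, video_url, image_urls=None):
    """Build (but do not send) a video edit request with Bearer authentication."""
    payload = {
        "model": "kling-o1-video-edit-fast",
        "prompt": prompt,
        "video_urls": [video_url],  # exactly one input video per request
    }
    if image_urls:
        payload["image_urls"] = list(image_urls)  # optional style references
    return urllib.request.Request(
        BASE_URL + "/v1/videos/generations",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",  # assumption: JSON request body
        },
        method="POST",  # assumption: the HTTP method is not stated in this section
    )
```

Passing the returned object to `urllib.request.urlopen` would send it. The response format is not described here, so the field that carries the task ID should be taken from the service's actual response.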
Important Notes
- Supports 6-20 second videos for editing.
- Videos under 6 seconds are billed as 6 seconds.
- Videos over 20 seconds will be truncated to 20 seconds.
- Maximum file size: 100MB per video.
- Supported formats: MP4, WebM, MOV, AVI.
- Optional: Up to 4 reference images for style guidance.
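The duration rules above reduce to a clamp, which the following sketch makes explicit. It assumes that a truncated video is billed for the 20 seconds actually processed and that no rounding to whole seconds applies; neither point is stated in the notes, and the function names are illustrative.

```python
MIN_BILLED_SECONDS = 6
MAX_PROCESSED_SECONDS = 20


def processed_seconds(input_seconds: float) -> float:
    """Footage actually edited: inputs longer than 20 s are truncated."""
    return min(input_seconds, MAX_PROCESSED_SECONDS)


def billable_seconds(input_seconds: float) -> float:
    """Seconds billed: clips shorter than 6 s are billed as 6 s.

    Assumption: truncated videos are billed for the 20 s that are processed.
    """
    return max(MIN_BILLED_SECONDS, processed_seconds(input_seconds))
```

For example, a 3-second clip would be billed as 6 seconds, while a 45-second upload would be cut to 20 seconds before editing.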
Request Parameters
model (string, required, default: kling-o1-video-edit-fast)
Video editing model name.
Example: kling-o1-video-edit-fast
prompt (string, required)
Prompt describing how to edit and transform the video.
Notes
- Limited to 2000 tokens
Example: Transform the video with cinematic color grading and smooth transitions.
video_urls (array, required)
Input video URL list for video editing.
Notes
- Required: exactly 1 video
- Maximum: 1 video per request
- Max size: 100MB per video
- Formats: .mp4, .mov
- Duration: 6-20 seconds (longer videos will be truncated)
- URLs must be directly accessible by the server
Example: ["https://example.com/input-video.mp4"]
image_urls (array, optional)
Reference image URL list for style guidance.
Notes
- Optional
- Maximum: 4 images per request
- Max size: 10MB per image
- Formats: .jpg, .jpeg, .png, .webp
- Use reference images to guide the AI in understanding your desired style
Example: ["https://example.com/reference1.jpg", "https://example.com/reference2.jpg"]
keep_original_sound (boolean, optional, default: true)
Whether to keep the original audio from the input video.
| Value | Description |
|---|---|
| true | Preserve original audio track |
| false | Generate new audio or mute |
Example: true
aspect_ratio (string, optional, default: 16:9)
Output video aspect ratio.
| Value | Description |
|---|---|
| 16:9 | Landscape video |
| 9:16 | Portrait video |
| 1:1 | Square video |
Example: 16:9
callback_url (string, optional)
HTTPS callback address notified after task completion.
Notes
- Triggered on completion, failure, or cancellation
- Sent after billing confirmation
- HTTPS only, no internal IPs
- Max length: 2048 chars
- Timeout: 10s, Max 3 retries
Example: https://your-domain.com/webhooks/video-task-completed
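The parameter constraints listed above can be checked on the client before a request is submitted. The sketch below is one possible pre-flight check, not the service's own validation: `validate_params` and `is_internal_host` are hypothetical helpers, the internal-address test only covers `localhost` and literal IP addresses (DNS names are not resolved), and file size, format, duration, and the 2000-token prompt limit are left to the server.

```python
import ipaddress
from urllib.parse import urlparse

ASPECT_RATIOS = ("16:9", "9:16", "1:1")
MAX_REFERENCE_IMAGES = 4
MAX_CALLBACK_URL_LENGTH = 2048


def is_internal_host(host: str) -> bool:
    """True for localhost and literal private, loopback, or link-local IPs."""
    if host == "localhost":
        return True
    try:
        ip = ipaddress.ip_address(host)
    except ValueError:
        return False  # a DNS name; this sketch does not resolve it
    return ip.is_private or ip.is_loopback or ip.is_link_local


def validate_params(params: dict) -> list:
    """Return a list of human-readable problems; an empty list means none found."""
    errors = []

    prompt = params.get("prompt")
    if not isinstance(prompt, str) or not prompt.strip():
        errors.append("prompt is required")
    # The 2000-token prompt limit is not checked: it depends on the
    # service's tokenizer, which is not specified here.

    video_urls = params.get("video_urls")
    if not isinstance(video_urls, list) or len(video_urls) != 1:
        errors.append("video_urls must contain exactly 1 URL")

    image_urls = params.get("image_urls", [])
    if not isinstance(image_urls, list) or len(image_urls) > MAX_REFERENCE_IMAGES:
        errors.append("image_urls accepts at most 4 URLs")

    if params.get("aspect_ratio", "16:9") not in ASPECT_RATIOS:
        errors.append("aspect_ratio must be 16:9, 9:16, or 1:1")

    if not isinstance(params.get("keep_original_sound", True), bool):
        errors.append("keep_original_sound must be a boolean")

    callback_url = params.get("callback_url")
    if callback_url is not None:
        if not isinstance(callback_url, str):
            errors.append("callback_url must be a string")
        else:
            parsed = urlparse(callback_url)
            if parsed.scheme != "https":
                errors.append("callback_url must use HTTPS")
            if len(callback_url) > MAX_CALLBACK_URL_LENGTH:
                errors.append("callback_url exceeds 2048 characters")
            if is_internal_host(parsed.hostname or ""):
                errors.append("callback_url must not target an internal address")

    return errors
```

Returning every problem at once, rather than raising on the first, lets a caller surface all issues in a single pass before spending an API call.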