Kling 3.0 API
Kling 3.0 video model with text-to-video, image-to-video, and custom element creation. Supports 3-15 second videos with per-second billing.
Upload first frame or end frame image to generate video
No sample available
First frame image. At least one of first/end frame is required.
Click to upload or drag and drop
Supported formats: JPG, JPEG, PNG
Maximum file size: 10MB; Maximum files: 1
End-frame image (optional)
Click to upload or drag and drop
Supported formats: JPG, JPEG, PNG
Maximum file size: 10MB; Maximum files: 1
Click Generate to see preview
History
最多保留20条0 运行中 · 0 已完成
Billing Rules
- •Price shown is per second
- •Duration range: 3-15 seconds
- •Total = price/second × duration
Pricing
| Model | Mode | Quality | Sound | Price |
|---|---|---|---|---|
| Kling 3.0 Image to Video | Video Generation | 720p | Off | Popular $0.0750/ second(5.4 Credits) |
| Kling 3.0 Image to Video | Video Generation | 720p | On | $0.1125/ second(8.1 Credits) |
| Kling 3.0 Image to Video | Video Generation | 1080p | Off | $0.1000/ second(7.1982 Credits) |
| Kling 3.0 Image to Video | Video Generation | 1080p | On | $0.1500/ second(10.8 Credits) |
If it's down, we automatically use the next cheapest available—ensuring 99.9% uptime at the best possible price.
面向视频创作的 Kling 3.0 API
使用 Kling 3.0 模型进行构建。从文本或图像生成视频——通过一个统一的 API 完成,支持 3-15 秒输出。

利用 Kling 3.0 API 可以构建什么?
为什么团队选择 Kling 3.0
Kling 3.0 在单一模型家族中提供文生视频和图生视频两种模式,价格有竞争力。
两种专业模式
文本和图像模式覆盖核心视频创作工作流。
3.0 架构
基于 Kling 3.0 基础架构,提供高质量视频生成。
灵活的 3-15 秒输出
生成 3 到 15 秒的视频,按秒计费。
如何集成 Kling 3.0 API
从输入到生产就绪视频的三个步骤。
选择模式
根据您的工作流需求,选择文生视频或图生视频模式。
提交生成任务
发送带有提示词或图像的请求。跟踪异步任务直到结果就绪。
评审与迭代
下载结果,比较变体,并复用相同的结构在不同营销活动中快速迭代。
Kling 3.0 API 核心能力
视频 AI,两种专业模式
文生视频生成
纯粹从文本描述生成视频。Kling 3.0 解读自然语言提示,无需任何视觉输入即可生成动态视频内容。
图生视频转换
将静态图像转换为动态视频。提供参考图像,让 Kling 3.0 以自然的运动和场景动态将其动画化。
多镜头支持
创建复杂的多镜头视频,支持自定义场景切换、每镜头提示词和时长控制,满足专业视频制作需求。
音效
为视频添加 AI 生成的音效。根据需要开启或关闭音效,音频生成定价透明。
按秒计费
按秒计费,只为生成的内容付费。视频范围从 3 到 15 秒,让团队对每个项目都有精确的成本控制。
720p 和 1080p 质量
在标准 720p 和高质量 1080p 输出分辨率之间选择,为特定用例平衡质量和成本。
常见问题
Everything you need to know about the product and billing.
API Reference
Select endpoint
Authentication
All APIs require Bearer Token authentication.
Authorization:
Bearer YOUR_API_KEY/v1/videos/generationsCreate Video
Kling 3.0 Image to Video (kling-v3-image-to-video) transforms static images into dynamic videos using the 3.0 model. Supports first frame, end frame, subject control, multi-shot, and sound effects.
Asynchronous processing mode, use the returned task ID to query status.
Generated video links are valid for 24 hours, please save them promptly.
Important Notes
- At least one of image_start (first frame) or image_end (end frame) is required.
- Image requirements: JPG/JPEG/PNG, ≤ 10MB, width/height ≥ 300px, aspect ratio 1:2.5 ~ 2.5:1.
- Video duration: 3-15 seconds, billed per second.
- Pricing varies by quality and sound: 720p+off = 1.0x, 720p+on = 1.5x, 1080p+off = 1.333x, 1080p+on = 2.0x.
Request Parameters
modelstringRequiredDefault: kling-v3-image-to-videoVideo generation model name.
kling-v3-image-to-videopromptstringOptionalText prompt describing what kind of motion and video to generate.
Notes
- Max 2500 characters
- Optional for image-to-video
A gentle breeze moves through the scene, creating subtle motion and life.image_startstringOptionalFirst-frame image URL. At least one of image_start or image_end must be provided.
Notes
- JPG/JPEG/PNG format
- Max size: 10MB
- Width/height ≥ 300px, aspect ratio 1:2.5 ~ 2.5:1
https://example.com/first-frame.jpgimage_endstringOptionalEnd-frame image URL. At least one of image_start or image_end must be provided.
Notes
- Optional
- Same format requirements as image_start
https://example.com/end-frame.jpgdurationintegerOptionalDefault: 5Specifies the generated video duration in seconds.
Notes
- Range: 3-15 seconds (integer)
- Base price: 5.4 credits per second
- Minimum billing: 3 seconds
5aspect_ratiostringOptionalVideo aspect ratio. When a first-frame image is provided, this can be omitted (auto-adapts to image ratio).
| Value | Description |
|---|---|
| 16:9 | Landscape video |
| 9:16 | Portrait video |
| 1:1 | Square video |
16:9qualitystringOptionalDefault: 720pVideo resolution quality. Affects billing multiplier.
| Value | Description |
|---|---|
| 720p | Standard 720P (1.0x base) |
| 1080p | High quality 1080P (1.333x base) |
720psoundstringOptionalDefault: offSound effect control. Affects billing multiplier.
| Value | Description |
|---|---|
| off | No sound effects (1.0x) |
| on | Generate sound effects (1.5x) |
Notes
- Combined multiplier: 720p+off=1.0x, 720p+on=1.5x, 1080p+off=1.333x, 1080p+on=2.0x
offcallback_urlstringOptionalHTTPS callback address after task completion.
Notes
- Triggered on completion, failure, or cancellation
- HTTPS only, no internal IPs
- Max length: 2048 chars
- Timeout: 10s, Max 3 retries
https://your-domain.com/webhooks/video-task-completedmodel_params.multi_shotbooleanOptionalDefault: falseEnable multi-shot mode for generating videos with multiple camera angles or scenes.
Notes
- When enabled, shot_type and multi_prompt become relevant
truemodel_params.shot_typestringOptionalShot type for multi-shot mode. Required when multi_shot is true.
| Value | Description |
|---|---|
| customize | Custom per-shot prompts and durations |
| intelligence | AI auto-plans shots based on prompt |
Notes
- Only effective when multi_shot=true
customizemodel_params.multi_promptarrayOptionalPer-shot prompt array. Required when multi_shot=true and shot_type=customize. Each item defines a shot segment.
Notes
- Format: [{index: number, prompt: string, duration: string}, ...]
- Max 6 shots
- Total duration of all shots should match the requested duration
- When used, top-level prompt can be empty
[{"index": 1, "prompt": "Scene one", "duration": "5"}, {"index": 2, "prompt": "Scene two", "duration": "5"}]model_params.element_listarrayOptionalSubject library list for referencing pre-trained subjects in the video.
Notes
- Format: [{element_id: long}, ...]
- Max 3 subjects
- Reference subjects in prompt using <<<element_N>>> placeholder
[{"element_id": 123456}]negative_promptstringOptionalNegative prompt describing what you don't want in the video.
Notes
- Max 2500 characters
- Optional
blurry, watermark, text, low qualitymodel_params.watermark_infoobjectOptionalWatermark configuration for the generated video.
Notes
- Format: {enabled: boolean}
{"enabled": false}

