Kling 3.0 动作控制 API

通过 EvoLink 统一 API 将参考视频的人物动作迁移到角色图像。720p 输出 0.1134 美元/秒起，支持 1080p，并可按朝向模式生成最长 30 秒视频。

模型类型:

Motion Control ✓Custom Element

价格: $0.015(~ 1 credits) per request

稳定性最高，保证 99.9% 可用性。推荐用于生产环境。

所有版本使用同一个 API 端点，仅 model 参数不同。

Reference Type*

Choose whether to create element from images or video

Reference Video *

Upload a video showing the subject clearly — must contain a visible human face (max 8s, 720-2160px)

Click to upload or drag and drop

Supported formats: MP4, MOV, WEBM
Maximum file size: 100MB; Maximum files: 1

Element Name*

Name of the subject element (max 20 characters)

Element Description*

Description of the subject element (max 100 characters)

No video URL found in response

Check console for details

History

最多保留20条

0 运行中 · 0 已完成

您的生成历史将显示在这里

Billing Rules

•Fixed price: 1 Credit per element creation
•Full refund on failure
•Estimated processing time: ~10 minutes

Pricing

Model	Mode	Type	Price
Kling Custom Element	Element Creation	Element Creation	$0.015/ request(1 Credits)

Kling Custom Element

Element Creation

Type:Element Creation

Price:

$0.015/ request

(1 Credits)

If it's down, we automatically use the next cheapest available—ensuring 99.9% uptime at the best possible price.

动作迁移 API

通过 EvoLink 统一 API 将参考视频动作迁移到角色图像，支持朝向控制、原声保留与异步交付，适合生产工作流。

起始价格 $0.1134/s
720p / 1080p
输入: 图像 + 视频
最长 10s / 30s

在一条可用于生产的路由中完成参考视频动作迁移、角色图像引导和异步交付。

浏览相关 Kling API 页面

Kling 3.0 API

如果你要做文生视频或图生视频，请使用 Kling 3.0 主页面，通过 EvoLink 统一 API 接入。

打开 Kling 3.0 API

Kling O3 API

如果你需要参考视频生成、视频编辑，或在一个页面查看四种视频模式，请使用 Kling O3。

打开 Kling O3 API

Kling 3.0 动作控制 API 应用场景与控制能力

Kling 角色动画生成 API

将静态角色图像通过参考表演视频驱动成动画，无需手动绑定骨骼或逐帧编辑。这种工作流适用于创作者工具、虚拟形象、短视频内容和角色驱动的视频制作，特别是需要自然人物动作的场景。

Kling 舞蹈与手势迁移 API

将舞蹈动作、手势和身体表演从一个视频迁移到不同角色，用于社交媒体制作、营销素材和创作者工作流。团队可以将单个动作源转换为多个角色主导的输出，加快营销活动的迭代速度。

支持朝向控制的 Kling 动作控制 API

控制生成的角色是跟随源图像的方向还是参考视频的方向，然后根据预算和制作需求选择 720p 或 1080p 输出。这为开发者提供了比通用图像转视频生成更结构化的动作控制管道。

为什么团队通过 EvoLink 使用 Kling 3.0 动作控制 API

Kling 3.0 动作控制为团队提供了一种更可控的方式，将真实表演素材转换为角色动画，具有可预测的按秒定价和生产就绪的 API 工作流。

比通用视频生成更可控

动作控制不是让模型从零开始创造动作，而是使用参考表演视频作为动作源。这使得输出行为更容易控制，特别适合角色驱动的工作流。

无需手动绑定骨骼，加快制作速度

团队可以将静态图像加上动作片段转换为可用的动画输出，无需构建传统的动画管道。这降低了原型、营销素材和创作者工具的设置门槛。

可预测的 API 定价，适合重复工作流

按秒计费、720p 和 1080p 模式以及明确的时长限制，让成本估算更容易。这对批量任务、用户生成内容工作流和自动化内容管道很重要。

如何集成 Kling 3.0 动作控制 API

从参考输入到交付动画的三个实用步骤。

准备清晰的角色图像和动作源视频

上传目标角色的参考图像和包含要迁移表演的参考视频。清晰的身体可见度和稳定的源动作可以提高输出一致性。

提交带动作设置的异步生成请求

将图像、视频、画质模式和朝向参数发送到 API。任务异步运行，更容易集成到生产管道、队列系统和自动化工作流中。

轮询任务状态、审查输出并迭代

检查任务状态直到结果就绪，然后审查生成的片段并在需要时优化输入。这种迭代循环适用于创作者产品、内部工具和可重复的视频工作流。

查看 API 文档

Kling 3.0 动作控制 API 核心能力

参考视频驱动的角色动画与生产级控制

动作

动作迁移

从参考视频中提取动作并应用到参考图像中的角色，生成自然的动画效果，无需手动操作。

朝向

角色朝向控制

选择输出角色面朝的方向——与参考图像一致或与参考视频一致，实现精确控制。

主体

用于角色一致性的主体元素支持

当你需要在多次生成中保持更稳定的角色外观时，可将 Motion Control 与 Kling 主体创建工作流配合使用。

声音

原声保留

可选择保留参考视频中的原始音频到生成的输出中。

计费

按秒计费

仅按实际生成时长付费。按秒计费，实际时长向上取整到最近的整数秒。

画质

720p 和 1080p 画质

在标准 720p（std）和高清 1080p（pro）输出之间选择，为特定用例平衡质量和成本。

Kling 3.0 动作控制 API 常见问题

Everything you need to know about the product and billing.

动作控制 API 将参考视频中的人物动作迁移到参考图像中的角色上。它使用 Kling 3.0 模型来合成自然的动画效果。参考图像和参考视频都是必需的输入。

您需要一张参考图像（要动画化的角色）和一个参考视频（动作来源）。角色应清晰露出上半身或全身。视频应包含清晰的人物动作，避免切镜和运镜。

最大时长取决于 character_orientation 参数：设为 'image'（角色朝向与图像一致）时最长 10 秒，设为 'video'（角色朝向与视频一致）时最长 30 秒。最短 3 秒。实际时长取决于参考视频中可提取的有效动作时长。

动作控制使用按秒计费，基础价格为每秒 8.1648 积分（720p）。1080p 画质为基础价格的 1.334 倍。按实际生成时长计费，向上取整到最近的整数秒。请在 EvoLink 控制台查看您所在用户组的具体定价。

此参数控制生成角色的面朝方向。设为 'image' 时角色朝向与参考图像一致（最长生成 10 秒视频）；设为 'video' 时角色朝向与参考视频一致（最长生成 30 秒视频）。

可以，您可以使用通过 Kling 主体创建功能创建的主体元素，通过 element_list 引用。使用主体元素时，角色朝向目前仅支持视频朝向。最多支持 1 个主体。

限制、定价和可用模式由您的提供商和地区决定。请将 EvoLink 控制台和 API 响应作为事实来源。查看 API 文档以获取最新的约束条件和参数。

API Reference

Select endpoint

Authentication

All APIs require Bearer Token authentication.

Header

Authorization: 
Bearer YOUR_API_KEY

Get API Key

POST

/v1/videos/generations

Create Element

Kling Custom Element (kling-custom-element) creates reusable subjects (elements) from reference videos. Created elements can be referenced in Kling V3 Motion Control, Kling O3 series, and Kling V3 Image-to-Video video generation via element_list for consistent character appearance.

Asynchronous processing mode, use the returned task ID to query status.

Upon completion, result_data will contain the element_id for use in video generation.

Important Notes

This model creates reusable elements (subjects), not videos. No prompt/duration/quality/aspect_ratio parameters needed.
The reference image or video must contain a clearly visible human face.
Fixed pricing: 1 Credit per element creation. Full refund on failure.
Estimated processing time: approximately 10 minutes.
After successful creation, use the returned element_id in Kling V3 Motion Control, Kling O3 series, and Kling V3 Image-to-Video video generation via element_list.

Request Parameters

modelstringRequiredDefault: kling-custom-element

Model name for custom element creation.

Examplekling-custom-element

model_params.element_namestringRequired

Name for the element (subject) being created.

Notes

Max 20 characters

ExampleMyCharacter

model_params.element_descriptionstringRequired

Description of the element, used to help the model understand the subject characteristics.

Notes

Max 100 characters

ExampleA young male character with short hair, wearing a white T-shirt

model_params.reference_typestringRequired

Type of reference material used to create the element.

Value	Description
video_refer	Use reference video to create element

Examplevideo_refer

model_params.element_video_listobjectOptional

Reference video for element creation. Required when reference_type is video_refer.

Notes

Conditionally required: must be provided when reference_type = video_refer
video_url (string): URL of the reference video
The reference video must contain a clearly visible human face

Example{"video_url": "https://example.com/reference.mp4"}

model_params.element_voice_idstringOptional

Voice ID to assign to the element. The voice will be used when the element speaks in generated videos.

Notes

Optional parameter — only supported when reference_type = video_refer
Not available for image_refer (image-based element creation does not support voice assignment)
See the Voice ID Reference section below for all available voice IDs

Example829824295735410756

callback_urlstringOptional

HTTPS callback address after task completion.

Notes

Triggered on completion, failure, or cancellation
HTTPS only, no internal IPs
Max length: 2048 chars
Timeout: 10s, Max 3 retries

Examplehttps://your-domain.com/webhooks/element-task-completed

Request Example (Video Reference)

{
  "model": "kling-custom-element",
  "model_params": {
    "element_name": "CharacterA",
    "element_description": "Middle-aged female with long hair, wearing a red dress",
    "reference_type": "video_refer",
    "element_video_list": {
      "video_url": "https://example.com/reference.mp4"
    },
    "element_voice_id": "829826751244537879"
  }
}

Response Example

{
  "created": 1739500000,
  "id": "task-video-xxxxxxxx",
  "model": "kling-custom-element",
  "object": "video.generation.task",
  "progress": 0,
  "status": "pending",
  "task_info": {
    "estimated_time": 600,
    "can_cancel": true
  },
  "usage": {
    "credits_reserved": 1,
    "billing_rule": "per_call"
  }
}