Seedance 2.0 API — Coming SoonGet early access

Kling 3.0 API

Kling 3.0 video model with text-to-video, image-to-video, and custom element creation. Supports 3-15 second videos with per-second billing.

Estimated Cost (5s)
5.4 Credits
Sample Result

No sample available

0 (suggested: 2,000)
5s
3s15s

Click Generate to see preview

History

Max 20 items

0 running · 0 completed

Your generation history will appear here

Billing Rules

  • Price shown is per second
  • Duration range: 3-15 seconds
  • Total = price/second × duration

Pricing

Popular
Kling 3.0 Text to Video
Video Generation
Quality:720p
Sound:Off
Price:
$0.0750/ second
(5.4 Credits)
Kling 3.0 Text to Video
Video Generation
Quality:720p
Sound:On
Price:
$0.1125/ second
(8.1 Credits)
Kling 3.0 Text to Video
Video Generation
Quality:1080p
Sound:Off
Price:
$0.1000/ second
(7.1982 Credits)
Kling 3.0 Text to Video
Video Generation
Quality:1080p
Sound:On
Price:
$0.1500/ second
(10.8 Credits)

If it's down, we automatically use the next cheapest available—ensuring 99.9% uptime at the best possible price.

Kling 3.0 API for video creation

Build with the Kling 3.0 model. Generate videos from text or images through one unified API with 3-15 second output support.

Hero showcase of Kling 3.0 video capabilities

What can you build with the Kling 3.0 API?

Text-to-video creation

Generate videos directly from text prompts with Kling 3.0. Describe scenes, actions, and styles in natural language and let the model produce 3-15 second clips ready for marketing, social media, or creative projects.

Text-to-video showcase

Image-driven video generation

Use images to guide video generation. Kling 3.0 supports image-to-video mode, giving teams precise control over visual style, character consistency, and scene composition.

Image-driven video showcase

Multi-shot and sound effects

Create complex multi-shot videos with scene transitions and add AI-generated sound effects. Kling 3.0 supports customizable shot sequences and audio generation for professional-quality output.

Multi-shot video showcase

Why teams choose Kling 3.0

Kling 3.0 provides text-to-video and image-to-video modes in a single model family with competitive pricing.

Two specialized modes

Text and image modes cover the core video creation workflow.

3.0 architecture

Built on Kling's 3.0 foundation for quality video generation.

Flexible 3-15s output

Generate videos from 3 to 15 seconds with per-second billing.

How to integrate the Kling 3.0 API

From input to production-ready video in three steps.

1

Choose your mode

Select text-to-video or image-to-video based on your workflow needs.

2

Submit a generation task

Send your request with prompts or images. Track the async task until results are ready.

3

Review and iterate

Download results, compare variations, and reuse the same structure for fast iteration across campaigns.

Core capabilities of the Kling 3.0 API

Video AI with two specialized modes

Text

Text-to-video generation

Generate videos purely from text descriptions. Kling 3.0 interprets natural language prompts to produce dynamic video content without requiring any visual input.

Image

Image-to-video transformation

Transform static images into dynamic videos. Provide reference images and let Kling 3.0 animate them with natural motion and scene dynamics.

Multi-Shot

Multi-shot support

Create complex multi-shot videos with customizable scene transitions, per-shot prompts, and duration control for professional video production.

Sound

Sound effects

Add AI-generated sound effects to your videos. Toggle sound on or off based on your needs, with transparent pricing for audio generation.

Billing

Per-second billing

Pay only for what you generate with per-second billing. Videos range from 3 to 15 seconds, giving teams precise cost control for every project.

Quality

720p & 1080p quality

Choose between standard 720p and high-quality 1080p output resolution to balance quality and cost for your specific use case.

Frequently Asked Questions

Everything you need to know about the product and billing.

The Kling 3.0 API provides access to Kling's 3.0 video model through EvoLink. It supports two modes: text-to-video and image-to-video. Each mode generates 3-15 second videos with per-second billing. Use your EvoLink dashboard for current pricing and availability.
Kling 3.0 offers two modes: text-to-video for generating from prompts, and image-to-video for animating images. Each mode is optimized for different production workflows.
Kling 3.0 generates videos between 3 and 15 seconds. Billing is per-second within this range. Videos shorter than 3 seconds are billed at the 3-second minimum. This range is suitable for social media clips, ads, and short-form content.
Kling 3.0 uses per-second billing at 5.4 credits per second base rate. The price varies by quality and sound: 720p+off = 1.0x, 720p+on = 1.5x, 1080p+off = 1.333x, 1080p+on = 2.0x. Check your EvoLink dashboard for your group's specific pricing.
Kling O3 (V3 Omni) supports four modes including reference-to-video and video editing, while Kling 3.0 focuses on text-to-video and image-to-video. 3.0 has slightly different pricing factors compared to O3.
Start with a clear subject and describe the action, mood, and setting in simple terms. For image-to-video, provide high-quality reference images. Consistency improves when your prompt structure stays stable across runs.
Limits, pricing, and available modes are determined by your provider and region. Use your EvoLink dashboard and API responses as the source of truth. Check the API documentation for the most current constraints and parameters.
POST
/v1/videos/generations

Create Video

Kling 3.0 Text to Video (kling-v3-text-to-video) generates videos from text prompts using the 3.0 model. Supports single-shot and multi-shot modes with optional sound effects.

Asynchronous processing mode, use the returned task ID to query status.

Generated video links are valid for 24 hours, please save them promptly.

Important Notes

  • Text-to-video mode: no image input required.
  • Video duration: 3-15 seconds, billed per second.
  • Pricing varies by quality and sound: 720p+off = 1.0x, 720p+on = 1.5x, 1080p+off = 1.333x, 1080p+on = 2.0x.

Request Parameters

modelstringRequiredDefault: kling-v3-text-to-video

Video generation model name.

Examplekling-v3-text-to-video
promptstringRequired

Text prompt describing what kind of video to generate. When multi_shot=true and shot_type=customize, this can be empty (use multi_prompt instead).

Notes
  • Max 2500 characters
ExampleA golden retriever running through a sunlit meadow, cinematic slow motion.
durationintegerOptionalDefault: 5

Specifies the generated video duration in seconds.

Notes
  • Range: 3-15 seconds (integer)
  • Base price: 5.4 credits per second
  • Minimum billing: 3 seconds
Example5
aspect_ratiostringOptional

Video aspect ratio.

ValueDescription
16:9Landscape video
9:16Portrait video
1:1Square video
Example16:9
qualitystringOptionalDefault: 720p

Video resolution quality. Affects billing multiplier.

ValueDescription
720pStandard 720P (1.0x base)
1080pHigh quality 1080P (1.333x base)
Example720p
soundstringOptionalDefault: off

Sound effect control. Affects billing multiplier.

ValueDescription
offNo sound effects (1.0x)
onGenerate sound effects (1.5x)
Notes
  • Combined multiplier: 720p+off=1.0x, 720p+on=1.5x, 1080p+off=1.333x, 1080p+on=2.0x
Exampleoff
callback_urlstringOptional

HTTPS callback address after task completion.

Notes
  • Triggered on completion, failure, or cancellation
  • HTTPS only, no internal IPs
  • Max length: 2048 chars
  • Timeout: 10s, Max 3 retries
Examplehttps://your-domain.com/webhooks/video-task-completed
model_params.multi_shotbooleanOptionalDefault: false

Enable multi-shot mode for generating videos with multiple camera angles or scenes.

Notes
  • When enabled, shot_type and multi_prompt become relevant
Exampletrue
model_params.shot_typestringOptional

Shot type for multi-shot mode. Required when multi_shot is true.

ValueDescription
customizeCustom per-shot prompts and durations
intelligenceAI auto-plans shots based on prompt
Notes
  • Only effective when multi_shot=true
Examplecustomize
model_params.multi_promptarrayOptional

Per-shot prompt array. Required when multi_shot=true and shot_type=customize. Each item defines a shot segment.

Notes
  • Format: [{index: number, prompt: string, duration: string}, ...]
  • Max 6 shots
  • Total duration of all shots should match the requested duration
  • When used, top-level prompt can be empty
Example[{"index": 1, "prompt": "A person on a hilltop", "duration": "5"}, {"index": 2, "prompt": "Camera pulls back", "duration": "5"}]
negative_promptstringOptional

Negative prompt describing what you don't want in the video.

Notes
  • Max 2500 characters
  • Optional
Exampleblurry, watermark, text, low quality
model_params.watermark_infoobjectOptional

Watermark configuration for the generated video.

Notes
  • Format: {enabled: boolean}
Example{"enabled": false}

Request Example

{
  "model": "kling-v3-text-to-video",
  "prompt": "A golden retriever running through a sunlit meadow, cinematic slow motion.",
  "duration": 5,
  "aspect_ratio": "16:9",
  "quality": "720p",
  "sound": "off"
}

Multi-Shot Example

{
  "model": "kling-v3-text-to-video",
  "duration": 10,
  "aspect_ratio": "16:9",
  "quality": "1080p",
  "sound": "on",
  "model_params": {
    "multi_shot": true,
    "shot_type": "customize",
    "multi_prompt": [
      {"index": 1, "prompt": "A person standing on a hilltop watching sunrise", "duration": "5"},
      {"index": 2, "prompt": "Camera pulls back to reveal a vast mountain panorama", "duration": "5"}
    ]
  }
}

Response Example

{
  "created": 1757169743,
  "id": "task-unified-1757169743-v3t2v",
  "model": "kling-v3-text-to-video",
  "object": "video.generation.task",
  "progress": 0,
  "status": "pending",
  "task_info": {
    "can_cancel": true,
    "estimated_time": 180,
    "video_duration": 5
  },
  "type": "video",
  "usage": {
    "billing_rule": "per_second",
    "credits_reserved": 27.0,
    "user_group": "default"
  }
}
Kling 3.0 API: Video Generation & Editing | EvoLink