Kling 3.0 API

Name: EvoLink AI Model API Platform
Brand: EvoLink
Availability: InStock

Use EvoLink's unified API to access Kling 3.0 text-to-video and image-to-video. Generate 3-15 second videos with per-second billing, one integration path, and production-ready async delivery.

Model Type:

✓Kling 3.0 Text to Video Kling 3.0 Image to Video Custom Element

Price: $0.080 - 0.398(~ 5.4 - 27 credits) per second of video

Stable managed access for production workloads. Recommended when you need dashboard billing, API key control, and predictable integration behavior.

Use the same API endpoint for all versions. Only the model parameter differs.

Prompt*

105 (suggested: 2,000)

Aspect Ratio

Duration5s

3s15s

Quality

Sound

0:00 / 0:00

Audio

History

Max 20 items

0 running · 0 completed

Your generation history will appear here

Billing Rules

•Price shown is per second
•Duration range: 3-15 seconds
•Total = price/second × duration

Pricing

Model	Mode	Quality	Sound	Price
Kling 3.0 Text to Video	Video Generation	720p	Off	$0.080/ second(5.4 Credits)
Kling 3.0 Text to Video	Video Generation	720p	On	$0.120/ second(8.1 Credits)
Kling 3.0 Text to Video	Video Generation	1080p	Off	$0.106/ second(7.2036 Credits)
Kling 3.0 Text to Video	Video Generation	1080p	On	$0.159/ second(10.8 Credits)
Kling 3.0 Text to Video	Video Generation	4K	Off	$0.398/ second(27 Credits)
Kling 3.0 Text to Video	Video Generation	4K	On	$0.398/ second(27 Credits)

Kling 3.0 Text to Video

Video Generation

Quality:720p

Sound:Off

Price:

$0.080/ second

(5.4 Credits)

Kling 3.0 Text to Video

Video Generation

Quality:720p

Sound:On

Price:

$0.120/ second

(8.1 Credits)

Kling 3.0 Text to Video

Video Generation

Quality:1080p

Sound:Off

Price:

$0.106/ second

(7.2036 Credits)

Kling 3.0 Text to Video

Video Generation

Quality:1080p

Sound:On

Price:

$0.159/ second

(10.8 Credits)

Kling 3.0 Text to Video

Video Generation

Quality:4K

Sound:Off

Price:

$0.398/ second

(27 Credits)

Kling 3.0 Text to Video

Video Generation

Quality:4K

Sound:On

Price:

$0.398/ second

(27 Credits)

If an upstream route is unavailable, EvoLink can use the next available option where fallback coverage exists, helping teams keep costs and operations predictable.

Kling 3.0 API Pricing, Playground, and Integration

Access Kling 3.0 through EvoLink's unified API. Use text-to-video and image-to-video routes with async delivery, per-second pricing, and one integration path for production workflows.

Kling 3.0 pricing starts at $0.075 per second on EvoLink, compared to $0.084 on the official Kling API. Generate 3-15 second videos from text or images with free credits to start, no deposit required.

Hero showcase of Kling 3.0 video capabilities

Kling 3.0 overview and version history

Kling 3.0 is the standard video generation model in the Kling AI family by Kuaishou. Two modes — text-to-video and image-to-video — produce 3-15 second clips at 720p, 1080p, or 4K with per-second billing.

Compared to Kling 2.1 and 1.6, version 3.0 improved motion quality, scene coherence, and prompt adherence. It also added multi-shot support, AI sound effects, and subject control for consistent characters across clips. Access Kling 3.0 on EvoLink with free credits, a built-in playground, and pricing lower than the official rate.

Kling 3.0 API video modes and workflow features

Kling 3.0 Text-to-Video API

Generate videos directly from text prompts with Kling 3.0. Describe scenes, actions, and styles in natural language and let the model produce 3-15 second clips ready for marketing, social media, or creative projects.

Kling 3.0 Image-to-Video API

Use images to guide video generation. Kling 3.0 supports image-to-video mode, giving teams precise control over visual style, character consistency, and scene composition.

Kling 3.0 Multi-Shot and Sound Effects

Create complex multi-shot videos with scene transitions and add AI-generated sound effects. Kling 3.0 supports customizable shot sequences and audio generation for professional-quality output.

Why teams use Kling 3.0 through EvoLink

Kling 3.0 gives teams text-to-video and image-to-video access through one gateway, making pricing, routing, and production integration easier to manage.

One API for two core Kling 3.0 modes

Use the same integration path for text-to-video and image-to-video, instead of splitting implementation across separate vendor setups.

Cleaner production integration

Async task handling, one API key, and unified billing make it easier to run Kling 3.0 inside internal tools, creator products, and automation workflows.

Predictable per-second pricing

3-15 second output windows and visible quality options help teams estimate cost before sending production traffic.

How to integrate the Kling 3.0 API

From input to production-ready video in three steps.

Choose your mode

Select text-to-video or image-to-video based on your workflow needs.

Submit a generation task

Send your request with prompts or images. Track the async task until results are ready.

Review and iterate

Download results, compare variations, and reuse the same structure for fast iteration across campaigns.

View API Docs

Kling 3.0 API capabilities

Text-to-video and image-to-video access through one production-ready gateway

Text

Text-to-video generation

Generate videos purely from text descriptions. Kling 3.0 interprets natural language prompts to produce dynamic video content without requiring any visual input.

Image

Image-to-video transformation

Transform static images into dynamic videos. Provide reference images and let Kling 3.0 animate them with natural motion and scene dynamics.

Multi-Shot

Multi-shot support

Create complex multi-shot videos with customizable scene transitions, per-shot prompts, and duration control for professional video production.

Sound

Sound effects

Add AI-generated sound effects to your videos. Toggle sound on or off based on your needs, with transparent pricing for audio generation.

Billing

Per-second billing

Pay only for what you generate with per-second billing. Videos range from 3 to 15 seconds, giving teams precise cost control for every project.

Quality

720p, 1080p & 4K quality

Choose between standard 720p, high-quality 1080p, or ultra-HD 4K output resolution to balance quality and cost for your specific use case.

Kling 3.0 API FAQ

Everything you need to know about the product and billing.

The Kling 3.0 API provides access to Kling's 3.0 video model through EvoLink. It supports two modes: text-to-video and image-to-video. Each mode generates 3-15 second videos with per-second billing. Use your EvoLink dashboard for current pricing and availability.

Kling 3.0 offers two modes: text-to-video for generating from prompts, and image-to-video for animating images. Each mode is optimized for different production workflows.

Kling 3.0 generates videos between 3 and 15 seconds. Billing is per-second within this range. Videos shorter than 3 seconds are billed at the 3-second minimum. This range is suitable for social media clips, ads, and short-form content.

Kling 3.0 uses per-second billing at 5.4 credits per second base rate. The price varies by quality and sound: 720p+off = 1.0x, 720p+on = 1.5x, 1080p+off = 1.334x, 1080p+on = 2.0x, 4K = 5.0x (sound surcharge does not apply at 4K). Check your EvoLink dashboard for your group's specific pricing.

Kling O3 (V3 Omni) supports four modes including reference-to-video and video editing, while Kling 3.0 focuses on text-to-video and image-to-video. 3.0 has slightly different pricing factors compared to O3.

Start with a clear subject and describe the action, mood, and setting in simple terms. For image-to-video, provide high-quality reference images. Consistency improves when your prompt structure stays stable across runs.

Limits, pricing, and available modes are determined by your provider and region. Use your EvoLink dashboard and API responses as the source of truth. Check the API documentation for the most current constraints and parameters.

All Kling AI Models

EvoLink provides unified API access to the full Kling model family: All models share the same API key. Switch models with one parameter.

Explore Kling family View Kling O1 View Kling O3 View Motion Control

API Reference

Select endpoint

Endpoints

Authentication

All APIs require Bearer Token authentication.

Header

Authorization: 
Bearer YOUR_API_KEY

Get API Key

POST

/v1/videos/generations

Create Video

Kling 3.0 Text to Video (kling-v3-text-to-video) generates videos from text prompts using the 3.0 model. Supports single-shot and multi-shot modes with optional sound effects.

Asynchronous processing mode, use the returned task ID to query status.

Generated video links are valid for 24 hours, please save them promptly.

Important Notes

Text-to-video mode: no image input required.
Video duration: 3-15 seconds, billed per second.
Pricing varies by quality and sound: 720p+off = 1.0x, 720p+on = 1.5x, 1080p+off = 1.334x, 1080p+on = 2.0x, 4k = 5.0x (sound surcharge does not apply at 4K).

Request Parameters

modelstringRequiredDefault: kling-v3-text-to-video

Video generation model name.

Examplekling-v3-text-to-video

promptstringRequired

Text prompt describing what kind of video to generate. When multi_shot=true and shot_type=customize, this can be empty (use multi_prompt instead).

Notes

Max 2500 characters
Reference elements using <<<element_1>>> syntax in the prompt

ExampleA golden retriever running through a sunlit meadow, cinematic slow motion.

durationintegerOptionalDefault: 5

Specifies the generated video duration in seconds.

Notes

Range: 3-15 seconds (integer)
Base price: 5.4 credits per second
Minimum billing: 3 seconds

Example5

aspect_ratiostringOptional

Video aspect ratio.

Value	Description
16:9	Landscape video
9:16	Portrait video
1:1	Square video

Example16:9

qualitystringOptionalDefault: 720p

Video resolution quality. Affects billing multiplier.

Value	Description
720p	Standard 720P (1.0x base)
1080p	High quality 1080P (1.334x base)
4k	Ultra HD 4K (5.0x base, sound surcharge does not apply)

Example720p

soundstringOptionalDefault: off

Sound effect control. Affects billing multiplier (no effect when quality=4k).

Value	Description
off	No sound effects (1.0x)
on	Generate sound effects (1.5x)

Notes

Combined multiplier: 720p+off=1.0x, 720p+on=1.5x, 1080p+off=1.334x, 1080p+on=2.0x, 4k=5.0x (sound has no effect)

Exampleoff

callback_urlstringOptional

HTTPS callback address after task completion.

Notes

Triggered on completion, failure, or cancellation
HTTPS only, no internal IPs
Max length: 2048 chars
Timeout: 10s, Max 3 retries

Examplehttps://your-domain.com/webhooks/video-task-completed

model_params.multi_shotbooleanOptionalDefault: false

Enable multi-shot mode for generating videos with multiple camera angles or scenes.

Notes

When enabled, prompt parameter will be ignored — use multi_prompt instead
Sum of all shot duration values must equal total video duration

Exampletrue

model_params.shot_typestringOptional

Shot type for multi-shot mode. Required when multi_shot is true.

Value	Description
customize	Custom per-shot prompts and durations
intelligence	AI auto-plans shots based on prompt

Notes

Only effective when multi_shot=true

Examplecustomize

model_params.multi_promptarrayOptional

Per-shot prompt array. Required when multi_shot=true and shot_type=customize. Each item defines a shot segment.

Notes

Format: [{index: number, prompt: string, duration: string}, ...]
Max 6 shots, each shot prompt max 512 characters
Sum of all shot durations must equal total video duration
When used, top-level prompt can be empty

Example

[{"index": 1, "prompt": "A person on a hilltop", "duration": "5"}, {"index": 2, "prompt": "Camera pulls back", "duration": "5"}]

negative_promptstringOptional

Negative prompt describing what you don't want in the video.

Notes

Max 2500 characters
Optional

Exampleblurry, watermark, text, low quality

model_params.watermark_infoobjectOptional

Watermark configuration for the generated video.

Notes

Format: {enabled: boolean}

Example{"enabled": false}

Request Example

{
  "model": "kling-v3-text-to-video",
  "prompt": "A golden retriever running through a sunlit meadow, cinematic slow motion.",
  "duration": 5,
  "aspect_ratio": "16:9",
  "quality": "720p",
  "sound": "off"
}

Multi-Shot Example

{
  "model": "kling-v3-text-to-video",
  "duration": 10,
  "aspect_ratio": "16:9",
  "quality": "1080p",
  "sound": "on",
  "model_params": {
    "multi_shot": true,
    "shot_type": "customize",
    "multi_prompt": [
      {"index": 1, "prompt": "A person standing on a hilltop watching sunrise", "duration": "5"},
      {"index": 2, "prompt": "Camera pulls back to reveal a vast mountain panorama", "duration": "5"}
    ]
  }
}

Response Example

{
  "created": 1757169743,
  "id": "task-unified-1757169743-v3t2v",
  "model": "kling-v3-text-to-video",
  "object": "video.generation.task",
  "progress": 0,
  "status": "pending",
  "task_info": {
    "can_cancel": true,
    "estimated_time": 180,
    "video_duration": 5
  },
  "type": "video",
  "usage": {
    "billing_rule": "per_second",
    "credits_reserved": 27.0,
    "user_group": "default"
  }
}