Seedance 2.0 API

Name: EvoLink AI Model API Platform
Brand: EvoLink
Availability: InStock

Seedance 2.0 is ByteDance's second-generation video model for multimodal video generation, now with full real human video support. Available globally through EvoLink with per-second billing.

Model Type:

✓Text to Video Image to Video Reference to Video

Fast Text to Video Fast Image to Video Fast Reference to Video

Price: $0.092 - 0.496(~ 6.2775 - 33.75 credits) per second of video

Highest stability with guaranteed 99.9% uptime. Recommended for production environments.

Use the same API endpoint for all versions. Only the model parameter differs.

Prompt

Need inspiration? Browse Seedance 2.0 prompts

5-second ultra-cinematic mythic teaser, horizontal 16:9. A woman rides a white horse across a clear turquoise sea, with the waterline splitting the world into two dimensions—above and below.

The opening is a half-above, half-underwater tracking shot, calm and sacred in tone, with realistic splashes and natural sunlight filtering underwater. The camera slowly pushes in with a slight arc as the horse walks steadily forward; the woman, dressed in flowing ivory robes, sits upright with a distant, prophetic presence, wind moving her fabric and earrings.

Brief cut to a close shot: she seems to sense a calling ahead, her eyes lift slightly, and her breath pauses for a moment.

The camera then widens as the horse continues forward with stronger splashes; the light on the horizon gradually intensifies, as if an unseen gate is opening.

The final shot pushes into the glow, conveying a sense of divine return and fate awakening. Premium cinematic look, realistic water physics, natural horse motion, subtle mystical atmosphere, no extra characters, no text.

1,061 (suggested: 2,000)

Duration5s

4s15s

Quality

Aspect Ratio

Generate Audio

Generate synchronized audio. No extra charge.

0:00 / 0:00

Audio

History

Max 20 items

0 running · 0 completed

Your generation history will appear here

Billing Rules

•Price shown is per second of output video
•Duration range: 4-15 seconds
•Audio generation: included, no extra charge
•Web search: charged per request when enabled. A single request may trigger multiple searches.

Pricing

Model	Mode	Quality	Price
Seedance 2.0 Text to Video	Video Generation	480p	$0.092/ second(6.2775 Credits)
Seedance 2.0 Text to Video	Video Generation	720p	$0.199/ second(13.5 Credits)
Seedance 2.0 Text to Video	Video Generation	1080p	$0.496/ second(33.75 Credits)
Seedance 2.0 Text to Video	Web Search (per request)	-	$0.0006/ request(0.04 Credits)

Seedance 2.0 Text to Video

Video Generation

Quality:480p

Price:

$0.092/ second

(6.2775 Credits)

Seedance 2.0 Text to Video

Video Generation

Quality:720p

Price:

$0.199/ second

(13.5 Credits)

Seedance 2.0 Text to Video

Video Generation

Quality:1080p

Price:

$0.496/ second

(33.75 Credits)

Seedance 2.0 Text to Video

Web Search (per request)

Price:

$0.0006/ request

(0.04 Credits)

If it's down, we automatically use the next cheapest available—ensuring 99.9% uptime at the best possible price.

ByteDance Seedance 2.0 API and AI Video Generator

Use the Seedance 2.0 API to run ByteDance Seedance as an AI video generator for text, image, video, and audio inputs — including real human video generation with lifelike faces, expressions, and full-body motion. This Seedance video generator adds the @-reference system, synchronized audio, video-to-video editing, and up to 15-second generation, while the Pricing tab helps you compare current Seedance price options.

ByteDance Seedance 2.0 AI video generator and API showcase

What can you build with Seedance 2.0 API?

Seedance 2.0 Reference-Driven Video Production

With Seedance 2.0 API, upload a reference video and the model extracts camera movement, motion dynamics, and transition style via the @-reference system. Generate dozens of on-brand video variations from one hero clip without reshooting. Supports up to 3 video references per request for combining camera from one source, motion from another, and style from a third.

Seedance 2.0 Audio-Synced Content with Reference Audio

Seedance 2.0 API lets you provide up to 3 audio tracks as references. Seedance 2.0 aligns cuts, motion energy, and scene transitions to beat and rhythm. The output includes native synchronized audio: dialogue lip-syncs in multiple languages, sound effects match on-screen action, and background music follows the mood of your audio reference.

Seedance 2.0 Multi-Reference Storyboard to Video

Seedance 2.0 lets you combine up to 9 reference images with text prompts to control composition, character, and environment across shots. Seedance 2.0 fills the gaps between keyframes with consistent identity, lighting, and style. Ideal for ad production, product demo sequences, and animated storyboards.

Seedance 2.0 Real Human Video Generation

Seedance 2.0 now fully supports real human video generation through EvoLink's API. Upload a portrait photo and generate video with lifelike facial expressions, natural micro-expressions, full-body motion including dance and athletics, and multi-language lip-synced dialogue. Ideal for face-led ads, spokesperson content, influencer-style creative, and realistic portrait storytelling.

How Seedance 2.0 Compares - All models on one EvoLink API key

Seedance 2.0 leads with multimodal @-reference inputs, video-to-video editing, and the highest image reference count among major video generation models.

Seedance 2.0 API Multimodal @-Reference System

Seedance 2.0 is the only model supporting video, audio, and image references in a single request. Upload up to 9 images + 3 videos + 3 audio tracks to control camera, motion, rhythm, and style - capabilities unavailable in Sora 2, Kling 3.0, or Seedance 1.5 Pro.

Seedance 2.0 API Video-to-Video Editing

Seedance 2.0 API enables editing specific segments, characters, or actions in existing videos. Seedance 2.0 supports targeted V2V modifications - a feature not available in Sora 2 or Kling 3.0.

Seedance 2.0 API Real Human Video Support

Seedance 2.0 on EvoLink fully supports real human video generation — upload a portrait photo and produce video with lifelike expressions, full-body motion, and multi-language lip-sync. This is a capability that remains limited or restricted on competing platforms like Kling 3.0 and Sora 2.

Seedance 2.0 API Competitive Pricing via EvoLink

Access Seedance 2.0 API through EvoLink's unified API with competitive per-second pricing. If you are comparing Seedance price, one API key gives you access to ByteDance Seedance, Sora 2, Kling 3.0, Veo 3.1, and more - with automatic routing to the best provider.

Seedance 2.0 vs Kling 3.0 vs Sora 2

Feature	Seedance 2.0	Kling 3.0	Sora 2
EvoLink price	From $0.092/s	$0.079/s	$0.08/s
Current route quality	480p / 720p / 1080p	720p / 1080p	720p
Native audio	Yes	AI sound effects	Yes (synchronized)
Reference control	Text + image + video + audio	Text + image	Text + image
Video length	4-15s	3-15s	4 / 8 / 12s
Real human video	Full support	Limited	Limited
Best for	Premium multimodal control, directed production, real human video	General-purpose video, per-second billing	OpenAI ecosystem, longer clips

How to Integrate Seedance 2.0 API

Seamlessly integrate multimodal video generation into your app with EvoLink's unified API. Supports T2V, I2V, and V2V modes.

Step 1 - Get Your API Key

Sign up on EvoLink.ai and generate your secure API key from the dashboard. This key authenticates all your requests to the Seedance 2.0 endpoint.

Step 2 - Submit Generation Task

Send a POST request to `/v1/videos/generations` with your text prompt, image URLs, or video/audio references. Use the `references` parameter to pass video, audio, and image inputs for the @-reference system. The API processes this asynchronously and returns a task `id` for tracking.

Step 3 - Retrieve Video Result

Use the `task_id` to poll the status endpoint or configure a webhook. Once completed, you'll receive a secure URL to download your synchronized audio-video file in MP4+AAC format.

Seedance 2.0 API Capabilities

Technical specifications for multimodal video production

Multimodal

@-Reference System

Upload up to 9 images + 3 videos + 3 audio references per request. The model extracts camera paths, motion patterns, rhythm, and style from source media.

Quality

Up to 1080p Output

The current EvoLink Seedance 2.0 route exposes 480p, 720p, and 1080p quality options, depending on mode and pricing.

Flexibility

4-15s Duration

Supports variable video lengths from 4 to 15 seconds, with multi-shot consistency for longer narrative sequences.

Modes

Text, Image, Video & Audio Input

Supports T2V (text-to-video), I2V (image-to-video), and V2V (video-to-video) generation modes with combinable multimodal inputs.

Audio

Native Audio with Lip-Sync

Generates synchronized dialogue, sound effects, and background music. Lip-sync support for multiple languages.

Human

Real Human Video Generation

Generate realistic videos of real people from reference photos. Supports lifelike facial expressions, micro-expressions, full-body motion, and multi-language lip-synced dialogue.

Licensing

Commercial Rights

Commercial usage rights subject to BytePlus terms, enabled securely through the EvoLink platform.

Cost Example

100 × 5s 720p videos (Standard)500s × $0.198/s = $99

100 × 5s 480p videos (Fast)500s × $0.092/s = $46

1,000 × 5s 720p videos/month5,000s × $0.198/s = $990

Iterate prompts at 480p ($0.092/s) then promote winners to 720p ($0.198/s).

Explore more video generation models on EvoLink →

Seedance 2.0 API Frequently Asked Questions

Everything you need to know about the product and billing.

Seedance 2.0 is ByteDance's second-generation video model. Compared to Seedance 1.5 Pro, it introduces the @-reference system for multimodal inputs (video, audio, and image references), video-to-video editing mode, 15-second max duration (was 12s), and up to 9 image + 3 video + 3 audio references per request. It is a major upgrade in both capabilities and flexibility.

Seedance 2.0 is billed per second through EvoLink. On the current public route, text-to-video pricing is exposed at 480p, 720p, and 1080p, and audio generation is included at no extra charge. Check the pricing table above for the exact current per-second rates.

Yes. EvoLink provides Seedance 2.0 API access to developers worldwide. You can integrate using a single API key. For details on supported regions and any access considerations, see the Seedance 2.0 API access guide.

The @-reference system is Seedance 2.0's breakthrough feature. It allows you to upload reference media (videos, audio tracks, images) and the model extracts specific attributes - camera paths from video, rhythm and beat from audio, composition and style from images. You can combine references from different sources in a single request to precisely control the output.

Seedance 2.0 supports text prompts combined with up to 9 reference images, 3 reference videos, and 3 reference audio tracks - all combinable in a single request. Generation modes include text-to-video (T2V), image-to-video (I2V), and video-to-video (V2V) editing.

Seedance 2.0 is the only model offering video and audio reference inputs, the @-reference system, and V2V editing - features not available in Sora 2 or Kling 3.0. It supports up to 9 image references (vs 1 for competitors). All three models are available through EvoLink's unified API. For a detailed side-by-side comparison, see our blog post.

Yes. Seedance 2.0 generates native synchronized audio including dialogue with lip-sync in multiple languages, sound effects matched to on-screen action, and background music. Audio generation can be toggled on/off per request, and audio references can be used to guide rhythm and mood.

On the current EvoLink route, output videos are in MP4 format (H.264) with AAC audio. Supported quality options are 480p, 720p, and 1080p. Duration: 4-15 seconds. Frame rate: 24fps. Aspect ratios: 16:9, 9:16, 1:1, 4:3, 3:4, 21:9, and adaptive.

Yes. EvoLink's Seedance API provides a unified endpoint for Seedance 2.0, 1.5 Pro, and 1.0 Pro Fast. Use the model parameter to switch between versions.

Yes. Seedance 2.0 on EvoLink fully supports real human video generation. Upload a portrait photo as a reference and generate video with lifelike facial expressions, natural micro-expressions, full-body motion (dance, athletics, gestures), and multi-language lip-synced dialogue. This makes it one of the strongest AI models for face-led ads, spokesperson content, and realistic portrait storytelling.

Seedance 2.0 is one of the stronger options for complex human motion such as dance and athletics. Combined with its @-reference system and native audio lip-sync, it supports a broader real human video workflow than Kling 3.0 or Sora 2, which have more limited support for real person reference inputs.

All Seedance API Models

EvoLink provides unified API access to the full Seedance model family: All models share the same Seedance API endpoint. Switch models with one parameter.

Explore Seedance family View Seedance 1.5 Pro View Seedance 1.0 Pro Fast

API Reference

Select endpoint

Authentication

All APIs require Bearer Token authentication.

Header

Authorization: 
Bearer YOUR_API_KEY

Get API Key

POST

/v1/videos/generations

Create Text-to-Video

Generate video from text prompts using Seedance 2.0. Supports optional web search for enhanced real-time content.

Now supports AIGC-generated realistic human materials.

Asynchronous processing — use the returned task ID to query status. Video links are valid for 24 hours.

Request Parameters

modelstringRequiredDefault: seedance-2.0-text-to-video

Fixed value: seedance-2.0-text-to-video

Exampleseedance-2.0-text-to-video

promptstringRequired

Text prompt describing the video to generate. Supports Chinese and English.

Notes

Chinese: ≤ 500 characters
English: ≤ 1000 words
This model does NOT support image_urls, video_urls, or audio_urls

ExampleA cinematic drone shot over a coastal city at golden hour

durationintegerOptionalDefault: 5

Video duration in seconds.

Value	Description
4-15	Any integer between 4 and 15

Notes

Duration directly affects billing

Example8

qualitystringOptionalDefault: 720p

Video resolution.

Value	Description
480p	Lower resolution, lower cost
720p	Standard (default)
1080p	High quality

Example720p

aspect_ratiostringOptionalDefault: 16:9

Video aspect ratio.

Value	Description
16:9	Landscape (1280×720 / 864×496)
9:16	Portrait (720×1280 / 496×864)
1:1	Square (960×960 / 640×640)
4:3	Standard (1112×834 / 752×560)
3:4	Portrait standard (834×1112 / 560×752)
21:9	Ultrawide (1470×630 / 992×432)
adaptive	Auto-select based on prompt

Example16:9

generate_audiobooleanOptionalDefault: true

Whether to generate synchronized audio. No extra charge.

Notes

Place dialogue within double quotes in prompt for better results

Exampletrue

model_params.web_searchbooleanOptionalDefault: false

Web search — model autonomously decides whether to search internet content based on the prompt.

Notes

May increase latency
Fees are only charged when searches are actually triggered
Multiple searches may occur once enabled
Wrapped inside model_params object

Examplefalse

callback_urlstringOptional

HTTPS callback URL for task completion notification.

Notes

Triggered on completion, failure, or cancellation
HTTPS only, max 2048 chars

Examplehttps://your-domain.com/webhooks/video-done

Request Example

{
  "model": "seedance-2.0-text-to-video",
  "prompt": "A cinematic drone shot over a coastal city at golden hour",
  "duration": 8,
  "quality": "720p",
  "aspect_ratio": "16:9",
  "generate_audio": true
}

Response Example

{
  "created": 1761313744,
  "id": "task-unified-1761313744-abc123",
  "model": "seedance-2.0-text-to-video",
  "object": "video.generation.task",
  "progress": 0,
  "status": "pending",
  "type": "video"
}