Seedance 2.0 API

Seedance 2.0 is ByteDance's second-generation video model for multimodal video generation. Available globally through EvoLink with per-second billing.

Model Type:

Text to Video Image to Video ✓Reference to Video

Fast Text to Video Fast Image to Video Fast Reference to Video

Price: $0.039 - 0.139(~ 2.82 - 10 credits) per second of video

With video input: billed on (input+output) duration at reduced rate. Min billable input = output duration.

Highest stability with guaranteed 99.9% uptime. Recommended for production environments.

Use the same API endpoint for all versions. Only the model parameter differs.

Prompt

93 (suggested: 2,000)

Reference Images (Optional)

Up to 9 images (jpg/png/webp, ≤30MB each, 300-6000px). Use prompt to assign roles (e.g., "image 1 as first frame"). Total request body ≤ 64MB.

Click to upload or drag and drop

Supported formats: JPG, JPEG, PNG, WEBP
Maximum file size: 30MB; Maximum files: 9

Reference Videos (Optional)

Up to 3 videos (mp4/mov, ≤50MB each, 480p/720p, 2-15s per clip, total ≤15s). Video input duration is billed at reduced rate.

Click to upload or drag and drop

Supported formats: MP4, MOV
Maximum file size: 50MB; Maximum files: 3

Reference Audio (Optional)

Up to 3 tracks (mp3/wav, ≤15MB each, 2-15s per clip, total ≤15s). ⚠️ Cannot be used alone — at least 1 image or video is required.

Click to upload or drag and drop

Supported formats: MP3, WAV
Maximum file size: 15MB; Duration: max 15s

Duration5s

4s15s

Quality

Aspect Ratio

Generate Audio

Generate synchronized audio. No extra charge.

Click Generate to see preview

History

Max 20 items

0 running · 0 completed

Your generation history will appear here

Billing Rules

•Without video input: billed per second of output video only.
•With video input: input video duration is also billed — both input and output seconds are charged at a reduced per-second rate (see pricing table).
•Minimum billable input duration = output duration (e.g., 3s input + 10s output → billed as 10s + 10s = 20s).
•Output duration range: 4–15 seconds.
•Audio generation: included at no extra charge.
•Supports: up to 9 images, 3 videos (≤15s total), 3 audio tracks (≤15s total).

Pricing

Model	Mode	Quality	Video Input	Price
Seedance 2.0 Reference to Video	Video Generation	480p	No	$0.064/ second(4.63 Credits)
Seedance 2.0 Reference to Video	Video Generation	720p	No	$0.139/ second(10 Credits)
Seedance 2.0 Reference to Video	Video Generation	480p	Yes (input+output)	$0.039/ second(2.82 Credits)
Seedance 2.0 Reference to Video	Video Generation	720p	Yes (input+output)	$0.085/ second(6.09 Credits)

Seedance 2.0 Reference to Video

Video Generation

Quality:480p

Video Input:No

Price:

$0.064/ second

(4.63 Credits)

Seedance 2.0 Reference to Video

Video Generation

Quality:720p

Video Input:No

Price:

$0.139/ second

(10 Credits)

Seedance 2.0 Reference to Video

Video Generation

Quality:480p

Video Input:Yes (input+output)

Price:

$0.039/ second

(2.82 Credits)

Seedance 2.0 Reference to Video

Video Generation

Quality:720p

Video Input:Yes (input+output)

Price:

$0.085/ second

(6.09 Credits)

If it's down, we automatically use the next cheapest available—ensuring 99.9% uptime at the best possible price.

ByteDance Seedance 2.0 API and AI Video Generator

Use the Seedance 2.0 API to run ByteDance Seedance as an AI video generator for text, image, video, and audio inputs. This Seedance video generator adds the @-reference system, synchronized audio, video-to-video editing, and up to 15-second generation, while the Pricing tab helps you compare current Seedance price options.

ByteDance Seedance 2.0 AI video generator and API showcase

What can you build with Seedance 2.0 API?

Seedance 2.0 Reference-Driven Video Production

With Seedance 2.0 API, upload a reference video and the model extracts camera movement, motion dynamics, and transition style via the @-reference system. Generate dozens of on-brand video variations from one hero clip without reshooting. Supports up to 3 video references per request for combining camera from one source, motion from another, and style from a third.

Seedance 2.0 Audio-Synced Content with Reference Audio

Seedance 2.0 API lets you provide up to 3 audio tracks as references. Seedance 2.0 aligns cuts, motion energy, and scene transitions to beat and rhythm. The output includes native synchronized audio: dialogue lip-syncs in multiple languages, sound effects match on-screen action, and background music follows the mood of your audio reference.

Seedance 2.0 Multi-Reference Storyboard to Video

Seedance 2.0 lets you combine up to 9 reference images with text prompts to control composition, character, and environment across shots. Seedance 2.0 fills the gaps between keyframes with consistent identity, lighting, and style. Ideal for ad production, product demo sequences, and animated storyboards.

How Seedance 2.0 Compares - All models on one EvoLink API key

Seedance 2.0 leads with multimodal @-reference inputs, video-to-video editing, and the highest image reference count among major video generation models.

Seedance 2.0 API Multimodal @-Reference System

Seedance 2.0 is the only model supporting video, audio, and image references in a single request. Upload up to 9 images + 3 videos + 3 audio tracks to control camera, motion, rhythm, and style - capabilities unavailable in Sora 2, Kling 3.0, or Seedance 1.5 Pro.

Seedance 2.0 API Video-to-Video Editing

Seedance 2.0 API enables editing specific segments, characters, or actions in existing videos. Seedance 2.0 supports targeted V2V modifications - a feature not available in Sora 2 or Kling 3.0.

Seedance 2.0 API Competitive Pricing via EvoLink

Access Seedance 2.0 API through EvoLink's unified API with competitive per-second pricing. If you are comparing Seedance price, one API key gives you access to ByteDance Seedance, Sora 2, Kling 3.0, Veo 3.1, and more - with automatic routing to the best provider.

How to Integrate Seedance 2.0 API

Seamlessly integrate multimodal video generation into your app with EvoLink's unified API. Supports T2V, I2V, and V2V modes.

Step 1 - Get Your API Key

Sign up on EvoLink.ai and generate your secure API key from the dashboard. This key authenticates all your requests to the Seedance 2.0 endpoint.

Step 2 - Submit Generation Task

Send a POST request to `/v1/videos/generations` with your text prompt, image URLs, or video/audio references. Use the `references` parameter to pass video, audio, and image inputs for the @-reference system. The API processes this asynchronously and returns a task `id` for tracking.

Step 3 - Retrieve Video Result

Use the `task_id` to poll the status endpoint or configure a webhook. Once completed, you'll receive a secure URL to download your synchronized audio-video file in MP4+AAC format.

Seedance 2.0 API Capabilities

Technical specifications for multimodal video production

Multimodal

@-Reference System

Upload up to 9 images + 3 videos + 3 audio references per request. The model extracts camera paths, motion patterns, rhythm, and style from source media.

Quality

Up to 720p Output

The current EvoLink Seedance 2.0 route exposes 480p and 720p quality options, depending on mode and pricing.

Flexibility

4-15s Duration

Supports variable video lengths from 4 to 15 seconds, with multi-shot consistency for longer narrative sequences.

Modes

Text, Image, Video & Audio Input

Supports T2V (text-to-video), I2V (image-to-video), and V2V (video-to-video) generation modes with combinable multimodal inputs.

Audio

Native Audio with Lip-Sync

Generates synchronized dialogue, sound effects, and background music. Lip-sync support for multiple languages.

Licensing

Commercial Rights

Commercial usage rights subject to BytePlus terms, enabled securely through the EvoLink platform.

Seedance 2.0 API Frequently Asked Questions

Everything you need to know about the product and billing.

Seedance 2.0 is ByteDance's second-generation video model. Compared to Seedance 1.5 Pro, it introduces the @-reference system for multimodal inputs (video, audio, and image references), video-to-video editing mode, 15-second max duration (was 12s), and up to 9 image + 3 video + 3 audio references per request. It is a major upgrade in both capabilities and flexibility.

Seedance 2.0 is billed per second through EvoLink. On the current public route, text-to-video pricing is exposed at 480p and 720p, and audio generation is included at no extra charge. Check the pricing table above for the exact current per-second rates.

Yes. EvoLink provides global access to Seedance 2.0 API with no regional restrictions. You can integrate from anywhere using a single API key.

The @-reference system is Seedance 2.0's breakthrough feature. It allows you to upload reference media (videos, audio tracks, images) and the model extracts specific attributes - camera paths from video, rhythm and beat from audio, composition and style from images. You can combine references from different sources in a single request to precisely control the output.

Seedance 2.0 supports text prompts combined with up to 9 reference images, 3 reference videos, and 3 reference audio tracks - all combinable in a single request. Generation modes include text-to-video (T2V), image-to-video (I2V), and video-to-video (V2V) editing.

Seedance 2.0 is the only model offering video and audio reference inputs, the @-reference system, and V2V editing - features not available in Sora 2 or Kling 3.0. It supports up to 9 image references (vs 1 for competitors). All three models are available through EvoLink's unified API. For a detailed side-by-side comparison, see our blog post.

Yes. Seedance 2.0 generates native synchronized audio including dialogue with lip-sync in multiple languages, sound effects matched to on-screen action, and background music. Audio generation can be toggled on/off per request, and audio references can be used to guide rhythm and mood.

On the current EvoLink route, output videos are in MP4 format (H.264) with AAC audio. Supported quality options are 480p and 720p. Duration: 4-15 seconds. Frame rate: 24fps. Aspect ratios: 16:9, 9:16, 1:1, 4:3, 3:4, 21:9, and adaptive.

Yes. EvoLink's Seedance API provides a unified endpoint for Seedance 2.0, 1.5 Pro, and 1.0 Pro Fast. Use the model parameter to switch between versions.

All Seedance API Models

EvoLink provides unified API access to the full Seedance model family: All models share the same Seedance API endpoint. Switch models with one parameter.

Explore Seedance family View Seedance 1.5 Pro View Seedance 1.0 Pro Fast

API Reference

Select endpoint

Authentication

All APIs require Bearer Token authentication.

Header

Authorization: 
Bearer YOUR_API_KEY

Get API Key

POST

/v1/videos/generations

Create Reference-to-Video

Multimodal reference-to-video generation with image, video, and audio references. Supports new generation, video editing, and video extension.

Asynchronous processing — use the returned task ID to query status. Video links are valid for 24 hours.

Request Parameters

modelstringRequiredDefault: seedance-2.0-reference-to-video

Fixed value: seedance-2.0-reference-to-video

Exampleseedance-2.0-reference-to-video

promptstringOptional

Text prompt describing the video. Use natural language to assign roles to references.

Notes

Example: 'Use video 1 camera movement, image 1 as style reference, audio 1 as background music'
Chinese: ≤ 500 characters, English: ≤ 1000 words

ExampleUse the camera movement from video 1, with image 1 as style reference

image_urlsarrayOptional

Reference image URLs (0-9 images).

Notes

Formats: jpg, png, webp
Aspect ratio: 0.4-2.5
Pixel range: 300-6000px per side
Max 30MB per image

Example["https://example.com/ref1.jpg", "https://example.com/ref2.jpg"]

video_urlsarrayOptional

Reference video URLs (0-3 videos, total ≤ 15s).

Notes

Formats: mp4, mov
Resolution: 480p or 720p
Duration: 2-15s per video, total ≤ 15s
Max 50MB per video, 24-60 FPS
Video input duration is billed at reduced rate

Example["https://example.com/reference.mp4"]

audio_urlsarrayOptional

Reference audio URLs (0-3 tracks, total ≤ 15s).

Notes

Formats: wav, mp3
Duration: 2-15s per track, total ≤ 15s
Max 15MB per audio
Cannot use audio alone — must include at least 1 image or video

Example["https://example.com/bgm.mp3"]

durationintegerOptionalDefault: 5

Output video duration in seconds.

Value	Description
4-15	Any integer between 4 and 15
-1	Smart duration (model decides)

Example10

qualitystringOptionalDefault: 720p

Video resolution.

Value	Description
480p	Lower resolution
720p	Standard (default)

Example720p

aspect_ratiostringOptionalDefault: 16:9

Video aspect ratio. 'adaptive' auto-selects based on input (video > image > prompt).

Value	Description
16:9	Landscape
9:16	Portrait
1:1	Square
adaptive	Auto-detect from input

Example16:9

generate_audiobooleanOptionalDefault: true

Whether to generate synchronized audio. No extra charge.

Exampletrue

callback_urlstringOptional

HTTPS callback URL for task completion notification.

Examplehttps://your-domain.com/webhooks/video-done

Request Example (Multimodal Reference)

{
  "model": "seedance-2.0-reference-to-video",
  "prompt": "Use video 1 camera movement, audio 1 as background music. Product showcase ad.",
  "image_urls": ["https://example.com/ref1.jpg", "https://example.com/ref2.jpg"],
  "video_urls": ["https://example.com/reference.mp4"],
  "audio_urls": ["https://example.com/bgm.mp3"],
  "duration": 10,
  "quality": "720p",
  "aspect_ratio": "16:9"
}

Request Example (Video Extension)

{
  "model": "seedance-2.0-reference-to-video",
  "prompt": "Continue the scene, camera enters the room through the window",
  "video_urls": [
    "https://example.com/part1.mp4",
    "https://example.com/part2.mp4"
  ],
  "duration": 8,
  "generate_audio": true
}

Response Example

{
  "created": 1761313744,
  "id": "task-unified-1761313744-abc123",
  "model": "seedance-2.0-reference-to-video",
  "object": "video.generation.task",
  "progress": 0,
  "status": "pending",
  "type": "video"
}