Seedance 2.0 API
Price: $0.056 - 0.496(~ 3.834 - 33.75 credits) per second of video
With video input: billed on (input+output) duration at reduced rate. Min billable input = output duration.
Highest stability with guaranteed 99.9% uptime. Recommended for production environments.
Use the same API endpoint for all versions. Only the model parameter differs.
Up to 9 images (jpg/png/webp, ≤30MB each, 300-6000px, aspect ratio 0.4-2.5). Use prompt to assign roles (e.g., "image 1 as first frame"). Total request body ≤ 64MB.
Click to upload or drag and drop
Supported formats: JPG, JPEG, PNG, WEBP
Maximum file size: 30MB; Maximum files: 9
Up to 3 videos (mp4/mov, ≤50MB each, 480p/720p, 2-15s per clip, total ≤15s, 24-60 FPS). Aspect ratio 0.4-2.5, 300-6000px per side, frame pixels 409,600-927,408. Video input duration is billed.
Click to upload or drag and drop
Supported formats: MP4, MOV
Maximum file size: 50MB; Maximum files: 3
Up to 3 tracks (mp3/wav, ≤15MB each, 2-15s per clip, total ≤15s). ⚠️ Cannot be used alone — at least 1 image or video is required.
Click to upload or drag and drop
Supported formats: MP3, WAV
Maximum file size: 15MB; Duration: max 16s
Generate synchronized audio. No extra charge.
History
Max 20 items0 running · 0 completed
Billing Rules
- •Without video input: billed per second of output video only.
- •With video input: input video duration is also billed — both input and output seconds are charged at a reduced per-second rate (see pricing table).
- •Minimum billable input duration = output duration (e.g., 3s input + 10s output → billed as 10s + 10s = 20s).
- •Output duration range: 4–15 seconds.
- •Audio generation: included at no extra charge.
- •Supports: up to 9 images, 3 videos (≤15s total), 3 audio tracks (≤15s total).
Pricing
| Model | Mode | Quality | Video Input | Price |
|---|---|---|---|---|
| Seedance 2.0 Reference to Video | Video Generation | 480p | No | $0.092/ second(6.2775 Credits) |
| Seedance 2.0 Reference to Video | Video Generation | 720p | No | $0.199/ second(13.5 Credits) |
| Seedance 2.0 Reference to Video | Video Generation | 1080p | No | $0.496/ second(33.75 Credits) |
| Seedance 2.0 Reference to Video | Video Generation | 480p | Yes (input+output) | $0.056/ second(3.834 Credits) |
| Seedance 2.0 Reference to Video | Video Generation | 720p | Yes (input+output) | $0.121/ second(8.2215 Credits) |
| Seedance 2.0 Reference to Video | Video Generation | 1080p | Yes (input+output) | $0.302/ second(20.52 Credits) |
If it's down, we automatically use the next cheapest available—ensuring 99.9% uptime at the best possible price.
ByteDance Seedance 2.0 API and AI Video Generator
Use the Seedance 2.0 API to run ByteDance Seedance as an AI video generator for text, image, video, and audio inputs — including real human video generation with lifelike faces, expressions, and full-body motion. This Seedance video generator adds the @-reference system, synchronized audio, video-to-video editing, and up to 15-second generation, while the Pricing tab helps you compare current Seedance price options.

What can you build with Seedance 2.0 API?
Seedance 2.0 Reference-Driven Video Production
With Seedance 2.0 API, upload a reference video and the model extracts camera movement, motion dynamics, and transition style via the @-reference system. Generate dozens of on-brand video variations from one hero clip without reshooting. Supports up to 3 video references per request for combining camera from one source, motion from another, and style from a third.
Seedance 2.0 Audio-Synced Content with Reference Audio
Seedance 2.0 API lets you provide up to 3 audio tracks as references. Seedance 2.0 aligns cuts, motion energy, and scene transitions to beat and rhythm. The output includes native synchronized audio: dialogue lip-syncs in multiple languages, sound effects match on-screen action, and background music follows the mood of your audio reference.
Seedance 2.0 Multi-Reference Storyboard to Video
Seedance 2.0 lets you combine up to 9 reference images with text prompts to control composition, character, and environment across shots. Seedance 2.0 fills the gaps between keyframes with consistent identity, lighting, and style. Ideal for ad production, product demo sequences, and animated storyboards.
Seedance 2.0 Real Human Video Generation
Seedance 2.0 now fully supports real human video generation through EvoLink's API. Upload a portrait photo and generate video with lifelike facial expressions, natural micro-expressions, full-body motion including dance and athletics, and multi-language lip-synced dialogue. Ideal for face-led ads, spokesperson content, influencer-style creative, and realistic portrait storytelling.
How Seedance 2.0 Compares - All models on one EvoLink API key
Seedance 2.0 leads with multimodal @-reference inputs, video-to-video editing, and the highest image reference count among major video generation models.
Seedance 2.0 API Multimodal @-Reference System
Seedance 2.0 is the only model supporting video, audio, and image references in a single request. Upload up to 9 images + 3 videos + 3 audio tracks to control camera, motion, rhythm, and style - capabilities unavailable in Sora 2, Kling 3.0, or Seedance 1.5 Pro.
Seedance 2.0 API Video-to-Video Editing
Seedance 2.0 API enables editing specific segments, characters, or actions in existing videos. Seedance 2.0 supports targeted V2V modifications - a feature not available in Sora 2 or Kling 3.0.
Seedance 2.0 API Real Human Video Support
Seedance 2.0 on EvoLink fully supports real human video generation — upload a portrait photo and produce video with lifelike expressions, full-body motion, and multi-language lip-sync. This is a capability that remains limited or restricted on competing platforms like Kling 3.0 and Sora 2.
Seedance 2.0 API Competitive Pricing via EvoLink
Access Seedance 2.0 API through EvoLink's unified API with competitive per-second pricing. If you are comparing Seedance price, one API key gives you access to ByteDance Seedance, Sora 2, Kling 3.0, Veo 3.1, and more - with automatic routing to the best provider.
Seedance 2.0 vs Kling 3.0 vs Sora 2
| Feature | Seedance 2.0 | Kling 3.0 | Sora 2 |
|---|---|---|---|
| EvoLink price | From $0.092/s | $0.079/s | $0.08/s |
| Current route quality | 480p / 720p / 1080p | 720p / 1080p | 720p |
| Native audio | Yes | AI sound effects | Yes (synchronized) |
| Reference control | Text + image + video + audio | Text + image | Text + image |
| Video length | 4-15s | 3-15s | 4 / 8 / 12s |
| Real human video | Full support | Limited | Limited |
| Best for | Premium multimodal control, directed production, real human video | General-purpose video, per-second billing | OpenAI ecosystem, longer clips |
How to Integrate Seedance 2.0 API
Seamlessly integrate multimodal video generation into your app with EvoLink's unified API. Supports T2V, I2V, and V2V modes.
Step 1 - Get Your API Key
Sign up on EvoLink.ai and generate your secure API key from the dashboard. This key authenticates all your requests to the Seedance 2.0 endpoint.
Step 2 - Submit Generation Task
Send a POST request to `/v1/videos/generations` with your text prompt, image URLs, or video/audio references. Use the `references` parameter to pass video, audio, and image inputs for the @-reference system. The API processes this asynchronously and returns a task `id` for tracking.
Step 3 - Retrieve Video Result
Use the `task_id` to poll the status endpoint or configure a webhook. Once completed, you'll receive a secure URL to download your synchronized audio-video file in MP4+AAC format.
Seedance 2.0 API Capabilities
Technical specifications for multimodal video production
@-Reference System
Upload up to 9 images + 3 videos + 3 audio references per request. The model extracts camera paths, motion patterns, rhythm, and style from source media.
Up to 1080p Output
The current EvoLink Seedance 2.0 route exposes 480p, 720p, and 1080p quality options, depending on mode and pricing.
4-15s Duration
Supports variable video lengths from 4 to 15 seconds, with multi-shot consistency for longer narrative sequences.
Text, Image, Video & Audio Input
Supports T2V (text-to-video), I2V (image-to-video), and V2V (video-to-video) generation modes with combinable multimodal inputs.
Native Audio with Lip-Sync
Generates synchronized dialogue, sound effects, and background music. Lip-sync support for multiple languages.
Real Human Video Generation
Generate realistic videos of real people from reference photos. Supports lifelike facial expressions, micro-expressions, full-body motion, and multi-language lip-synced dialogue.
Commercial Rights
Commercial usage rights subject to BytePlus terms, enabled securely through the EvoLink platform.
Seedance 2.0 API Frequently Asked Questions
Everything you need to know about the product and billing.
All Seedance API Models
EvoLink provides unified API access to the full Seedance model family: All models share the same Seedance API endpoint. Switch models with one parameter.
API Reference
Select endpoint
Authentication
All APIs require Bearer Token authentication.
Authorization:
Bearer YOUR_API_KEY/v1/videos/generationsCreate Reference-to-Video
Multimodal reference-to-video generation with image, video, and audio references. Supports new generation, video editing, and video extension.
Now supports AIGC-generated realistic human materials.
Asynchronous processing — use the returned task ID to query status. Video links are valid for 24 hours.
Request Parameters
modelstringRequiredDefault: seedance-2.0-reference-to-videoFixed value: seedance-2.0-reference-to-video
seedance-2.0-reference-to-videopromptstringOptionalText prompt describing the video. Use natural language to assign roles to references.
Notes
- Example: 'Use video 1 camera movement, image 1 as style reference, audio 1 as background music'
- Chinese: ≤ 500 characters, English: ≤ 1000 words
Use the camera movement from video 1, with image 1 as style referenceimage_urlsarrayOptionalReference image URLs (0-9 images).
Notes
- Formats: .jpeg, .png, .webp
- Aspect ratio (width/height): 0.4 ~ 2.5
- Width/height pixels: 300 ~ 6000 px
- Max 30MB per image
- Total request body ≤ 64MB, do not use Base64
- Cannot provide only audio_urls — at least 1 image or 1 video is required
["https://example.com/ref1.jpg", "https://example.com/ref2.jpg"]video_urlsarrayOptionalReference video URLs (0-3 videos).
Notes
- Formats: .mp4, .mov
- Resolution: 480p or 720p
- Duration: 2-15s per video, max 3 videos, total ≤ 15s
- Aspect ratio (width/height): 0.4 ~ 2.5
- Width/height pixels: 300 ~ 6000 px
- Frame pixels (width × height): 409,600 ~ 927,408 (e.g., 640×640 ~ 834×1112)
- Max 50MB per video, 24-60 FPS
- Video input duration is counted in billing
["https://example.com/reference.mp4"]audio_urlsarrayOptionalReference audio URLs (0-3 clips).
Notes
- Formats: .wav, .mp3
- Duration: 2-15s per clip, max 3 clips, total ≤ 15s
- Max 15MB per clip
- Cannot be used alone — at least 1 image or 1 video must be included
["https://example.com/bgm.mp3"]durationintegerOptionalDefault: 5Output video duration in seconds.
| Value | Description |
|---|---|
| 4-15 | Any integer between 4 and 15 |
Notes
- Duration directly affects billing
10qualitystringOptionalDefault: 720pVideo resolution.
| Value | Description |
|---|---|
| 480p | Lower resolution |
| 720p | Standard (default) |
| 1080p | High quality |
720paspect_ratiostringOptionalDefault: 16:9Video aspect ratio. 'adaptive' auto-selects based on input (video > image > prompt).
| Value | Description |
|---|---|
| 16:9 | Landscape (1280×720 / 864×496) |
| 9:16 | Portrait (720×1280 / 496×864) |
| 1:1 | Square (960×960 / 640×640) |
| 4:3 | Standard (1112×834 / 752×560) |
| 3:4 | Portrait standard (834×1112 / 560×752) |
| 21:9 | Ultrawide (1470×630 / 992×432) |
| adaptive | Auto-detect from input |
16:9generate_audiobooleanOptionalDefault: trueWhether to generate synchronized audio. No extra charge.
truecallback_urlstringOptionalHTTPS callback URL for task completion notification.
https://your-domain.com/webhooks/video-done