Wan 2.6 Image To Video API
Turn any image into a polished 1080p video with audio so your posts, ads, and stories get noticed instead of skipped.
Animate this image with smooth motion
No sample available
Upload audio for video generation (3-30 seconds, MP3/WAV)
Click to upload or drag and drop
Supported formats: MP3, WAV
Maximum file size: 50MB; Duration: 3-30s
Upload reference images
Click to upload or drag and drop
Supported formats: JPG, JPEG, PNG, WEBP
Maximum file size: 10MB; Maximum files: 10
Click Generate to see preview
History
Max 20 items0 running · 0 completed
wan 2.6 image to video made for social media creators
Upload an image, describe your idea, and let wan 2.6 image to video turn it into a cinematic short clip that is ready to post, promote, and share.

What is wan 2.6 image to video?
AI image to video in a few clicks
wan 2.6 image to video turns a single photo (product shots, portraits, or brand visuals) into a short, smooth video with optional audio—great when you need fresh creatives fast. It’s part of Alibaba’s Wan2.6 series, which focuses on cinematic results with multi-shot storytelling and stronger instruction following. You can run the same image multiple times to generate variations, making it easy to test different hooks for Reels, TikTok, Shorts, and ads without heavy editing.

Built for social feeds, stories, and ads
wan 2.6 image to video is made for short-form platforms where you have seconds to earn attention. Start from a product photo, lifestyle image, or selfie, then guide camera movement and vibe with a short prompt. The Wan2.6 series highlights multi-shot storytelling and improved audio-visual synchronization, helping clips feel more like real video instead of static content. The result is a simple way to produce post-ready and ad-ready variations from assets you already have.

Image to video for education and storytelling
wan 2.6 image to video also works for explainers and lessons: turn diagrams, slides, or screenshots into short video clips that hold attention longer. With up to 15-second outputs, you have enough time for a clear mini-story or step-by-step concept. For course creators and educators, this means more engaging content without a complex editing workflow—reuse what you already have, then add motion with prompts.

Why creators choose wan 2.6 image to video
wan 2.6 image to video helps creators ship cinematic short videos faster, with multi-shot storytelling and stronger audio-visual sync.
Create more content, faster
Turn one good image into multiple short video variations for posts and ads, without a full filming or editing workflow.
More cinematic, more consistent
Wan2.6 focuses on multi-shot storytelling, better instruction following, and improved visual consistency for professional-looking clips.
Richer scenes with sound
Wan2.6 highlights audio-visual synchronization and richer sound effects, so outputs feel more lifelike and engaging.
How to use wan 2.6 image to video
Call the API, get a task ID, then poll task status to retrieve the video URL (valid for 24 hours).
Prepare inputs and API key
Get your Bearer API key from EvoLink dashboard. Host your image on a publicly accessible URL. Supported formats: JPEG, JPG, PNG, BMP, WEBP (max 10MB, 360-2000px). Write a text prompt (max 1500 characters).
Create a generation task
POST to the videos/generations endpoint with model name, prompt, and image URL. Optionally set duration (5, 10, or 15 seconds), quality (720p or 1080p), prompt enhancement, and callback URL.
Query status and download video
Use the returned task ID to check status until completed. Fetch the output video link and download it promptly. The link expires in 24 hours.
Key benefits of wan 2.6 image to video
Create better short videos from images with less time and fewer steps.
1080p, 24fps output
Generate clean 720p or 1080p short videos that look sharp on modern feeds.
Image + prompt + optional audio
Start from one image, add a prompt, and optionally include an audio URL to guide the vibe.
Cinematic motion, plain language
Describe simple moves like zoom, pan, or reveal to get video‑like motion without editing.
Multi‑shot storytelling
Create short sequences that feel like real scenes, not a basic slideshow.
Fast creative iteration
Generate multiple variations quickly to test hooks, styles, and ad angles.
Social‑ready delivery
Get shareable clips you can drop into ads, posts, and content calendars right away.
Wan 2.6 vs Kling 2.6 vs Veo 3.1
Quick pricing + resolution notes (based on fal listed usage pricing).
| Model | Duration | Resolution | Price | Strength |
|---|---|---|---|---|
| Wan v2.6 (Image-to-Video) | N/A | 720p / 1080p | $0.10/s (720p), $0.15/s (1080p) on fal | Cost-effective i2v with clear HD tiers for social content. |
| Kling 2.6 Pro (Image-to-Video) | N/A | HD (not explicitly stated on fal page) | $0.07/s (audio off), $0.14/s (audio on) on fal | Cinematic motion plus native audio generation for dialogue-style clips. |
| Veo 3.1 (Text-to-Video) | N/A | HD (not explicitly stated on fal page) | $0.20/s (audio off), $0.40/s (audio on) on fal | Premium t2v quality with audio, good for higher-end brand storytelling. |
wan 2.6 image to video FAQ
Everything you need to know about the product and billing.
API Reference
Select endpoint
Authentication
All APIs require Bearer Token authentication.
Authorization:
Bearer YOUR_API_KEY/v1/videos/generationsCreate Video
WAN2.6 (wan2.6-image-to-video) model supports first-frame image-to-video generation.
Asynchronous processing mode, use the returned task ID to .
Generated video links are valid for 24 hours, please save them promptly.
Request Parameters
modelstringRequiredDefault: wan2.6-image-to-videoModel name.
wan2.6-image-to-videopromptstringRequiredPrompt describing the video you want to generate.
Notes
- Limited to 1500 characters
A cat playing pianoimage_urlsarrayRequiredReference image URL list for first-frame image-to-video generation.
Notes
- Single request supports 1 image
- Image size: no more than 10MB
- Supported formats: .jpeg, .jpg, .png (transparent channel not supported), .bmp, .webp
- Image resolution: width and height range is [360, 2000] pixels
- Image URL must be directly accessible by the server
https://example.com/image1.pngdurationintegerOptionalDefault: 5Specifies the duration of the generated video (in seconds).
| Value | Description |
|---|---|
| 5 | 5 seconds duration |
| 10 | 10 seconds duration |
| 15 | 15 seconds duration |
Notes
- Each request will be pre-charged based on the duration value, actual charge is based on the generated video duration
5qualitystringOptionalDefault: 720pVideo quality. 1080p costs 1.67x more than 720p.
| Value | Description |
|---|---|
| 720p | Standard definition, standard price (default) |
| 1080p | High definition, 1.67x price |
720pprompt_extendbooleanOptionalDefault: trueWhether to enable intelligent prompt rewriting. When enabled, a large model will optimize the prompt, which significantly improves results for simple or insufficiently descriptive prompts.
| Value | Description |
|---|---|
| true | Enable intelligent prompt rewriting (default) |
| false | Disable intelligent prompt rewriting |
truemodel_paramsobjectOptionalModel parameter configuration.
model_params.shot_typestringOptionalDefault: singleSpecifies the shot type for the generated video.
| Value | Description |
|---|---|
| single | Outputs single-shot video (default) |
| multi | Outputs multi-shot video |
Notes
- Only effective when prompt_extend is true
- shot_type priority > prompt priority
singlecallback_urlstringOptionalHTTPS callback URL for task completion.
Notes
- Triggered when task is completed, failed, or cancelled
- Sent after billing confirmation
- Only HTTPS protocol is supported
- Callbacks to internal IP addresses are prohibited
- URL length must not exceed 2048 characters
- Timeout: 10 seconds, Max 3 retries
https://your-domain.com/webhooks/video-task-completed