Sora 2 API
A serene mountain landscape at golden hour, with mist rolling through the valleys and sunlight breaking through the clouds. The camera slowly pans across the scene, revealing distant peaks and a winding river below.
Upload reference images
Click to upload or drag and drop
Supported formats: JPG, JPEG, PNG, WEBP
Maximum file size: 10MB; Maximum files: 10
History
Max 20 items0 running · 0 completed
Sora 2 API for realistic video with sound
Use the Sora 2 API to turn text into short videos with synchronized dialogue and sound effects, plus stronger instruction following and more realistic motion.

Pricing
| Model | Mode | duration | Price |
|---|---|---|---|
| Sora 2 Beta Max | Video Generation | 10s | Popular $0.0800/ video(5.76 Credits) |
| Sora 2 Beta Max | Video Generation | 15s | $0.1200/ video(8.64 Credits) |
If it's down, we automatically use the next cheapest available—ensuring 99.9% uptime at the best possible price.
What can you build with the Sora 2 API?
Synchronized text-to-video with audio
Use the Sora 2 API to create short videos where dialogue, music, and sound effects align with the visuals. Write a clear prompt, choose size and seconds, and receive a cohesive clip instead of stitched audio. This is ideal for ads, trailers, and social posts where timing matters for marketing teams and creators.

Storyboard-ready scenes and product shots
Build storyboards, product showcases, and explainer visuals with the Sora 2 API. You can provide an optional input_reference image as the first frame to keep the look consistent across variants. This helps teams create on-brand clips quickly while iterating on messaging and call-to-action timing. It works well for landing pages, in-app tours, and product launches.

Physics-aware motion and realism
Use the Sora 2 API for motion that feels grounded. OpenAI reports improved physics accuracy and realism, so objects move more naturally and scenes hold together over time. This makes the model useful for education clips, product demos, and visual explanations where believable motion builds trust. It reduces the need for heavy manual cleanup in post.

Why teams choose the Sora 2 API on EvoLink
Sora 2 API on EvoLink follows OpenAI's documented video generation workflow with clear policy guardrails.
Transparent Usage Pricing
The API reference defines allowed models, clip durations, and output sizes for video generation, so teams can standardize requests and plan usage consistently.
Safe Production Defaults
OpenAI's Sora usage policies require consent for real people and prohibit impersonation or harmful content, giving teams clear boundaries for compliant workflows.
Single Endpoint Access
The video guide shows a single POST /videos workflow with optional input_reference for a first frame, which simplifies integration and keeps creative control in one request.
How to integrate the Sora 2 API
Generate your first Sora 2 API clip in minutes by following the official video generation flow with EvoLink's API key and routing.
Step 1 - Authenticate & Choose Model
Authenticate with your EvoLink key and choose sora-2 or sora-2-pro based on quality and cost needs for each project.
Step 2 - Prompt, Size, and Seconds
Send a prompt describing subject, style, and action, plus optional input_reference, size, and seconds parameters for the clip you want. The API accepts seconds 4, 8, or 12 and sizes 720x1280, 1280x720, 1024x1792, or 1792x1024. If you include an image, match the target resolution and use JPEG, PNG, or WebP.
Step 3 - Retrieve and Publish
Poll for completion, then download the video file with audio and publish or edit it in your workflow immediately.
Sora 2 API capabilities
Highlights from OpenAI's Sora 2 release and official docs (Sep 30, 2025)
Synchronized audio generation
OpenAI describes Sora 2 as a general-purpose video-audio system that can create speech, background soundscapes, and sound effects with strong realism. This helps clips feel complete without separate audio tooling.
Physics-aware realism
OpenAI reports better physics accuracy and more realistic motion in Sora 2, which helps scenes hold together over time. This is useful for demos, education visuals, and product storytelling that depend on credibility.
Multi-shot instruction following
Sora 2 can follow intricate instructions across multiple shots while keeping world state consistent. That means fewer continuity breaks when you describe sequences or camera changes.
Image reference first frame
The video generation guide supports an input_reference image that becomes the first frame. The image must match the target size, and supported formats include JPEG, PNG, and WebP.
Style range
OpenAI highlights that Sora 2 excels at realistic, cinematic, and anime styles. You can guide style with clear prompt cues to match your brand or campaign tone.
API guardrails
The video generation guide lists guardrails such as under-18 suitable content only, no real people, and rejection of copyrighted characters or music. Input images with human faces are also rejected.
Sora 2 API - Frequently Asked Questions
Everything you need to know about the product and billing.
API Reference
Select endpoint
Authentication
All APIs require Bearer Token authentication.
Authorization:
Bearer YOUR_API_KEY/v1/videos/generationsCreate Video
Sora 2 Beta Max (sora-2-beta-max) model supports text-to-video, image-to-video and other modes.
Asynchronous processing mode, use the returned task ID to .
Generated video links are valid for 24 hours, please save them promptly.
Important Notes
- Currently Sora2's content moderation mechanism is very strict, tasks may fail due to this.
- Image input containing real human figures is not currently supported.
Request Parameters
modelstringRequiredDefault: sora-2-beta-maxVideo generation model name.
sora-2-beta-maxpromptstringRequiredPrompt describing what kind of video to generate.
Notes
- Limited to 5000 tokens
A cat playing pianoaspect_ratiostringOptionalDefault: 16:9Video aspect ratio, `16:9` generates landscape video, `9:16` generates portrait video.
| Value | Description |
|---|---|
| 16:9 | Landscape video |
| 9:16 | Portrait video |
16:9image_urlsarrayOptionalReference image URL list for image-to-video feature.
Notes
- Images containing real human figures are not supported
- Max 1 image per request
- Max size: 10MB
- Formats: .jpg, .jpeg, .png, .webp
- URLs must be directly viewable by the server
http://example.com/image1.jpgdurationintegerOptionalDefault: 10Specifies the generated video duration in seconds.
| Value | Description |
|---|---|
| 10 | 10 seconds duration |
| 15 | 15 seconds duration (Higher Cost) |
Notes
- Billing is based on duration
10callback_urlstringOptionalHTTPS callback address after task completion.
Notes
- Triggered on completion, failure, or cancellation
- Sent after billing confirmation
- HTTPS only, no internal IPs
- Max length: 2048 chars
- Timeout: 10s, Max 3 retries
https://your-domain.com/webhooks/video-task-completed