Multimodal video generation API

Seedance 2.0 API

Reference-guided video generation with text, image, video, and audio inputs.

The Seedance 2.0 API is ByteDance Seed's multimodal video generation API for cinematic short clips guided by text, image, video, and audio references. Preserve subject identity, composition, and motion while changing lighting, style, weather, camera feel, or specific scene elements.

Generation speed: 5-8 min
Duration range: 4-15s
Resolution: 480p-1080p
ByteDance
doubao-seedance-2.0 (async task)

Model capabilities

Built for production API workflows, not one-off demos.

Use Seedance 2.0 from the same platform surface as the rest of your video and image stack: API keys, credits, logs, webhooks, and docs stay consistent across providers.

Reference-guided creation

Use product shots, character images, first and last frames, or style references to control visual direction.

Motion and camera transfer

Attach short reference clips so the model can follow movement, composition, rhythm, and transition intent.

Audio-aware generation

Provide music or sound references, or enable synchronized audio generation for richer short-form output.

Multi-shot storytelling

Create short cinematic sequences with stronger scene flow, subject consistency, and production-ready pacing.

API workflow

Submit tasks, track progress, and return generated assets.

01. Submit a generation task

Send the prompt, references, duration, aspect ratio, resolution, and callback URL to the video generation endpoint.

02. Track async progress

Use the returned task ID to poll status, or let the callback deliver completion and failure events.

03. Receive hosted output

Completed tasks return mirrored video URLs and metadata through the same response shape used by other models.

POST /v1/videos/generations
curl -X POST https://api.aivideoapi.ai/v1/videos/generations \
  -H "Authorization: Bearer sk-your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "doubao-seedance-2.0",
    "input": {
      "prompt": "A cinematic product reveal with smooth camera motion",
      "duration": 8,
      "resolution": "1080p",
      "ratio": "16:9"
    }
  }'
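
The same request can be sketched in Python. The endpoint, model id, and input fields mirror the curl example above. The helper names `build_payload` and `submit`, and the client-side range checks (taken from the 4-15s and 480p-1080p limits quoted on this page), are illustrative assumptions rather than part of the documented API.

```python
import json
import urllib.request

API_URL = "https://api.aivideoapi.ai/v1/videos/generations"
RESOLUTIONS = {"480p", "720p", "1080p"}


def build_payload(prompt, duration=8, resolution="1080p", ratio="16:9",
                  model="doubao-seedance-2.0"):
    """Build the JSON body from the curl example, with basic local checks."""
    # Assumed client-side validation; the server may apply different rules.
    if not 4 <= duration <= 15:
        raise ValueError("duration must be between 4 and 15 seconds")
    if resolution not in RESOLUTIONS:
        raise ValueError(f"resolution must be one of {sorted(RESOLUTIONS)}")
    return {
        "model": model,
        "input": {
            "prompt": prompt,
            "duration": duration,
            "resolution": resolution,
            "ratio": ratio,
        },
    }


def submit(payload, api_key):
    """POST the payload and return the decoded JSON response."""
    request = urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)
```

A call such as `submit(build_payload("A cinematic product reveal with smooth camera motion"), api_key)` would send the same body as the curl command. The shape of the response is not shown on this page, so the sketch returns it undecorated.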

What teams build

Product reveal videos
Short social ads
Character-driven clips
Music and beat-matched edits
Storyboard to video drafts
Reference video remixes

Frequently asked questions

Answers about the Seedance 2.0 API.

What is the Seedance 2.0 API?

The Seedance 2.0 API is a multimodal video generation API by ByteDance Seed, exposed through AI Video API. It generates 4- to 15-second cinematic videos at 480p, 720p, or 1080p from text prompts plus optional image, video, and audio references, and returns hosted output URLs through callbacks or polling.

How do I get a Seedance 2.0 API key?

Sign in to AI Video API, open the dashboard, and create an API key. The same key works for Seedance 2.0, Seedance 2.0 Fast, and every other video and image model on the platform. Pay-as-you-go credits start working immediately after sign-up.

How much does the Seedance 2.0 API cost?

Seedance 2.0 is priced by output duration and resolution in credits per second: 18 cr/s at 480p, 38 cr/s at 720p, and 95 cr/s at 1080p. With a video reference attached, pricing drops to 12, 25, and 62 cr/s respectively. Seedance 2.0 Fast is 14.5 cr/s at 480p and 30.5 cr/s at 720p.
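
As a rough budgeting aid, the per-second rates above can be turned into a small estimator. This is a sketch that assumes billing is simply rate times output seconds with no rounding or minimum charge, which this page does not confirm. The `estimate_credits` helper is hypothetical. No video-reference rate is listed for Seedance 2.0 Fast, so that combination is rejected rather than guessed.

```python
# Credits per second, copied from the pricing figures on this page.
# Key: (model id, video reference attached?)
RATES = {
    ("doubao-seedance-2.0", False): {"480p": 18, "720p": 38, "1080p": 95},
    ("doubao-seedance-2.0", True): {"480p": 12, "720p": 25, "1080p": 62},
    ("doubao-seedance-2.0-fast", False): {"480p": 14.5, "720p": 30.5},
}


def estimate_credits(duration_s, resolution, model="doubao-seedance-2.0",
                     video_reference=False):
    """Estimate cost as rate * duration (assumed billing rule)."""
    try:
        rate = RATES[(model, video_reference)][resolution]
    except KeyError:
        raise ValueError("no published rate for this combination") from None
    return rate * duration_s
```

For an 8-second 1080p clip this gives 95 × 8 = 760 credits, or 62 × 8 = 496 credits with a video reference attached.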

What inputs does the Seedance 2.0 API support?

Seedance 2.0 supports text prompts, image references (first frame, last frame, product shots, character images, or style frames), short video references for motion and camera rhythm, and audio references for beat-matched generation. You can also enable native synchronized audio generation in the output clip.

How long does Seedance 2.0 take to generate a video?

A typical Seedance 2.0 task completes in 5 to 8 minutes. Generation is asynchronous: the API returns a task ID immediately, then delivers completion through a callback URL or status polling.
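
When a callback is not practical, polling could look something like the sketch below. This page gives neither the status endpoint nor the status values, so the caller supplies the lookup as a `fetch_status` function, and the terminal names `"succeeded"` and `"failed"` are placeholders. The 20-second interval and 15-minute timeout are assumptions chosen to sit comfortably around the typical 5 to 8 minute completion time.

```python
import time


def wait_for_task(task_id, fetch_status, interval_s=20.0, timeout_s=900.0,
                  sleep=time.sleep, clock=time.monotonic):
    """Poll fetch_status(task_id) until a terminal status or a timeout.

    fetch_status is provided by the caller and must return a dict that
    contains a "status" key.
    """
    terminal = {"succeeded", "failed"}  # hypothetical status names
    deadline = clock() + timeout_s
    while True:
        task = fetch_status(task_id)
        if task.get("status") in terminal:
            return task
        if clock() >= deadline:
            raise TimeoutError(f"task {task_id} not finished after {timeout_s}s")
        sleep(interval_s)
```

Injecting `sleep` and `clock` keeps the loop testable without real waiting. In production the defaults apply.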

What's the difference between Seedance 2.0 and Seedance 2.0 Fast?

Seedance 2.0 (model id doubao-seedance-2.0) supports the full 480p to 1080p resolution range and is tuned for final production output. Seedance 2.0 Fast (doubao-seedance-2.0-fast) is optimized for 480p and 720p drafts and rapid batch testing at a lower per-second cost.
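
One way to encode that split in client code is a small selector. The two model ids come from this page, while the `choose_model` helper and its `draft` flag are hypothetical names for illustration.

```python
STANDARD = "doubao-seedance-2.0"
FAST = "doubao-seedance-2.0-fast"
FAST_RESOLUTIONS = {"480p", "720p"}  # resolutions listed for Fast on this page


def choose_model(resolution, draft=False):
    """Pick Fast for 480p/720p drafts and the standard model otherwise."""
    if draft and resolution in FAST_RESOLUTIONS:
        return FAST
    return STANDARD
```

A 1080p request always falls back to the standard model, even when marked as a draft, because Fast is only described for 480p and 720p.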

Does the Seedance 2.0 API generate audio?

Yes. Set generate_audio to true in the request body and Seedance 2.0 will produce synchronized audio with the video. You can also pass a music or sound reference to bias rhythm and tone.
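
A minimal sketch of toggling the flag on an existing request body follows. The `generate_audio` name comes from the answer above, but this page does not say whether it belongs at the top level or under `input`. Placing it under `input`, next to the other generation parameters, is an assumption. The `with_audio` helper is hypothetical.

```python
import copy


def with_audio(payload, enabled=True):
    """Return a copy of payload with generate_audio set under "input".

    The placement under "input" is assumed, not documented on this page.
    """
    updated = copy.deepcopy(payload)
    updated.setdefault("input", {})["generate_audio"] = enabled
    return updated
```

Because the helper copies the payload, the original dict can be reused for a silent variant of the same request.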

Is there a free trial for the Seedance 2.0 API?

New AI Video API accounts receive a small starting credit balance that covers Seedance 2.0 draft generations at 480p. You can top up credits as you go, with no monthly commitment or contract.

Pricing and usage

Clear model options with shared credits.

480p text or image input: 18 credits/s (standard draft generation)
480p with video reference: 12 credits/s (lowest-cost reference workflow)
720p text or image input: 38 credits/s (balanced production quality)
720p with video reference: 25 credits/s (reference-guided 720p)
1080p text or image input: 95 credits/s (final cinematic output)
1080p with video reference: 62 credits/s (reference-guided 1080p)

Start building with Seedance 2.0 in AI Video API.

Create one API key, use one credit balance, and switch between video and image models without provider-specific plumbing.
