What is the WAN 2.7 API?
WAN 2.7 is Alibaba's video generation and editing model. The AI Video API exposes it through one HTTP endpoint (POST /v1/videos/generations) with credits, webhooks, and logs shared across all supported video models.
Generate and edit WAN 2.7 video with synced audio.
WAN 2.7 supports prompt-based generation, first/last-frame animation, multi-reference generation, and instruction-based video editing with native audio. Run it through the same task, polling, callback, and credit balance used by the rest of your model stack.
Model capabilities
Use WAN 2.7 from the same platform surface as the rest of your video and image stack: API keys, credits, logs, webhooks, and docs stay consistent across providers.
Create clips from prompts with configurable aspect ratio, duration, and reference audio.
Drive motion from first and last frame images, with optional driving audio.
Blend reference images and videos (up to 5 combined) into a guided scene with a reference voice.
Edit a source video with text instructions and a reference image, preserving or regenerating audio.
Choose 720p for efficient generation or 1080p for higher-fidelity output.
Use text, first/last-frame images, references, or a source video.
Pick 720p or 1080p and a target duration per mode.
Use task IDs, logs, and callbacks for operational delivery.
curl -X POST https://api.aivideoapi.ai/v1/videos/generations \
-H "Authorization: Bearer sk-your-api-key" \
-H "Content-Type: application/json" \
-d '{
"model": "wan-2.7-text-to-video",
"input": {
"prompt": "A cinematic tracking shot through a rainy neon street at night",
"resolution": "1080p",
"aspect_ratio": "16:9",
"duration": 5
}
}'Frequently asked questions
WAN 2.7 is Alibaba's video generation and editing model. The AI Video API exposes it through one HTTP endpoint (POST /v1/videos/generations) with credits, webhooks, and logs shared across all supported video models.
Four: text-to-video (wan-2.7-text-to-video), image-to-video (wan-2.7-image-to-video), reference-to-video (wan-2.7-reference-to-video), and video editing (wan-2.7-video-edit).
Pricing is per second by resolution: 720p is 24 credits/s and 1080p is 40 credits/s. For video editing, duration=0 bills by the full input video length; a value of 2-10 bills by that clip length.
Yes. Pass a callback_url when creating the task and the platform will POST the completed result (or error) to that URL. Status is also pollable via GET /v1/tasks/{taskId}.
Pricing and usage
Efficient generation
Higher-fidelity output
Output seconds × rate
duration=0 bills the full input video
Create one API key, use one credit balance, and switch between video and image models without provider-specific plumbing.