Models

Browse available AI models across video, image, chat, and music generation.

Video Generation

Seedance 2.0

Multi-modal video generation — text, image, video, audio inputs

View Docs

From From $0.06/s

doubao-seedance-2.0

Seedance 2.0 Fast

Faster Seedance 2.0 with lower pricing — same capabilities, quicker generation

View Docs

From From $0.05/s

doubao-seedance-2.0-fast

Veo 3.1

Google Veo 3.1 — text-to-video, image-to-video, first & last frame, with background music.

View Docs

From From $0.10/video

veo-3

Sora 2

OpenAI Sora 2 — high-quality text-to-video and image-to-video generation.

View Docs

From $0.1/s

sora-2

Kling 3.0

Kuaishou Kling 3.0 — text-to-video and image-to-video with native audio, 720P/1080P, 3-15 sec.

View Docs

From $0.14/s

kling-3.0

Kling 2.6

Kuaishou Kling 2.6 — image-to-video, 5s/10s, with optional native audio.

View Docs

From $0.55/video

kling-2.6

Image Generation

Nano Banana 2

Google Gemini 3.1 Flash Image — lightning-fast speed with Pro-level quality, 4K output, and text rendering.

View Docs

From $0.08/image

nano-banana-2

GPT Image 2

OpenAI GPT Image 2 — one public model id for text-to-image and image-to-image editing.

View Docs

From $0.012/image

gpt-image-2

Chat Completion

GPT 5.4

OpenAI GPT 5.4 — multimodal reasoning model, 256K context, Responses API.

View Docs

From $1.50/M tokens

gpt-5-4

Claude Opus 4.6

Anthropic Claude Opus 4.6 — flagship reasoning model, 200K context, optional extended thinking.

View Docs

From $2.50/M tokens

claude-opus-4-6

Claude Sonnet 4.6

Anthropic Claude Sonnet 4.6 — balanced speed and intelligence, 200K context, optional thinking.

View Docs

From $1.50/M tokens

claude-sonnet-4-6

Gemini 3 Pro

Google Gemini 3 Pro — flagship multimodal model, 1M token context, OpenAI-compatible.

View Docs

From $0.80/M tokens

gemini-3-pro

Gemini 3.1 Pro

Google Gemini 3.1 Pro — multimodal model, 1M token context, with search grounding.

View Docs

From $0.80/M tokens

gemini-3.1-pro

Gemini 3 Flash

Google Gemini 3 Flash — fast multimodal model, 1M token context, native Gemini protocol.

View Docs

From $0.60/M tokens

gemini-3-flash

Gemini 2.5 Pro

Google Gemini 2.5 Pro — multimodal model, 1M tokens, thinking on by default.

View Docs

From $0.285/M tokens

gemini-2.5-pro

Music Generation

Suno V5.5

Suno V5.5 — AI music generation with tailored custom models for personalized taste.

View Docs

From $0.12/song

suno-v5_5

Suno V5

Suno V5 — AI music generation with superior expression and faster generation.

View Docs

From $0.12/song

suno-v5