Fast-tier image-to-video with optional start-to-end frame transitions, flexible duration and aspect ratio, resolution up to 720p, and optional synchronized…
Use this file to discover all available pages before exploring further.
Try Seedance 2.0 Fast - Image to Video in the Workbench
Run this model interactively, tune parameters, and compare outputs.
Model ID:bytedance-seedance-2-0-fast-image-to-videoByteDance Seedance 2 Fast image-to-video animates a starting frame from a text motion prompt, with optional end-frame control for transitions. This is the enterprise fast tier with lower latency and cost. It supports 480p, or 720p output, durations from 4–15 seconds or automatic length from the prompt, multiple aspect ratios (including auto from the input image), and synchronized audio (effects, ambience, and lip-synced speech).
Use the Workbench as a request builder: configure parameters for this model in the UI, then open the API tab to copy the exact cURL or Python call.
Sync
Async
Async with SSE
This blocks until the video is ready (typically 5-15 minutes). Prefer Async or Async with SSE for anything beyond quick experimentation.See the video generation reference for more details.
The URL of the image to use as the last frame of the video. When provided, the generated video will transition from the starting image to this ending image. Supported formats: JPEG, PNG, WebP. Max 30 MB. Format: uri.
resolution
string
"720p"
Video resolution - 480p for faster generation, 720p for better quality. One of: 480p, 720p.
duration
string
"auto"
Duration of the video in seconds. Supports 4 to 15 seconds, or auto to let the model decide based on the prompt. One of: auto, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15.
aspect_ratio
string
"auto"
The aspect ratio of the generated video. Use 16:9 for landscape, 9:16 for portrait/vertical, 1:1 for square, 21:9 for ultrawide cinematic, or auto to infer from the input image. One of: auto, 21:9, 16:9, 4:3, 1:1, 3:4, 9:16.
generate_audio
boolean
true
Whether to generate synchronized audio for the video, including sound effects, ambient sounds, and lip-synced speech. The cost of video generation is the same regardless of whether audio is generated or not.
seed
integer
—
Random seed for reproducibility. Note that results may still vary slightly even with the same seed.
⌘I
Assistant
Responses are generated using AI and may contain mistakes.