Input schema
The fields you can set when running this model through an API. Any field you leave unset uses its default value.
| Field | Type | Default value | Description |
|---|---|---|---|
| prompt | string | | Text prompt for video generation. Can include references like `<<<image_1>>>`, `<<<video_1>>>` to reference inputs. |
| start_image | string | | First frame image for the video. Supports .jpg/.jpeg/.png, max 10MB. |
| end_image | string | | Last frame image for the video. Requires start_image to be set. Supports .jpg/.jpeg/.png, max 10MB. |
| reference_images | array | | Reference images for elements, scenes, or styles (up to 7 without video, 4 with video). Supports .jpg/.jpeg/.png. |
| reference_video | string | | Reference video for style, camera movement, or as base for editing. Supports .mp4/.mov, 3-10s duration, max 200MB. |
| video_reference_type | None | feature | How to use the reference video: 'feature' for style/camera reference, 'base' for video editing. |
| keep_original_sound | boolean | True | Whether to keep the original sound from the reference video. |
| mode | None | pro | Video generation mode. 'std' is cost-effective, 'pro' has higher quality. |
| aspect_ratio | None | 16:9 | Aspect ratio of the generated video. Required for text-to-video. Ignored when using first frame image or video editing. |
| duration | None | 5 | Video duration in seconds. For text/image-to-video: 5 or 10. With video reference (feature type): 3-10. Ignored for video editing (base type). |
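Several fields in the table depend on each other (end_image needs start_image, the reference image limit changes when a video is supplied, and the valid duration depends on the reference type). The sketch below shows one way to check those rules before sending a request. `build_input` is a hypothetical helper, not part of any client library, and it assumes durations are whole seconds; the table does not list the allowed aspect ratios, so that field is passed through unchecked.

```python
from typing import Optional, Sequence

VALID_MODES = {"std", "pro"}
VALID_REFERENCE_TYPES = {"feature", "base"}


def build_input(
    prompt: str,
    start_image: Optional[str] = None,
    end_image: Optional[str] = None,
    reference_images: Sequence[str] = (),
    reference_video: Optional[str] = None,
    video_reference_type: str = "feature",
    keep_original_sound: bool = True,
    mode: str = "pro",
    aspect_ratio: str = "16:9",
    duration: int = 5,
) -> dict:
    """Hypothetical helper: validate fields against the documented rules."""
    # end_image is only valid together with start_image.
    if end_image and not start_image:
        raise ValueError("end_image requires start_image")

    # Up to 7 reference images without a video, 4 with one.
    max_refs = 4 if reference_video else 7
    if len(reference_images) > max_refs:
        raise ValueError(f"at most {max_refs} reference images allowed")

    if mode not in VALID_MODES:
        raise ValueError("mode must be 'std' or 'pro'")
    if video_reference_type not in VALID_REFERENCE_TYPES:
        raise ValueError("video_reference_type must be 'feature' or 'base'")

    # Duration rules: 5 or 10 for text/image-to-video, 3-10 with a
    # 'feature' reference video, ignored for 'base' (video editing).
    if not reference_video:
        if duration not in (5, 10):
            raise ValueError("duration must be 5 or 10 without a reference video")
    elif video_reference_type == "feature":
        if not 3 <= duration <= 10:
            raise ValueError("duration must be between 3 and 10 with a feature reference")

    payload = {
        "prompt": prompt,
        "mode": mode,
        "aspect_ratio": aspect_ratio,
        "duration": duration,
    }
    if start_image:
        payload["start_image"] = start_image
    if end_image:
        payload["end_image"] = end_image
    if reference_images:
        payload["reference_images"] = list(reference_images)
    if reference_video:
        payload["reference_video"] = reference_video
        payload["video_reference_type"] = video_reference_type
        payload["keep_original_sound"] = keep_original_sound
    return payload
```

The returned dictionary uses the field names from the table, so it could be passed as the input object to whichever API client you use. Size and format limits (for example the 10MB image cap) are not checked here because they require inspecting the files themselves.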
Output schema
The shape of the response you’ll get when you run this model with an API.
Schema
{"format": "uri", "title": "Output", "type": "string"}
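Since the schema describes the output as a single string in URI format, a caller may want to sanity-check the value before trying to download it. The following is a minimal sketch using only the standard library; `is_output_uri` is a hypothetical name, and the check is deliberately stricter than the schema in that it assumes a URL with both a scheme and a host (such as an https link).

```python
from urllib.parse import urlparse


def is_output_uri(value: object) -> bool:
    """Hypothetical check: is the output a string that parses as a URL?"""
    if not isinstance(value, str):
        return False
    parsed = urlparse(value)
    # Require both a scheme (e.g. https) and a network location (host).
    return bool(parsed.scheme) and bool(parsed.netloc)
```

A value that fails this check probably indicates an error response or a different output shape than the schema above describes.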