You're looking at a specific version of this model. Jump to the model overview.

zsxkib /create-video-dataset:97a086d1

Input

string
Shift + Return to add a new line

YouTube/video URL to process. Leave empty if uploading a file. Note: URL takes precedence if both URL and file are provided.

file

Video file to process. Leave empty if using URL. Ignored if URL is provided.

string

Scene detection method: 'content' (fast cuts), 'adaptive' (camera movement), or 'threshold' (fades)

Default: "content"

number

Minimum scene length in seconds

Default: 1

number

Maximum scene length in seconds

Default: 10

integer

Number of scenes to extract (0 = all detected scenes)

Default: 4

number

Target frame rate (e.g. 24, 30). Set to -1 to keep original fps.

Default: 24

number

Start time in seconds for video processing

Default: 0

number

End time in seconds for video processing. Set to 0 to process until the end.

Default: 0

boolean

Automatically skip first 10 seconds (typical intro)

Default: false

boolean

Generate scene previews without creating full dataset

Default: false

string

Video quality preset: 'fast' (lower quality, smaller files), 'balanced', or 'high' (best quality, larger files)

Default: "balanced"

boolean

Let AI generate a caption for your video. If False, you must provide custom_caption.

Default: true

string

Caption style: 'minimal' (short), 'detailed' (longer descriptions), or 'custom'

Default: "detailed"

string
Shift + Return to add a new line

Your custom caption. Required if caption_style is 'custom' or autocaption is False.

string
Shift + Return to add a new line

Trigger word to include in captions (e.g., TOK, STYLE3D). Will be added at start of caption.

Default: "TOK"

string
Shift + Return to add a new line

Text to add BEFORE caption. Example: 'a video of'

string
Shift + Return to add a new line

Text to add AFTER caption. Example: 'in a cinematic style'

Output

No output yet! Press "Submit" to start a prediction.