You're looking at a specific version of this model. Jump to the model overview.

nicolascoutureau /ac:e526d493

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.

Field Type Default value Description
video
string
Input horizontal video to convert to vertical format
aspect_ratio
None
9:16
Output aspect ratio
speed_preset
None
balanced
Processing speed preset. Fast uses smaller YOLO model and lower resolution analysis.
detect_speaker
boolean
True
Detect and focus on the active speaker using TalkNet ASD (audio-visual neural network). When multiple people are detected, focuses on who is talking. Requires GPU.
tracking_mode
None
smooth
Camera tracking mode. Smooth = cinematic OpusClip-like movement. Static = fixed per-scene.
debug_overlay
boolean
False
Draw debug info on video (scene, strategy, speaker, person details, zoom).

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{'format': 'uri', 'title': 'Output', 'type': 'string'}