You're looking at a specific version of this model. Jump to the model overview.

skytells-research /lipsync:968cd240

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.

Field Type Default value Description
face
string
video/image that contains faces to use
audio
string
video/audio file to use as raw audio source
pads
string
0 10 0 0
Padding for the detected face bounding box. Please adjust to include chin at least Format: "top bottom left right"
smooth
boolean
True
Smooth face detections over a short temporal window
fps
number
25
Can be specified only if input is a static image
resize_factor
integer
1
Reduce the resolution by this factor. Sometimes, best results are obtained at 480p or 720p

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{'format': 'uri', 'title': 'Output', 'type': 'string'}