devxpy/cog-wav2lip:35318395

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field, its default value is used.

| Field | Type | Default value | Description |
|---|---|---|---|
| face | string | | Video or image that contains the faces to use |
| audio | string | | Video or audio file to use as the raw audio source |
| pads | string | 0 10 0 0 | Padding for the detected face bounding box. Adjust it to include at least the chin. Format: "top bottom left right" |
| smooth | boolean | True | Smooth face detections over a short temporal window |
| fps | number | 25 | Can be specified only if the input is a static image |
| resize_factor | integer | 1 | Reduce the resolution by this factor. Sometimes the best results are obtained at 480p or 720p |
| face_det_batch_size | integer | 16 | Batch size for face detection |
| wav2lip_batch_size | integer | 128 | Batch size for Wav2Lip model(s) |
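
As a rough illustration of how these fields fit together, the sketch below builds an input payload from the defaults listed in the input schema and checks the `pads` format. The helper name `build_input` and the example URLs are hypothetical, and the check assumes `pads` is four whitespace-separated integers, which the schema only implies through its "top bottom left right" format.

```python
from typing import Any, Dict

# Defaults copied from the input schema; `face` and `audio` list no default.
DEFAULTS: Dict[str, Any] = {
    "pads": "0 10 0 0",
    "smooth": True,
    "fps": 25,
    "resize_factor": 1,
    "face_det_batch_size": 16,
    "wav2lip_batch_size": 128,
}


def build_input(face: str, audio: str, **overrides: Any) -> Dict[str, Any]:
    """Hypothetical helper: merge overrides onto the schema defaults."""
    unknown = set(overrides) - set(DEFAULTS)
    if unknown:
        raise ValueError(f"unknown fields: {sorted(unknown)}")

    payload = {"face": face, "audio": audio, **DEFAULTS, **overrides}

    # Assumption: pads is four integers in "top bottom left right" order.
    parts = payload["pads"].split()
    try:
        values = [int(p) for p in parts]
    except ValueError:
        values = []
    if len(values) != 4:
        raise ValueError('pads must look like "top bottom left right"')
    return payload


# Example: extend the bottom padding so the chin is included.
payload = build_input(
    "https://example.com/face.mp4",     # placeholder URL
    "https://example.com/speech.wav",   # placeholder URL
    pads="0 20 0 0",
)
```

If the Replicate Python client is installed and an API token is configured, a payload like this could presumably be passed as the `input` argument of `replicate.run`, together with the model reference and its full version identifier.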

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{'format': 'uri', 'title': 'Output', 'type': 'string'}
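
Since the schema describes the output as a single string in URI format, a caller might want to sanity-check the value before downloading it. The following is a minimal sketch using only the standard library. The function name `output_filename` and the `output.mp4` fallback are assumptions for illustration, and the check assumes the URI carries a scheme and host, as an HTTP(S) URL does.

```python
import posixpath
from urllib.parse import urlparse


def output_filename(output: str, fallback: str = "output.mp4") -> str:
    """Hypothetical helper: validate the output URI and derive a file name."""
    parsed = urlparse(output)
    # Assumption: a usable output has both a scheme and a host.
    if not parsed.scheme or not parsed.netloc:
        raise ValueError(f"not a usable URI: {output!r}")
    # Take the last path segment; fall back when the path has none.
    name = posixpath.basename(parsed.path)
    return name or fallback


# Example with a placeholder URL.
name = output_filename("https://example.com/files/result.mp4")
```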