You're looking at a specific version of this model. Jump to the model overview.
devxpy /cog-wav2lip:36e77d44
Input schema
The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.
Field | Type | Default value | Description |
---|---|---|---|
face |
string
|
video/image that contains faces to use
|
|
audio |
string
|
video/audio file to use as raw audio source
|
|
pads |
string
|
0 10 0 0
|
Padding for the detected face bounding box.
Please adjust to include chin at least
Format: "top bottom left right"
|
smooth |
boolean
|
True
|
Smooth face detections over a short temporal window
|
fps |
number
|
25
|
Can be specified only if input is a static image
|
resize_factor |
integer
|
1
|
Reduce the resolution by this factor. Sometimes, best results are obtained at 480p or 720p
|
face_det_batch_size |
integer
|
16
|
Batch size for face detection
|
wav2lip_batch_size |
integer
|
128
|
Batch size for Wav2Lip model(s)
|
Output schema
The shape of the response you’ll get when you run this model with an API.
Schema
{'format': 'uri', 'title': 'Output', 'type': 'string'}