synthesys-ai / synthesys-wav2lip

  • Public
  • 105.6K runs
  • A100 (80GB)
Iterate in playground

Input

*file

video/image that contains faces to use

*file

video/audio file to use as raw audio source

string
Shift + Return to add a new line

Padding for the detected face bounding box. Please adjust to include chin at least Format: "top bottom left right"

Default: "0 10 0 0"

boolean

Smooth face detections over a short temporal window

Default: true

Output

No output yet! Press "Submit" to start a prediction.

Run time and cost

This model costs approximately $0.091 to run on Replicate, or 10 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.

This model runs on Nvidia A100 (80GB) GPU hardware. Predictions typically complete within 66 seconds. The predict time for this model varies significantly based on the inputs.

Readme

This model doesn't have a readme.