lucataco / sadtalker

Stylized Audio-Driven Single Image Talking Face Animation

  • Public
  • 18.6K runs
  • L40S
  • GitHub
  • Paper
  • License

Input

*file
Preview
source_image

Upload the source image, it can be video.mp4 or picture.png

*file
Preview
Video Player is loading.
Current Time 00:00:000
Duration 00:00:000
Loaded: 0%
Stream Type LIVE
Remaining Time 00:00:000
 
1x

Upload the driven audio, accepts .wav and .mp4 file

string

Choose a face enhancer

Default: "gfpgan"

string

how to preprocess the images

Default: "full"

file

path to reference video providing eye blinking

file

path to reference video providing pose

boolean

can crop back to the original videos for the full body aniamtion when preprocess is full

Default: true

Output

Generated in

Run time and cost

This model costs approximately $0.15 to run on Replicate, or 6 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.

This model runs on Nvidia L40S GPU hardware. Predictions typically complete within 149 seconds. The predict time for this model varies significantly based on the inputs.

Readme

Re-upload of cjwbw/sadtalker to run on an A40

This model also contains an experimental feature, to select None for enhancer

Support

Give me a follow if you like my work! @lucataco93