
chenxwh/sadtalker:719575c0

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field, its default value is used.

| Field | Type | Default value | Description |
| --- | --- | --- | --- |
| source_image | string | | Source image to upload; it can be a video (.mp4) or a picture (.png). |
| driven_audio | string | | Driving audio to upload; accepts .wav and .mp4 files. |
| enhancer | string (enum) | gfpgan | Face enhancer to use. Options: gfpgan, RestoreFormer. |
| preprocess | string (enum) | full | How to preprocess the images. Options: crop, resize, full. |
| ref_eyeblink | string | | Path to a reference video providing eye blinking. |
| ref_pose | string | | Path to a reference video providing pose. |
| still | boolean | True | Can crop back to the original video for full-body animation when preprocess is full. |
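
As an illustration of how the fields above fit together, here is a minimal Python sketch that assembles an input payload and checks the two enum fields before sending anything. The helper name `build_input` is hypothetical, and treating `source_image` and `driven_audio` as required is an assumption based on their having no default; the example URLs are placeholders.

```python
from typing import Any, Dict, Optional

# Enum options copied from the input schema.
ENHANCERS = ("gfpgan", "RestoreFormer")
PREPROCESS_MODES = ("crop", "resize", "full")


def build_input(
    source_image: str,
    driven_audio: str,
    enhancer: str = "gfpgan",
    preprocess: str = "full",
    ref_eyeblink: Optional[str] = None,
    ref_pose: Optional[str] = None,
    still: bool = True,
) -> Dict[str, Any]:
    """Hypothetical helper: build an input dict using the schema's defaults."""
    if enhancer not in ENHANCERS:
        raise ValueError(f"enhancer must be one of {ENHANCERS}, got {enhancer!r}")
    if preprocess not in PREPROCESS_MODES:
        raise ValueError(f"preprocess must be one of {PREPROCESS_MODES}, got {preprocess!r}")

    payload: Dict[str, Any] = {
        "source_image": source_image,
        "driven_audio": driven_audio,
        "enhancer": enhancer,
        "preprocess": preprocess,
        "still": still,
    }
    # Optional reference videos are included only when supplied.
    if ref_eyeblink is not None:
        payload["ref_eyeblink"] = ref_eyeblink
    if ref_pose is not None:
        payload["ref_pose"] = ref_pose
    return payload


if __name__ == "__main__":
    # Placeholder URLs, not real assets.
    print(build_input("https://example.com/face.png", "https://example.com/speech.wav"))
```

With the `replicate` Python client, a payload like this would typically be passed as `replicate.run(model_ref, input=payload)`. This page shows only the short version id `719575c0`, so `model_ref` has to be taken from the model page itself.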

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{'format': 'uri', 'title': 'Output', 'type': 'string'}
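
Since the schema declares the output as a single string in `uri` format, a caller may want to check the response shape before downloading. The sketch below uses only the standard library; the function name `is_output_uri` is hypothetical, and the check (a string with a URI scheme plus a host or path) is a loose approximation of `format: uri`, not a full validator.

```python
from urllib.parse import urlparse


def is_output_uri(value: object) -> bool:
    """Hypothetical check: True if value is a string that parses as a URI."""
    if not isinstance(value, str):
        return False
    parsed = urlparse(value)
    # Require a scheme and either a network location or a path.
    return bool(parsed.scheme) and bool(parsed.netloc or parsed.path)


if __name__ == "__main__":
    # Placeholder URL, not a real model output.
    print(is_output_uri("https://example.com/output.mp4"))
```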