ddvinh1/audio-lip:a34f1262 | Run with an API on Replicate

You're looking at a specific version of this model. Jump to the model overview.

The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.

Field	Type	Default value	Description
face	string		Input face video or single image
audio	string		Input audio or video (audio will be extracted)
model_variant	None	wav2lip_gan	Choose base or GAN model
static	boolean	False	Use first video frame only (set True for image input)
fps	number	25	FPS if using static image
pads	array	[0, 10, 0, 0]	Padding: top bottom left right
wav2lip_batch_size	integer	128	Batch size
resize_factor	integer	1	Downscale factor for frames before processing
out_height	integer	480	Output video height (e.g., 480 or 720)
crop	array	[0, -1, 0, -1]	Crop region: top bottom left right (-1 auto)
box	array	[-1, -1, -1, -1]	Constant face bbox: top bottom left right (-1 disabled)
rotate	boolean	False	Rotate input video 90 degrees clockwise
nosmooth	boolean	False	Disable bbox smoothing

The shape of the response you’ll get when you run this model with an API.

Schema

{'format': 'uri', 'title': 'Output', 'type': 'string'}