You're looking at a specific version of this model. Jump to the model overview.

kwaivgi /kling-lip-sync:2aa2ba45

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.

Field Type Default value Description
video_url
string
URL of a video for lip syncing. It can be an .mp4 or .mov file, should be less than 100MB, with a duration of 2-10 seconds, and a resolution of 720p-1080p (720-1920px dimensions). Cannot be used with video_id.
audio_file
string
Audio file for lip sync. Must be .mp3, .wav, .m4a, or .aac and less than 5MB.
text
string
Text content for lip sync (if not using audio)
voice_id
None
None
Voice ID for speech synthesis (if using text and not audio)
voice_speed
number
1

Min: 0.8

Max: 2

Speech rate (only used if using text and not audio)
video_id
string
ID of a video generated by Kling. Cannot be used with video_url.

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{'format': 'uri', 'title': 'Output', 'type': 'string'}