heygen/lipsync-speed

Fast lip-sync: replace or dub audio on any video with quick audio-driven lip sync

9 runs

Readme

HeyGen Lipsync Speed

Replace or dub audio on an existing video with fast, audio-driven lip sync. Give it a video and new audio, and it re-animates the speaker’s lip movements to match.

Overview

Lipsync Speed is optimized for fast turnaround. It uses audio-only analysis to re-sync lip movements, making it ideal for quick dubbing, voiceover replacement, and content localization workflows where speed matters more than pixel-perfect accuracy.

How it works

  1. Upload a source video containing a speaking person
  2. Upload replacement audio (new dialogue, translated speech, etc.)
  3. The model re-animates the speaker’s lips to match the new audio
  4. Get back a video with synchronized lip movements

Use cases

  • Dubbing: Replace dialogue in videos with different languages
  • Voiceover replacement: Swap narration while keeping the visual in sync
  • Content localization: Quickly adapt videos for different markets
  • Audio correction: Fix audio issues while maintaining lip sync

Options

  • Dynamic duration: Let the output video length adjust to match the new audio (on by default)
  • Music removal: Strip background music from the source video
  • Speech enhancement: Improve speech clarity in the output

Pricing

Billed per second of output video at $0.0333/second.

For higher-quality lip sync with avatar inference, see heygen/lipsync-precision.

Model created
Model updated