HeyGen Avatar IV
Generate realistic talking avatar videos from text using HeyGen’s Avatar IV engine. Provide an avatar, a voice, and a script — get back a video of the avatar speaking your text with natural motion and expressions.
Overview
Avatar IV is HeyGen’s latest avatar engine, producing higher-quality motion, expressions, and lip sync than Avatar III. It supports public avatars, digital twins, and photo avatars.
Inputs
- avatar_id — The avatar to use. Get available IDs from HeyGen’s List All Avatars API.
- input_text — The script the avatar will speak (up to 5,000 characters).
- voice_id — The voice to use. Get available IDs from HeyGen’s List All Voices API.
- avatar_style — Visual framing: normal, closeUp, or circle.
- voice_speed — Speech speed from 0.5× to 1.5×.
- voice_emotion — Optional emotion: Excited, Friendly, Serious, Soothing, or Broadcaster.
- caption — Enable on-screen captions.
- width / height — Output video dimensions (default 1920×1080).
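The constraints in this list can be checked locally before a request is sent. The following is a minimal sketch in Python, not HeyGen's client code: the `AvatarIVInput` class and its `validate` and `to_payload` methods are hypothetical helpers, and the defaults for `avatar_style`, `voice_speed`, and `caption` are assumptions (only the 1920×1080 default comes from the list above). It also assumes the 0.5 to 1.5 speed range is inclusive. It builds a plain dictionary and does not call any API.

```python
from dataclasses import dataclass, asdict
from typing import Optional

# Allowed values taken from the input list above.
AVATAR_STYLES = {"normal", "closeUp", "circle"}
VOICE_EMOTIONS = {"Excited", "Friendly", "Serious", "Soothing", "Broadcaster"}
MAX_SCRIPT_CHARS = 5000


@dataclass
class AvatarIVInput:
    """Hypothetical container for the documented inputs."""
    avatar_id: str
    input_text: str
    voice_id: str
    avatar_style: str = "normal"         # assumed default
    voice_speed: float = 1.0             # assumed default
    voice_emotion: Optional[str] = None  # optional per the list above
    caption: bool = False                # assumed default
    width: int = 1920                    # documented default
    height: int = 1080                   # documented default

    def validate(self) -> None:
        """Raise ValueError if any field violates a documented constraint."""
        if not self.avatar_id or not self.voice_id:
            raise ValueError("avatar_id and voice_id are required")
        if not self.input_text:
            raise ValueError("input_text must not be empty")
        if len(self.input_text) > MAX_SCRIPT_CHARS:
            raise ValueError(f"input_text exceeds {MAX_SCRIPT_CHARS} characters")
        if self.avatar_style not in AVATAR_STYLES:
            raise ValueError(f"avatar_style must be one of {sorted(AVATAR_STYLES)}")
        # Inclusive bounds are an assumption.
        if not 0.5 <= self.voice_speed <= 1.5:
            raise ValueError("voice_speed must be between 0.5 and 1.5")
        if self.voice_emotion is not None and self.voice_emotion not in VOICE_EMOTIONS:
            raise ValueError(f"voice_emotion must be one of {sorted(VOICE_EMOTIONS)}")
        if self.width <= 0 or self.height <= 0:
            raise ValueError("width and height must be positive")

    def to_payload(self) -> dict:
        """Validate, then return a dict that omits an unset voice_emotion."""
        self.validate()
        payload = asdict(self)
        if payload["voice_emotion"] is None:
            del payload["voice_emotion"]
        return payload


if __name__ == "__main__":
    # Placeholder IDs; real ones come from the List All Avatars / Voices APIs.
    request = AvatarIVInput(
        avatar_id="YOUR_AVATAR_ID",
        input_text="Welcome to our product tour.",
        voice_id="YOUR_VOICE_ID",
        avatar_style="closeUp",
        voice_emotion="Friendly",
        caption=True,
    )
    print(request.to_payload())
```

Validating up front lets an over-long script or a misspelled style such as `closeup` fail immediately, rather than after a video generation request has been submitted.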
Output
Returns an MP4 video of the avatar speaking the provided text.
Use cases
- Marketing — Create personalized video ads at scale.
- Education — Build multilingual course videos with consistent presenters.
- Sales — Generate personalized outreach videos.
- Support — Produce how-to videos with a friendly face.
- Social media — Create talking-head content without a camera.