heygen/avatar-iv

Create realistic talking avatar videos from text with HeyGen's Avatar IV engine

43 runs

HeyGen Avatar IV

Generate realistic talking avatar videos from text using HeyGen’s Avatar IV engine. Provide an avatar, a voice, and a script — get back a video of the avatar speaking your text with natural motion and expressions.

Overview

Avatar IV is HeyGen’s latest avatar engine, producing higher-quality motion, expressions, and lip sync compared to Avatar III. It supports public avatars, digital twins, and photo avatars.

Inputs

  • avatar_id — The avatar to use. Get available IDs from HeyGen’s List All Avatars API.
  • input_text — The script the avatar will speak (up to 5,000 characters).
  • voice_id — The voice to use. Get available IDs from HeyGen’s List All Voices API.
  • avatar_style — Visual framing: normal, closeUp, or circle.
  • voice_speed — Speech speed from 0.5× to 1.5×.
  • voice_emotion — Optional emotion: Excited, Friendly, Serious, Soothing, or Broadcaster.
  • caption — Enable on-screen captions.
  • width / height — Output video dimensions (default 1920×1080).

Output

Returns an MP4 video of the avatar speaking the provided text.

Use cases

  • Marketing — Create personalized video ads at scale.
  • Education — Build multilingual course videos with consistent presenters.
  • Sales — Generate personalized outreach videos.
  • Support — Produce how-to videos with a friendly face.
  • Social media — Create talking-head content without a camera.
Model created
Model updated