minimax / hailuo-2.3
A high-fidelity video generation model optimized for realistic human motion, cinematic VFX, expressive characters, and strong prompt and style adherence across both text-to-video and image-to-video workflows
minimax / hailuo-2.3-fast
A lower-latency image-to-video version of Hailuo 2.3 that preserves core motion quality, visual consistency, and stylization performance while enabling faster iteration cycles.
minimax / hailuo-02-fast
A low cost and fast version of Hailuo 02. Generate 6s and 10s videos in 512p
minimax / hailuo-02
Hailuo 2 is a text-to-video and image-to-video model that can make 6s or 10s videos at 768p (standard) or 1080p (pro). It excels at real world physics.
minimax / image-01
Minimax's first image model, with character reference support
minimax / music-1.5
Music-1.5: Full-length songs (up to 4 mins) with natural vocals & rich instrumentation
minimax / video-01-director
Generate videos with specific camera movements
minimax / music-01
Quickly generate up to 1 minute of music with lyrics and vocals in the style of a reference track
minimax / video-01
Generate 6s videos with prompts or images. (Also known as Hailuo). Use a subject reference to make a video with a character and the S2V-01 model.
minimax / video-01-live
An image-to-video (I2V) model specifically trained for Live2D and general animation use cases
minimax / speech-02-turbo
Text-to-Audio (T2A) that offers voice synthesis, emotional expression, and multilingual capabilities. Designed for real-time applications with low latency
minimax / speech-02-hd
Text-to-Audio (T2A) that offers voice synthesis, emotional expression, and multilingual capabilities. Optimized for high-fidelity applications like voiceovers and audiobooks.
minimax / voice-cloning
Clone voices to use with Minimax's speech-02-hd and speech-02-turbo