
minimax / speech-02-turbo
Text-to-Audio (T2A) that offers voice synthesis, emotional expression, and multilingual capabilities. Designed for real-time applications with low latency

minimax / speech-02-hd
Text-to-Audio (T2A) that offers voice synthesis, emotional expression, and multilingual capabilities. Optimized for high-fidelity applications like voiceovers and audiobooks.

minimax / voice-cloning
Clone voices to use with Minimax's speech-02-hd and speech-02-turbo

minimax / image-01
Minimax's first image model, with character reference support

minimax / video-01-director
Generate videos with specific camera movements

minimax / video-01
Generate 6s videos with prompts or images. (Also known as Hailuo). Use a subject reference to make a video with a character and the S2V-01 model.

minimax / music-01
Quickly generate up to 1 minute of music with lyrics and vocals in the style of a reference track

minimax / video-01-live
An image-to-video (I2V) model specifically trained for Live2D and general animation use cases