minimax | Replicate

minimax / music-cover

Reimagine any song in a different style — change voice, instruments, genre, and arrangement while keeping the original melody

8.1K runs

Public

minimax / music-2.6

Generate full-length songs or instrumentals from a text prompt, with optional auto-generated lyrics

22.3K runs

Public

minimax / music-2.5

Generate full-length songs with vocals, lyrics, and rich instrumentation from a text prompt

11.4K runs

Public

minimax / speech-2.8-turbo

MiniMax Speech 2.8 Turbo: turn text into natural, expressive speech with voice cloning, emotion control, and support for 40+ languages

549.3K runs

Public

minimax / speech-2.8-hd

MiniMax Speech 2.8 HD: high-fidelity, studio-grade speech generation with flexible emotion control, multilingual support, and voice cloning

171K runs

Public

minimax / speech-2.6-hd

MiniMax Speech 2.6 HD delivers studio-quality multilingual text-to-audio on Replicate with nuanced prosody, subtitle export, and premium voices

194.7K runs

Public

minimax / speech-2.6-turbo

Low-latency MiniMax Speech 2.6 Turbo brings multilingual, emotional text-to-speech to Replicate with 300+ voices and pricing suited to real-time use

1.2M runs

Public

minimax / hailuo-2.3-fast

A lower-latency image-to-video version of Hailuo 2.3 that preserves core motion quality, visual consistency, and stylization performance while enabling faster iteration cycles.

230K runs

Public

minimax / hailuo-2.3

A high-fidelity video generation model optimized for realistic human motion, cinematic VFX, expressive characters, and strong prompt and style adherence across both text-to-video and image-to-video workflows

110.4K runs

Public

minimax / music-1.5

Music-1.5: Full-length songs (up to 4 mins) with natural vocals & rich instrumentation

99.2K runs

Public

minimax / hailuo-02-fast

A low-cost, fast version of Hailuo 02. Generates 6s and 10s videos at 512p

56.9K runs

Public

minimax / hailuo-02

Hailuo 02 is a text-to-video and image-to-video model that makes 6s or 10s videos at 768p (standard) or 1080p (pro). It excels at real-world physics.

434K runs

Public

minimax / voice-cloning

Clone voices to use with MiniMax's speech-02-hd and speech-02-turbo

73.5K runs

Public

minimax / speech-02-turbo

A Text-to-Audio (T2A) model that offers voice synthesis, emotional expression, and multilingual capabilities. Designed for real-time applications with low latency

12.7M runs

Public

minimax / speech-02-hd

A Text-to-Audio (T2A) model that offers voice synthesis, emotional expression, and multilingual capabilities. Optimized for high-fidelity applications like voiceovers and audiobooks.

2.6M runs

Public