Generate natural-sounding speech from text with these powerful models. Clone your own voice or pick from a variety of languages and speaking styles.
Featured models
resemble-ai/chatterbox-multilingual
Generate expressive, natural speech in 23 languages. Features instant voice cloning from short audio, emotion control, and seamless cross-language voice transfer.
Updated 1 month, 1 week ago
2.2K runs
resemble-ai/chatterbox
Generate expressive, natural speech. Features unique emotion control, instant voice cloning from short audio, and built-in watermarking.
Updated 3 months, 3 weeks ago
119.4K runs
resemble-ai/chatterbox-pro
Generate expressive, natural speech with Resemble AI's Chatterbox.
Updated 3 months, 4 weeks ago
13K runs
minimax/speech-02-turbo
Text-to-Audio (T2A) that offers voice synthesis, emotional expression, and multilingual capabilities. Designed for real-time applications with low latency.
Updated 5 months, 1 week ago
4.2M runs
minimax/speech-02-hd
Text-to-Audio (T2A) that offers voice synthesis, emotional expression, and multilingual capabilities. Optimized for high-fidelity applications like voiceovers and audiobooks.
Updated 5 months, 1 week ago
837.4K runs
jaaari/kokoro-82m
Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)
Updated 8 months, 2 weeks ago
50.6M runs
Recommended Models
zsxkib/dia
Dia 1.6B by Nari Labs generates realistic dialogue audio from text, including non-verbal cues and voice cloning
Updated 3 months ago
9.2K runs
minimax/voice-cloning
Clone voices to use with Minimax's speech-02-hd and speech-02-turbo
Updated 5 months, 1 week ago
16.4K runs
lucataco/csm-1b
CSM (Conversational Speech Model) is a speech generation model from Sesame that generates RVQ audio codes from text and audio inputs
Updated 6 months, 3 weeks ago
954 runs
lucataco/orpheus-3b-0.1-ft
Orpheus 3B - high-quality, emotive text-to-speech
Updated 6 months, 3 weeks ago
29.1K runs
cjwbw/voicecraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Updated 7 months ago
10.6K runs
fermatresearch/spanish-f5-tts
An F5-TTS model fine-tuned for Spanish
Updated 11 months ago
889 runs
x-lance/f5-tts
F5-TTS, the new state of the art in open-source voice cloning
Updated 1 year ago
31.2K runs
platform-kit/mars5-tts
A novel speech model for insane prosody.
Updated 1 year, 3 months ago
515 runs
chenxwh/openvoice
Updated to OpenVoice v2: Versatile Instant Voice Cloning
Updated 1 year, 4 months ago
75.9K runs
cjwbw/parler-tts
Lightweight text-to-speech (TTS) model, trained on 10.5K hours of audio data
Updated 1 year, 6 months ago
2.6K runs
adirik/styletts2
Generates speech from text
Updated 1 year, 8 months ago
131.8K runs
lucataco/pheme
Pheme generates a variety of conversational voices in 16 kHz for phone-call applications
Updated 1 year, 9 months ago
556 runs
lucataco/xtts-v2
Coqui XTTS-v2: Multilingual Text To Speech Voice Cloning
Updated 1 year, 10 months ago
4.4M runs
zsxkib/realistic-voice-cloning
Create song covers with any RVC v2 trained AI voice from audio files.
Updated 1 year, 11 months ago
1.1M runs
cjwbw/seamless_communication
SeamlessM4T: Massively Multilingual & Multimodal Machine Translation
Updated 2 years, 1 month ago
88.2K runs
awerks/neon-tts
NeonAI Coqui AI TTS Plugin.
Updated 2 years, 2 months ago
158.3K runs
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
Updated 2 years, 5 months ago
301.9K runs
afiaka87/tortoise-tts
Generate speech from text, clone voices from mp3 files. From James Betker AKA "neonbjb".
Updated 3 years, 2 months ago
172.5K runs