Generate speech

Generate natural-sounding speech from text with these models. Clone your own voice or choose from a variety of languages and speaking styles.

Recommended models

zsxkib / dia

Dia 1.6B by Nari Labs. Generates realistic dialogue audio from text, including non-verbal cues and voice cloning.

Updated 2 months, 1 week ago

8.7K runs

minimax / voice-cloning

Clone voices to use with Minimax's speech-02-hd and speech-02-turbo

Updated 4 months, 2 weeks ago

14K runs

lucataco / csm-1b

CSM (Conversational Speech Model) is a speech generation model from Sesame that generates RVQ audio codes from text and audio inputs

Updated 6 months ago

911 runs

lucataco / orpheus-3b-0.1-ft

Orpheus 3B: high-quality, emotive text-to-speech

Updated 6 months ago

28.2K runs

cjwbw / voicecraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Updated 6 months, 1 week ago

10.6K runs

fermatresearch / spanish-f5-tts

An F5-TTS model fine-tuned for Spanish

Updated 10 months, 2 weeks ago

803 runs

x-lance / f5-tts

F5-TTS, a state-of-the-art model for open-source voice cloning

Updated 11 months, 1 week ago

29.7K runs

platform-kit / mars5-tts

A novel speech model for insane prosody.

Updated 1 year, 3 months ago

507 runs

chenxwh / openvoice

Updated to OpenVoice v2: Versatile Instant Voice Cloning

Updated 1 year, 4 months ago

72.1K runs

cjwbw / parler-tts

A lightweight text-to-speech (TTS) model trained on 10.5K hours of audio data

Updated 1 year, 5 months ago

2.6K runs

adirik / styletts2

Generates speech from text

Updated 1 year, 7 months ago

131.7K runs

lucataco / pheme

Pheme generates a variety of conversational voices in 16 kHz for phone-call applications

Updated 1 year, 8 months ago

552 runs

lucataco / xtts-v2

Coqui XTTS-v2: Multilingual Text-to-Speech Voice Cloning

Updated 1 year, 9 months ago

4.3M runs

zsxkib / realistic-voice-cloning

Create song covers with any RVC v2 trained AI voice from audio files.

Updated 1 year, 10 months ago

1M runs

cjwbw / seamless_communication

SeamlessM4T: Massively Multilingual & Multimodal Machine Translation

Updated 2 years ago

87.1K runs

awerks / neon-tts

NeonAI Coqui AI TTS Plugin.

Updated 2 years, 1 month ago

153.7K runs

suno-ai / bark

🔊 Text-Prompted Generative Audio Model

Updated 2 years, 4 months ago

301.6K runs

afiaka87 / tortoise-tts

Generate speech from text and clone voices from MP3 files. By James Betker, a.k.a. "neonbjb".

Updated 3 years, 1 month ago

172.3K runs