Generate speech

Generate natural-sounding speech from text with these models. Clone your own voice or choose from a variety of languages and speaking styles.

Recommended models

zsxkib / dia

Dia 1.6B by Nari Labs generates realistic dialogue audio from text, including non-verbal cues and voice cloning

Updated 1 month, 2 weeks ago

8K runs

minimax / voice-cloning

Clone voices to use with Minimax's speech-02-hd and speech-02-turbo

Updated 3 months, 4 weeks ago

11.8K runs

lucataco / csm-1b

CSM (Conversational Speech Model) is a speech generation model from Sesame that generates RVQ audio codes from text and audio inputs

Updated 5 months, 2 weeks ago

891 runs

lucataco / orpheus-3b-0.1-ft

Orpheus 3B: high-quality, emotive text-to-speech

Updated 5 months, 2 weeks ago

22.8K runs

cjwbw / voicecraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Updated 5 months, 2 weeks ago

10.5K runs

fermatresearch / spanish-f5-tts

An F5-TTS model fine-tuned for Spanish

Updated 9 months, 3 weeks ago

741 runs

x-lance / f5-tts

F5-TTS, the new state of the art in open-source voice cloning

Updated 10 months, 3 weeks ago

28.3K runs

platform-kit / mars5-tts

A novel speech model for insane prosody.

Updated 1 year, 2 months ago

500 runs

chenxwh / openvoice

Updated to OpenVoice v2: Versatile Instant Voice Cloning

Updated 1 year, 3 months ago

70.1K runs

cjwbw / parler-tts

Lightweight text-to-speech (TTS) model trained on 10.5K hours of audio data

Updated 1 year, 4 months ago

2.6K runs

camenduru / metavoice

MetaVoice-1B: 1.2B parameter base model trained on 100K hours of speech

Updated 1 year, 6 months ago

12.5K runs

adirik / styletts2

Generates speech from text

Updated 1 year, 7 months ago

131.6K runs

lucataco / pheme

Pheme generates a variety of conversational voices at 16 kHz for phone-call applications

Updated 1 year, 7 months ago

546 runs

zsxkib / realistic-voice-cloning

Create song covers from audio files with any RVC v2-trained AI voice.

Updated 1 year, 9 months ago

997.1K runs

cjwbw / seamless_communication

SeamlessM4T: Massively Multilingual & Multimodal Machine Translation

Updated 1 year, 11 months ago

86.2K runs

awerks / neon-tts

NeonAI Coqui AI TTS Plugin.

Updated 2 years ago

148.8K runs

suno-ai / bark

🔊 Text-Prompted Generative Audio Model

Updated 2 years, 4 months ago

301.2K runs

afiaka87 / tortoise-tts

Generate speech from text and clone voices from MP3 files. By James Betker, AKA "neonbjb".

Updated 3 years, 1 month ago

172.2K runs