license: apache-2.0
language:
- en
base_model:
- yl4579/StyleTTS2-LJSpeech
pipeline_tag: text-to-speech
Disclaimer
This is a fork of the original Kokoro repo, in order to provide easy inference on Replicate. I am not affiliated with the original Kokoro authors, and this is not an official release of the Kokoro model. Similar to the Huggingface Space, this implementation provides automatic text splitting to support long form text inputs. See the original README below for more details.
Voices
American English
lang_code='a' in misaki[en]
espeak-ng en-us fallback
Name
Traits
Target Quality
Training Duration
Overall Grade
SHA256
af_heart
๐บโค๏ธ
A
0ab5709b
af_alloy
๐บ
B
MM minutes
C
6d877149
af_aoede
๐บ
B
H hours
C+
c03bd1a4
af_bella
๐บ๐ฅ
A
HH hours
A-
8cb64e02
af_jessica
๐บ
C
MM minutes
D
cdfdccb8
af_kore
๐บ
B
H hours
C+
8bfbc512
af_nicole
๐บ๐ง
B
HH hours
B-
c5561808
af_nova
๐บ
B
MM minutes
C
e0233676
af_river
๐บ
C
MM minutes
D
e149459b
af_sarah
๐บ
B
H hours
C+
49bd364e
af_sky
๐บ
B
M minutes ๐ค
C-
c799548a
am_adam
๐น
D
H hours
F+
ced7e284
am_echo
๐น
C
MM minutes
D
8bcfdc85
am_eric
๐น
C
MM minutes
D
ada66f0e
am_fenrir
๐น
B
H hours
C+
98e507ec
am_liam
๐น
C
MM minutes
D
c8255075
am_michael
๐น
B
H hours
C+
9a443b79
am_onyx
๐น
C
MM minutes
D
e8452be1
am_puck
๐น
B
H hours
C+
dd1d8973
am_santa
๐น
C
M minutes ๐ค
D-
7f2f7582
British English
lang_code='b' in misaki[en]
espeak-ng en-gb fallback
Name
Traits
Target Quality
Training Duration
Overall Grade
SHA256
bf_alice
๐บ
C
MM minutes
D
d292651b
bf_emma
๐บ
B
HH hours
B-
d0a423de
bf_isabella
๐บ
B
MM minutes
C
cdd4c370
bf_lily
๐บ
C
MM minutes
D
6e09c2e4
bm_daniel
๐น
C
MM minutes
D
fc3fce4e
bm_fable
๐น
B
MM minutes
C
d44935f3
bm_george
๐น
B
MM minutes
C
f1bc8122
bm_lewis
๐น
C
H hours
D+
b5204750
Japanese
lang_code='j' in misaki[ja]
Total Japanese training data: H hours
Name
Traits
Target Quality
Training Duration
Overall Grade
SHA256
CC BY
jf_alpha
๐บ
B
H hours
C+
1bf4c9dc
jf_gongitsune
๐บ
B
MM minutes
C
1b171917
gongitsune
jf_nezumi
๐บ
B
M minutes ๐ค
C-
d83f007a
nezuminoyomeiri
jf_tebukuro
๐บ
B
MM minutes
C
0d691790
tebukurowokaini
jm_kumo
๐น
B
M minutes ๐ค
C-
98340afd
kumonoito
Mandarin Chinese
lang_code='z' in misaki[zh]
Total Mandarin Chinese training data: H hours
Name
Traits
Target Quality
Training Duration
Overall Grade
SHA256
zf_xiaobei
๐บ
C
MM minutes
D
9b76be63
zf_xiaoni
๐บ
C
MM minutes
D
95b49f16
zf_xiaoxiao
๐บ
C
MM minutes
D
cfaf6f2d
zf_xiaoyi
๐บ
C
MM minutes
D
b5235dba
zm_yunjian
๐น
C
MM minutes
D
76cbf8ba
zm_yunxi
๐น
C
MM minutes
D
dbe6e1ce
zm_yunxia
๐น
C
MM minutes
D
bb2b03b0
zm_yunyang
๐น
C
MM minutes
D
5238ac22
Spanish
Name
Traits
SHA256
ef_dora
๐บ
d9d69b0f
em_alex
๐น
5eac53f7
em_santa
๐น
aa8620cb
French
lang_code='f' in misaki[en]
espeak-ng fr-fr
Total French training data: <11 hours
Name
Traits
Target Quality
Training Duration
Overall Grade
SHA256
CC BY
ff_siwis
๐บ
B
<11 hours
B-
8073bf2d
SIWIS
Hindi
lang_code='h' in misaki[en]
espeak-ng hi
Total Hindi training data: H hours
Name
Traits
Target Quality
Training Duration
Overall Grade
SHA256
hf_alpha
๐บ
B
MM minutes
C
06906fe0
hf_beta
๐บ
B
MM minutes
C
63c0a1a6
hm_omega
๐น
B
MM minutes
C
b55f02a8
hm_psi
๐น
B
MM minutes
C
2f0f055c
Italian
lang_code='i' in misaki[en]
espeak-ng it
Total Italian training data: H hours
Name
Traits
Target Quality
Training Duration
Overall Grade
SHA256
if_sara
๐บ
B
MM minutes
C
6c0b253b
im_nicola
๐น
B
MM minutes
C
234ed066
Brazilian Portuguese
Name
Traits
SHA256
pf_dora
๐บ
07e4ff98
pm_alex
๐น
cf0ba8c5
pm_santa
๐น
d4210316
โจ You can now pip install kokoro! See Usage .
Kokoro is an open-weight TTS model with 82 million parameters. Despite its lightweight architecture, it delivers comparable quality to larger models while being significantly faster and more cost-efficient. With Apache-licensed weights, Kokoro can be deployed anywhere from production environments to personal projects.
Creative Commons Attribution
The following CC BY audio was part of the dataset used to train Kokoro v1.0.
Audio Data
Duration Used
License
Added to Training Set After
Koniwa tnc
<1h
CC BY 3.0
v0.19 / 22 Nov 2024
SIWIS
<11h
CC BY 4.0
v0.19 / 22 Nov 2024
Acknowledgements
๐ ๏ธ @yl4579 for architecting StyleTTS 2.
๐ @Pendrokar for adding Kokoro as a contender in the TTS Spaces Arena.
๐ Thank you to everyone who contributed synthetic training data.
โค๏ธ Special thanks to all compute sponsors.
๐พ Discord server: https://discord.gg/QuGxSWBfQy
๐ชฝ Kokoro is a Japanese word that translates to โheartโ or โspiritโ. Kokoro is also the name of an AI in the Terminator franchise .
Model created
10ย months ago