echo-the-coder/kokoro-82m-zh:e043d5a6
Input schema
The fields you can use to run this model with an API. If you don't give a value for a field, its default value is used.
| Field | Type | Default value | Description |
|---|---|---|---|
| text | string | | Text to synthesize. Supports Chinese/English mixed content. |
| zh_voice | string | zf_001 | Voice for Chinese segments. Recommended prefix: zf_* or zm_*. |
| en_voice | string | af_maple | Voice for English segments. Recommended prefix: af_* or bf_*. |
| speed | number | 1 (Min: 0.5, Max: 1.5) | Speech speed. |
| segment_pause_ms | integer | 120 (Max: 1000) | Silence duration between Chinese/English segments (milliseconds). |
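As a rough illustration of how the defaults and limits in the table fit together, the sketch below builds an input payload and rejects out-of-range values before any request is made. The helper name `build_input` is hypothetical and not part of any client library, and treating `text` as required is an assumption, since the table lists no default for it.

```python
from typing import Any, Dict

# Limits copied from the input schema table.
SPEED_MIN, SPEED_MAX = 0.5, 1.5
PAUSE_MAX_MS = 1000


def build_input(
    text: str,
    zh_voice: str = "zf_001",
    en_voice: str = "af_maple",
    speed: float = 1.0,
    segment_pause_ms: int = 120,
) -> Dict[str, Any]:
    """Hypothetical helper: return an input payload, rejecting out-of-range values."""
    if not SPEED_MIN <= speed <= SPEED_MAX:
        raise ValueError(
            f"speed must be between {SPEED_MIN} and {SPEED_MAX}, got {speed}"
        )
    # The schema documents only a maximum for segment_pause_ms,
    # so only the upper bound is checked here.
    if segment_pause_ms > PAUSE_MAX_MS:
        raise ValueError(
            f"segment_pause_ms must be at most {PAUSE_MAX_MS}, got {segment_pause_ms}"
        )
    return {
        "text": text,
        "zh_voice": zh_voice,
        "en_voice": en_voice,
        "speed": speed,
        "segment_pause_ms": segment_pause_ms,
    }


# Fields left out fall back to the defaults from the table.
payload = build_input("你好, this is a mixed-language sentence.", speed=1.2)
```

The resulting dictionary is what would be passed as the model input by whichever API client you use. Checking locally simply surfaces range errors before a request is sent.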
Output schema
The shape of the response you’ll get when you run this model with an API.
Schema
{"format": "uri", "title": "Output", "type": "string"}
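The schema says only that the output is a single string in URI format. A minimal sketch of a sanity check on a returned value might look like the following. The function name `looks_like_uri` and the example URL are assumptions made for illustration, and the schema does not state what kind of file the URI points to.

```python
from urllib.parse import urlparse


def looks_like_uri(value: object) -> bool:
    """Hypothetical check: True if value is a string that parses with a URI scheme."""
    if not isinstance(value, str):
        return False
    parsed = urlparse(value)
    # Require a scheme plus either a host or a path component.
    return bool(parsed.scheme) and bool(parsed.netloc or parsed.path)


# Placeholder value standing in for a real model response.
example_output = "https://example.com/output"
is_valid = looks_like_uri(example_output)
```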