You're looking at a specific version of this model. Jump to the model overview.

echo-the-coder /kokoro-82m-zh:e043d5a6

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.

Field Type Default value Description
text
string
Text to synthesize. Supports Chinese/English mixed content.
zh_voice
string
zf_001
Voice for Chinese segments. Recommended prefix: zf_* or zm_*.
en_voice
string
af_maple
Voice for English segments. Recommended prefix: af_* or bf_*.
speed
number
1

Min: 0.5

Max: 1.5

Speech speed.
segment_pause_ms
integer
120

Max: 1000

Silence duration between Chinese/English segments (milliseconds).

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{'format': 'uri', 'title': 'Output', 'type': 'string'}