tmappdev/cosy_voice_cloner:5c6a1398
Input schema
The fields you can use to run this model with an API. If you don’t give a value for a field, its default value is used.
Field | Type | Default value | Description |
---|---|---|---|
reference_audio | string | | Path to reference audio (3-10s) |
text | string | | Text to synthesize |
language | None | English | Language mode |
split_method | None | By Sentences (4 each) | Text splitting method |
speed | number | 1 | Speech speed (1.0 is normal speed) |
top_k | integer | 20 | Top-K sampling |
top_p | number | 0.6 | Top-P sampling |
temperature | number | 0.6 | Sampling temperature |
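As a minimal sketch of how the defaults apply, the helper below assembles an input payload from the fields listed above. The function name `build_input` is hypothetical, and treating `reference_audio` and `text` as required is an assumption based only on the fact that no default is listed for them. The sketch makes no network call; the resulting dictionary is what you would pass as the input when running the model.

```python
# Defaults copied from the input schema table.
DEFAULTS = {
    "language": "English",
    "split_method": "By Sentences (4 each)",
    "speed": 1,
    "top_k": 20,
    "top_p": 0.6,
    "temperature": 0.6,
}

# Assumption: fields with no listed default must be supplied by the caller.
REQUIRED = ("reference_audio", "text")


def build_input(**overrides):
    """Merge caller-supplied values over the documented defaults."""
    unknown = set(overrides) - set(DEFAULTS) - set(REQUIRED)
    if unknown:
        raise ValueError(f"unknown fields: {sorted(unknown)}")
    missing = [name for name in REQUIRED if name not in overrides]
    if missing:
        raise ValueError(f"missing fields: {missing}")
    # Later keys win, so overrides replace defaults.
    return {**DEFAULTS, **overrides}


# Hypothetical values for illustration only.
payload = build_input(
    reference_audio="reference.wav",
    text="Hello from a cloned voice.",
    speed=1.1,
)
```

Fields left out of the call (here `language`, `split_method`, `top_k`, `top_p`, and `temperature`) keep the default values from the table.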
Output schema
The shape of the response you’ll get when you run this model with an API.
Schema
{'format': 'uri', 'title': 'Output', 'type': 'string'}
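The schema says the output is a single string in URI format. A small sketch of checking a response against that shape follows; `check_output` is a hypothetical helper, and it assumes you hold the raw string value (a client library may wrap the output in another object, in which case this check does not apply directly).

```python
from urllib.parse import urlparse


def check_output(output):
    """Return the output unchanged if it is a string with a URI scheme."""
    if not isinstance(output, str):
        raise TypeError(f"expected a string, got {type(output).__name__}")
    if not urlparse(output).scheme:
        raise ValueError(f"not a URI: {output!r}")
    return output


# Hypothetical URL for illustration only.
audio_url = check_output("https://example.com/output.wav")
```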