resemble-ai/chatterbox-multilingual:9cfba4c2
Input schema
The fields you can use to run this model with an API. If you don’t give a value for a field, its default value will be used.
Field | Type | Default value | Description |
---|---|---|---|
text | string | | Text to synthesize into speech (maximum 300 characters) |
language | None | en | Language for synthesis |
reference_audio | string | | Reference audio file for voice cloning (optional). If not provided, uses default voice for the selected language. |
exaggeration | number | 0.5 (Min: 0.25, Max: 2) | Controls speech expressiveness (0.25-2.0, neutral=0.5, extreme values may be unstable) |
temperature | number | 0.8 (Min: 0.05, Max: 5) | Controls randomness in generation (0.05-5.0, higher=more varied) |
cfg_weight | number | 0.5 (Min: 0.2, Max: 1) | CFG/Pace weight controlling generation guidance (0.2-1.0). Use 0.5 for balanced results, 0 for language transfer |
seed | integer | 0 | Random seed for reproducible results (0 for random generation) |
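As a rough illustration of how the fields listed above fit together, the sketch below builds an input payload and checks it against the documented limits before anything is sent. The helper name `build_input` and the `LIMITS` table are hypothetical, not part of the model or any client library. Note that the `cfg_weight` description mentions 0 for language transfer, which falls below the listed minimum of 0.2; this sketch enforces the listed minimum.

```python
from typing import Any, Dict, Optional

# Numeric limits copied from the input schema: (min, max).
LIMITS = {
    "exaggeration": (0.25, 2.0),
    "temperature": (0.05, 5.0),
    "cfg_weight": (0.2, 1.0),
}
MAX_TEXT_CHARS = 300


def build_input(
    text: str,
    language: str = "en",
    reference_audio: Optional[str] = None,
    exaggeration: float = 0.5,
    temperature: float = 0.8,
    cfg_weight: float = 0.5,
    seed: int = 0,
) -> Dict[str, Any]:
    """Return an input dict for the model (hypothetical helper).

    Defaults mirror the schema; values outside the documented
    ranges raise ValueError instead of being sent to the API.
    """
    if len(text) > MAX_TEXT_CHARS:
        raise ValueError(
            f"text is {len(text)} characters; the maximum is {MAX_TEXT_CHARS}"
        )

    numeric = {
        "exaggeration": exaggeration,
        "temperature": temperature,
        "cfg_weight": cfg_weight,
    }
    for name, value in numeric.items():
        low, high = LIMITS[name]
        if not low <= value <= high:
            raise ValueError(
                f"{name}={value} is outside the documented range {low}-{high}"
            )

    payload: Dict[str, Any] = {
        "text": text,
        "language": language,
        "exaggeration": exaggeration,
        "temperature": temperature,
        "cfg_weight": cfg_weight,
        "seed": seed,
    }
    # reference_audio is optional; leaving it out selects the default voice.
    if reference_audio is not None:
        payload["reference_audio"] = reference_audio
    return payload
```

With the `replicate` Python client installed and an API token configured, the payload could then be passed as `replicate.run("resemble-ai/chatterbox-multilingual:9cfba4c2", input=build_input("Hello, world"))`. The version ID is written here exactly as it appears on this page; the API may expect the full version hash.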
Output schema
The shape of the response you’ll get when you run this model with an API.
Schema
{"format": "uri", "title": "Output", "type": "string"}
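Since the schema describes the output as a single string in URI format, a caller would typically check the value and then download the file it points to. The sketch below shows one way to do that with the standard library only. The function names are hypothetical, and it assumes the client hands back the URI as a plain string, as the schema states; the schema does not specify the audio file type, so the fallback file name carries no extension.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse
from urllib.request import urlretrieve


def is_uri(value: object) -> bool:
    """Loose check that value is a string with a scheme and a host."""
    if not isinstance(value, str):
        return False
    parts = urlparse(value)
    return bool(parts.scheme and parts.netloc)


def local_name(uri: str, fallback: str = "output") -> str:
    """Take the last path segment of the URI as a file name.

    Falls back to a generic name when the path has no final segment.
    """
    name = PurePosixPath(urlparse(uri).path).name
    return name or fallback


def save_output(uri: str, directory: str = ".") -> str:
    """Download the file at uri into directory and return the local path."""
    if not is_uri(uri):
        raise ValueError(f"expected a URI string, got {uri!r}")
    target = f"{directory}/{local_name(uri)}"
    urlretrieve(uri, target)  # performs a network request
    return target
```

Calling `save_output(output)` on the model's response would write the generated audio to the current directory under the name taken from the URI.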