
resemble-ai/chatterbox-multilingual:9cfba4c2

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field, its default value will be used.

| Field | Type | Default value | Description |
|---|---|---|---|
| text | string | | Text to synthesize into speech (maximum 300 characters) |
| language | None | en | Language for synthesis |
| reference_audio | string | | Reference audio file for voice cloning (optional). If not provided, the default voice for the selected language is used. |
| exaggeration | number | 0.5 (min 0.25, max 2) | Controls speech expressiveness (0.25-2.0, neutral=0.5, extreme values may be unstable) |
| temperature | number | 0.8 (min 0.05, max 5) | Controls randomness in generation (0.05-5.0, higher=more varied) |
| cfg_weight | number | 0.5 (min 0.2, max 1) | CFG/Pace weight controlling generation guidance (0.2-1.0). Use 0.5 for balanced results, 0 for language transfer |
| seed | integer | 0 | Random seed for reproducible results (0 for random generation) |
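
As a rough illustration of how the fields listed above fit together, the sketch below builds an input dictionary and checks it against the listed limits before it is sent anywhere. The helper name `build_input` and the `NUMERIC_FIELDS` table are hypothetical and not part of the model's API; the limits and defaults are copied from the schema, and fields left out of the dictionary are assumed to fall back to their defaults on the server side.

```python
from typing import Any, Dict, Optional

# (min, max) limits copied from the input schema above.
NUMERIC_FIELDS = {
    "exaggeration": (0.25, 2.0),
    "temperature": (0.05, 5.0),
    "cfg_weight": (0.2, 1.0),
}
MAX_TEXT_CHARS = 300  # "maximum 300 characters" per the schema


def build_input(
    text: str,
    language: str = "en",
    reference_audio: Optional[str] = None,
    seed: int = 0,
    **numeric: float,
) -> Dict[str, Any]:
    """Hypothetical helper: assemble and range-check an input payload."""
    if len(text) > MAX_TEXT_CHARS:
        raise ValueError(f"text has {len(text)} characters, limit is {MAX_TEXT_CHARS}")

    payload: Dict[str, Any] = {"text": text, "language": language, "seed": seed}

    # reference_audio is optional; omit it to use the default voice.
    if reference_audio is not None:
        payload["reference_audio"] = reference_audio

    for name, value in numeric.items():
        if name not in NUMERIC_FIELDS:
            raise KeyError(f"unknown numeric field: {name}")
        low, high = NUMERIC_FIELDS[name]
        # Note: the listed minimum for cfg_weight is 0.2, so 0 is rejected
        # here even though the description mentions 0 for language transfer.
        if not low <= value <= high:
            raise ValueError(f"{name}={value} is outside [{low}, {high}]")
        payload[name] = value

    return payload
```

The returned dictionary is meant to be passed as the input object of whichever API client you use to run the model; only the fields you set explicitly are included.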

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{'format': 'uri', 'title': 'Output', 'type': 'string'}
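
The schema says the output is a single string in URI format, presumably pointing at the generated audio. A minimal sketch of a sanity check on such a value follows; the function name `check_output` is hypothetical, and the check only confirms that the value is a string with a URI scheme, not that the resource exists.

```python
from urllib.parse import urlparse


def check_output(value: object) -> str:
    """Hypothetical check that a model output matches the 'string, format uri' schema."""
    if not isinstance(value, str):
        raise TypeError(f"expected a string, got {type(value).__name__}")
    parsed = urlparse(value)
    # A URI needs at least a scheme (for example https or data).
    if not parsed.scheme:
        raise ValueError(f"not a URI: {value!r}")
    return value
```

Once the value passes this check, it can be handed to any HTTP client to download the audio.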