You're looking at a specific version of this model. Jump to the model overview.
minimax /speech-02-turbo:43b17801
Input schema
The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.
| Field | Type | Default value | Description |
|---|---|---|---|
| text |
string
|
Text to convert to speech. Every character is 1 token. Maximum 5000 characters. Use <#x#> between words to control pause duration (0.01-99.99s).
|
|
| voice_id |
string
|
Wise_Woman
|
Desired voice ID. Use a voice ID you have trained (https://replicate.com/minimax/voice-cloning), or one of the following system voice IDs: Wise_Woman, Friendly_Person, Inspirational_girl, Deep_Voice_Man, Calm_Woman, Casual_Guy, Lively_Girl, Patient_Man, Young_Knight, Determined_Man, Lovely_Girl, Decent_Boy, Imposing_Manner, Elegant_Man, Abbess, Sweet_Girl_2, Exuberant_Girl
|
| speed |
number
|
1
Min: 0.5 Max: 2 |
Speech speed
|
| volume |
number
|
1
Max: 10 |
Speech volume
|
| pitch |
integer
|
0
Min: -12 Max: 12 |
Speech pitch
|
| emotion |
None
|
auto
|
Speech emotion
|
| english_normalization |
boolean
|
False
|
Enable English text normalization for better number reading (slightly increases latency)
|
| sample_rate |
None
|
32000
|
Sample rate for the generated speech
|
| bitrate |
None
|
128000
|
Bitrate for the generated speech
|
| channel |
None
|
mono
|
Number of audio channels
|
| language_boost |
None
|
None
|
Enhance recognition of specific languages and dialects
|
Output schema
The shape of the response you’ll get when you run this model with an API.
Schema
{'format': 'uri', 'title': 'Output', 'type': 'string'}