You're looking at a specific version of this model. Jump to the model overview.

minimax /speech-02-turbo:43b17801

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.

Field Type Default value Description
text
string
Text to convert to speech. Every character is 1 token. Maximum 5000 characters. Use <#x#> between words to control pause duration (0.01-99.99s).
voice_id
string
Wise_Woman
Desired voice ID. Use a voice ID you have trained (https://replicate.com/minimax/voice-cloning), or one of the following system voice IDs: Wise_Woman, Friendly_Person, Inspirational_girl, Deep_Voice_Man, Calm_Woman, Casual_Guy, Lively_Girl, Patient_Man, Young_Knight, Determined_Man, Lovely_Girl, Decent_Boy, Imposing_Manner, Elegant_Man, Abbess, Sweet_Girl_2, Exuberant_Girl
speed
number
1

Min: 0.5

Max: 2

Speech speed
volume
number
1

Max: 10

Speech volume
pitch
integer
0

Min: -12

Max: 12

Speech pitch
emotion
None
auto
Speech emotion
english_normalization
boolean
False
Enable English text normalization for better number reading (slightly increases latency)
sample_rate
None
32000
Sample rate for the generated speech
bitrate
None
128000
Bitrate for the generated speech
channel
None
mono
Number of audio channels
language_boost
None
None
Enhance recognition of specific languages and dialects

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{'format': 'uri', 'title': 'Output', 'type': 'string'}