minimax/speech-2.6-hd:521e73cd | Run with an API on Replicate

These are the input fields you can use to run this model with an API. If you don’t give a value for a field, its default value is used.

Field	Type	Default value	Description
emotion	None	auto	Speech emotion
sample_rate	None	32000	Sample rate for the generated speech
bitrate	None	128000	Bitrate for the generated speech
audio_format	None	mp3	Audio format for the generated speech
channel	None	mono	Number of audio channels
output_format	None	hex	Transport format for the generated audio payload
language_boost	None	None	Enhance recognition of specific languages and dialects
text	string		Text to convert to speech. Each character counts as 1 token; the maximum is 10000 characters. Insert <#x#> between words to add a pause of x seconds (0.01-99.99).
pitch	integer	0 (min: -12, max: 12)	Speech pitch
speed	number	1 (min: 0.5, max: 2)	Speech speed
volume	number	1 (max: 10)	Speech volume
voice_id	string	Wise_Woman	Desired voice ID. Use a voice ID you have trained (https://replicate.com/minimax/voice-cloning), or one of the following system voice IDs: Wise_Woman, Friendly_Person, Inspirational_girl, Deep_Voice_Man, Calm_Woman, Casual_Guy, Lively_Girl, Patient_Man, Young_Knight, Determined_Man, Lovely_Girl, Decent_Boy, Imposing_Manner, Elegant_Man, Abbess, Sweet_Girl_2, Exuberant_Girl
subtitle_enable	boolean	False	Return subtitle metadata alongside audio (non-streaming only)
english_normalization	boolean	False	Enable English text normalization for better number reading (slightly increases latency)
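
As a rough illustration of how the limits in the table fit together, the Python sketch below builds an input payload and checks it locally before it is sent. The helper names `pause` and `build_input` are hypothetical and not part of any Replicate client. The sketch also assumes that pause markers count toward the 10000-character limit and that two decimal places are acceptable in a marker; the table confirms neither.

```python
# Hypothetical helpers for illustration; not part of the Replicate client.

MAX_TEXT_CHARS = 10000  # "maximum 10000 characters" from the field table


def pause(seconds: float) -> str:
    """Return a <#x#> pause marker for a duration in seconds."""
    if not 0.01 <= seconds <= 99.99:
        raise ValueError("pause must be between 0.01 and 99.99 seconds")
    # Assumption: two decimal places are accepted inside the marker.
    return f"<#{seconds:.2f}#>"


def build_input(text, *, voice_id="Wise_Woman", pitch=0, speed=1.0,
                volume=1.0, **extra):
    """Check values against the documented ranges and return an input dict."""
    if not text:
        raise ValueError("text is required")
    # Assumption: pause markers count toward the character limit.
    if len(text) > MAX_TEXT_CHARS:
        raise ValueError(f"text exceeds {MAX_TEXT_CHARS} characters")
    if not isinstance(pitch, int) or not -12 <= pitch <= 12:
        raise ValueError("pitch must be an integer from -12 to 12")
    if not 0.5 <= speed <= 2:
        raise ValueError("speed must be between 0.5 and 2")
    if volume > 10:  # the table lists only a maximum for volume
        raise ValueError("volume must be at most 10")
    return {
        "text": text,
        "voice_id": voice_id,
        "pitch": pitch,
        "speed": speed,
        "volume": volume,
        **extra,  # other fields from the table, e.g. subtitle_enable=True
    }


payload = build_input(
    f"Hello there. {pause(0.5)} Welcome back.",
    voice_id="Calm_Woman",
    speed=1.25,
)
```

With the `replicate` Python package installed and `REPLICATE_API_TOKEN` set, a payload like this could then be passed as `replicate.run("minimax/speech-2.6-hd", input=payload)`; the full version identifier is left out here because the page shows only a shortened one.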

This is the shape of the response you’ll get when you run this model with an API.

Schema

{"format": "uri", "title": "Output", "type": "string"}
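
Since the schema describes the output as a single string in URI format, a caller would typically validate that string and then download the audio. The Python sketch below shows one way to do the validation and pick a local filename. The helper name `output_filename` and the `speech.mp3` fallback are hypothetical, and the sketch assumes the URI is an HTTP(S) URL, which the schema does not guarantee.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse


def output_filename(output_uri: str, default: str = "speech.mp3") -> str:
    """Validate the output URI and derive a local filename from its path."""
    parsed = urlparse(output_uri)
    # Assumption: the model returns an HTTP(S) URL rather than another scheme.
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"unexpected output URI: {output_uri!r}")
    # Use the last path segment, falling back to the default if it is empty.
    return PurePosixPath(parsed.path).name or default
```

The file itself could then be fetched with the standard library, for example `urllib.request.urlretrieve(output_uri, output_filename(output_uri))`.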