You're looking at a specific version of this model. Jump to the model overview.

minimax /voice-cloning:aa25ee12

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.

Field Type Default value Description
voice_file
string
Voice file to clone. Must be MP3, M4A, or WAV format, 10s to 5min duration, and less than 20MB.
need_noise_reduction
boolean
False
Enable noise reduction. Use this if the voice file has background noise.
model
None
speech-02-turbo
The text-to-speech model to train
accuracy
number
0.7

Max: 1

Text validation accuracy threshold (0-1)
need_volume_normalization
boolean
False
Enable volume normalization

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{'properties': {'model': {'title': 'Model', 'type': 'string'},
                'preview': {'format': 'uri',
                            'title': 'Preview',
                            'type': 'string'},
                'voice_id': {'title': 'Voice Id', 'type': 'string'}},
 'required': ['voice_id', 'preview', 'model'],
 'title': 'VoiceCloningOutputs',
 'type': 'object'}