You're looking at a specific version of this model. Jump to the model overview.

ahm3texe /test999:6e1bd175

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.

Field Type Default value Description
input_audio
string
Upload your audio file here.
rvc_model
string (enum)
Obama

Options:

Trump, Sandy, Rogan, Obama, CUSTOM

RVC model for a specific voice. If using a custom model, this should match the name of the downloaded model. If a 'custom_rvc_model_download_url' is provided, this will be automatically set to the name of the downloaded model.
custom_rvc_model_download_url
string
URL to download a custom RVC model. If provided, the model will be downloaded (if it doesn't already exist) and used for prediction, regardless of the 'rvc_model' value.
pitch_change
number
0
Adjust pitch of AI vocals in semitones. Use positive values to increase pitch, negative to decrease.
index_rate
number
0.5

Max: 1

Control how much of the AI's accent to leave in the vocals.
filter_radius
integer
3

Max: 7

If >=3: apply median filtering to the harvested pitch results.
rms_mix_rate
number
0.25

Max: 1

Control how much to use the original vocal's loudness (0) or a fixed loudness (1).
f0_method
string (enum)
rmvpe

Options:

rmvpe, mangio-crepe

Pitch detection algorithm. 'rmvpe' for clarity in vocals, 'mangio-crepe' for smoother vocals.
crepe_hop_length
integer
128
When `f0_method` is set to `mangio-crepe`, this controls how often it checks for pitch changes in milliseconds.
protect
number
0.33

Max: 0.5

Control how much of the original vocals' breath and voiceless consonants to leave in the AI vocals. Set 0.5 to disable.
output_format
string (enum)
mp3

Options:

mp3, wav

wav for best quality and large file size, mp3 for decent quality and small file size.

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{'format': 'uri', 'title': 'Output', 'type': 'string'}