You're looking at a specific version of this model. Jump to the model overview.

zsxkib /realistic-voice-cloning:a0076ea1

Input

file

Upload your audio file here.

string
Shift + Return to add a new line

RVC model for a specific voice. If using a custom model, this should match the 'custom_rvc_model_download_name'.

Default: "Squidward"

string
Shift + Return to add a new line

URL to download a custom RVC model. To use the downloaded model, 'rvc_model' should be set to the same value as 'custom_rvc_model_download_name'.

string
Shift + Return to add a new line

The name of the custom RVC model. This should match the 'rvc_model' if you want to use the downloaded model.

string

Adjust pitch of AI vocals. Options: `no-change`, `male-to-female`, `female-to-male`.

Default: "no-change"

number
(minimum: 0, maximum: 1)

Control how much of the AI's accent to leave in the vocals.

Default: 0.5

integer
(minimum: 0, maximum: 7)

If >=3: apply median filtering median filtering to the harvested pitch results.

Default: 3

number
(minimum: 0, maximum: 1)

Control how much to use the original vocal's loudness (0) or a fixed loudness (1).

Default: 0.25

string

Best option is rmvpe (clarity in vocals), then mangio-crepe (smoother vocals).

Default: "rmvpe"

integer

When `pitch_detection_algo` is set to `mangio-crepe`, this controls how often it checks for pitch changes in milliseconds. Lower values lead to longer conversions and higher risk of voice cracks, but better pitch accuracy.

Default: 128

number
(minimum: 0, maximum: 0.5)

Control how much of the original vocals' breath and voiceless consonants to leave in the AI vocals. Set 0.5 to disable.

Default: 0.33

number

Control volume of main AI vocals. Use -3 to decrease the volume by 3 decibels, or 3 to increase the volume by 3 decibels.

Default: 0

number

Control volume of backup AI vocals.

Default: 0

number

Control volume of the background music/instrumentals.

Default: 0

number

Change pitch/key of background music, backup vocals and AI vocals in semitones. Reduces sound quality slightly.

Default: 0

number
(minimum: 0, maximum: 1)

The larger the room, the longer the reverb time.

Default: 0.15

number
(minimum: 0, maximum: 1)

Level of AI vocals with reverb.

Default: 0.2

number
(minimum: 0, maximum: 1)

Level of AI vocals without reverb.

Default: 0.8

number
(minimum: 0, maximum: 1)

Absorption of high frequencies in the reverb.

Default: 0.7

string

wav for best quality and large file size, mp3 for decent quality and small file size.

Default: "mp3"

Output

No output yet! Press "Submit" to start a prediction.