You're looking at a specific version of this model. Jump to the model overview.

adirik /hierspeechpp:f49be509

Input

string
Shift + Return to add a new line

Text input to the model. If provided, it will be used for the speech content of the output.

file

Sound input to the model, wav or mp3 file. If provided, it will be used for the speech content of the output.

*file

A voice clip containing the speaker to synthesize, , wav or mp3 file.

number
(minimum: 0, maximum: 1)

Noise control. 0 means no noise reduction, 1 means maximum noise reduction. If noise reduction is desired, it is recommended to set this value to 0.6~0.8

Default: 0

number
(minimum: 0, maximum: 1)

Temperature for text-to-vector model. Larger value corresponds to slightly more random output.

Default: 0.33

number
(minimum: 0, maximum: 1)

Temperature for the voice conversion model. Larger value corresponds to slightly more random output.

Default: 0.33

integer

Sample rate of the output audio file.

Default: 16000

boolean

Scale normalization. If set to true, the output audio will be scaled according to the input sound if provided.

Default: false

integer

Random seed to use for reproducibility.

Output

No output yet! Press "Submit" to start a prediction.