You're looking at a specific version of this model. Jump to the model overview.

adirik /hierspeechpp:4e41ecdd

Input

string
Shift + Return to add a new line

Text input to the model. If provided, it will be used for the speech content of the output.

file

Sound input to the model. If provided, it will be used for the speech content of the output.

*file

A voice clip containing the speaker to synthesize

number
(minimum: 0, maximum: 1)

Noise control. 0 means no noise reduction, 1 means maximum noise reduction. If noise reduction is desired, it is recommended to set this value to 0.6~0.8

Default: 0

number
(minimum: 0, maximum: 1)

Temperature for text-to-vector model. Larger value corresponds to slightly more random output.

Default: 0.33

number
(minimum: 0, maximum: 1)

Temperature for the voice conversion model. Larger value corresponds to slightly more random output.

Default: 0.33

integer

Sample rate of the output audio file

Default: 16000

boolean

Scale normalization. If set to true, the output audio will be scaled according to the input sound if provided.

Default: false

integer

Random seed to use for reproducibility

Output

No output yet! Press "Submit" to start a prediction.