You're looking at a specific version of this model. Jump to the model overview.

suminhthanh /vixtts:29b957e2

Input

string
Shift + Return to add a new line

Text to synthesize

Default: "Xin chào các bạn"

*file

Original speaker audio (wav, mp3, m4a, ogg, or flv). Duration should be at least 6 seconds.

string

Output language for the synthesised speech

Default: "vi"

boolean

Whether to apply denoising to the speaker audio (microphone recordings)

Default: false

boolean

Whether to use deepfilter

Default: false

boolean

Whether to normalize the text

Default: false

Output

No output yet! Press "Submit" to start a prediction.