• Public
  • 22 runs

Input

*file

the reference audio to be copied

string

the word-for-word transcription of the reference audio

Default: ""

*string

the text for which audio needs to be generated

boolean

remove silences from the generated audio

Default: false

number
(minimum: 0.3, maximum: 2)

Default: 1

integer
(minimum: 4, maximum: 64)

Number of denoising steps

Default: 32

number
(minimum: 0.3, maximum: 2)

The speed-up factor of the generated audio

Default: 1

string

An enumeration.

Default: "E2-TTS"
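
The types, ranges, and defaults listed above can be checked on the client before a prediction is submitted. The following is a minimal sketch, not the model's own code: the page does not show the parameter names, so every field name here (`ref_audio`, `ref_text`, `gen_text`, `remove_silence`, `steps`, `speed`, `model`) and the helper `build_input` are hypothetical. The number field listed without a description is left out.

```python
from typing import Any, Dict

# Hypothetical field names: the page lists types, ranges, and defaults,
# but not the actual parameter names.
DEFAULTS: Dict[str, Any] = {
    "ref_text": "",           # transcription of the reference audio
    "remove_silence": False,  # remove silences from the generated audio
    "steps": 32,              # number of denoising steps (4 to 64)
    "speed": 1.0,             # speed-up factor (0.3 to 2)
    "model": "E2-TTS",        # enumeration; only the default is known
}


def build_input(ref_audio: str, gen_text: str, **overrides: Any) -> Dict[str, Any]:
    """Merge the required fields with the defaults and check the listed ranges."""
    if not ref_audio:
        raise ValueError("ref_audio (the reference audio file) is required")
    if not gen_text:
        raise ValueError("gen_text (the text to synthesize) is required")

    unknown = set(overrides) - set(DEFAULTS)
    if unknown:
        raise ValueError(f"unknown fields: {sorted(unknown)}")

    payload = {**DEFAULTS, **overrides, "ref_audio": ref_audio, "gen_text": gen_text}

    steps = payload["steps"]
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(steps, bool) or not isinstance(steps, int) or not 4 <= steps <= 64:
        raise ValueError("steps must be an integer from 4 to 64")
    if not 0.3 <= payload["speed"] <= 2:
        raise ValueError("speed must be between 0.3 and 2")
    return payload
```

Calling `build_input("ref.wav", "Hello there", speed=1.5)` returns a dictionary with the remaining fields set to their defaults, while an out-of-range value such as `steps=2` raises `ValueError` before anything is sent.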

Run time and cost

This model runs on Nvidia L40S GPU hardware. We don't yet have enough runs of this model to provide performance information.

Readme

This model doesn't have a readme.