You're looking at a specific version of this model. Jump to the model overview.

afiaka87 /tortoise-tts:e9658de4

Input

string
Shift + Return to add a new line

Text to speak.

Default: "The expressiveness of autoregressive transformers is literally nuts! I absolutely adore them."

string

Selects the voice to use for generation. Use `random` to select a random voice. Use `custom_voice` to use a custom voice.

Default: "random"

file

(Optional) Create a custom voice based on an mp3 file of a speaker. Audio should be at least 15 seconds, only contain one speaker, and be in mp3 format. Overrides the `voice_a` input.

string

(Optional) Create new voice from averaging the latents for `voice_a`, `voice_b` and `voice_c`. Use `disabled` to disable voice mixing.

Default: "disabled"

string

(Optional) Create new voice from averaging the latents for `voice_a`, `voice_b` and `voice_c`. Use `disabled` to disable voice mixing.

Default: "disabled"

string

Which voice preset to use. See the documentation for more information.

Default: "fast"

integer

Random seed which can be used to reproduce results.

Default: 0

number
(minimum: 0, maximum: 1)

How much the CVVP model should influence the output. Increasing this can in some cases reduce the likelyhood of multiple speakers. Defaults to 0 (disabled)

Default: 0

Output

Video Player is loading.
Current Time 00:00:000
Duration 00:00:000
Loaded: 0%
Stream Type LIVE
Remaining Time 00:00:000
 
1x
Generated in