You're looking at a specific version of this model. Jump to the model overview.
afiaka87 /tortoise-tts:e9658de4
Input schema
The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.
Field | Type | Default value | Description |
---|---|---|---|
text |
string
|
The expressiveness of autoregressive transformers is literally nuts! I absolutely adore them.
|
Text to speak.
|
voice_a |
string
(enum)
|
random
Options: angie, cond_latent_example, deniro, freeman, halle, lj, myself, pat2, snakes, tom, train_daws, train_dreams, train_grace, train_lescault, weaver, applejack, daniel, emma, geralt, jlaw, mol, pat, rainbow, tim_reynolds, train_atkins, train_dotrice, train_empire, train_kennard, train_mouse, william, random, custom_voice, disabled |
Selects the voice to use for generation. Use `random` to select a random voice. Use `custom_voice` to use a custom voice.
|
custom_voice |
string
|
(Optional) Create a custom voice based on an mp3 file of a speaker. Audio should be at least 15 seconds, only contain one speaker, and be in mp3 format. Overrides the `voice_a` input.
|
|
voice_b |
string
(enum)
|
disabled
Options: angie, cond_latent_example, deniro, freeman, halle, lj, myself, pat2, snakes, tom, train_daws, train_dreams, train_grace, train_lescault, weaver, applejack, daniel, emma, geralt, jlaw, mol, pat, rainbow, tim_reynolds, train_atkins, train_dotrice, train_empire, train_kennard, train_mouse, william, random, custom_voice, disabled |
(Optional) Create new voice from averaging the latents for `voice_a`, `voice_b` and `voice_c`. Use `disabled` to disable voice mixing.
|
voice_c |
string
(enum)
|
disabled
Options: angie, cond_latent_example, deniro, freeman, halle, lj, myself, pat2, snakes, tom, train_daws, train_dreams, train_grace, train_lescault, weaver, applejack, daniel, emma, geralt, jlaw, mol, pat, rainbow, tim_reynolds, train_atkins, train_dotrice, train_empire, train_kennard, train_mouse, william, random, custom_voice, disabled |
(Optional) Create new voice from averaging the latents for `voice_a`, `voice_b` and `voice_c`. Use `disabled` to disable voice mixing.
|
preset |
string
(enum)
|
fast
Options: ultra_fast, fast, standard, high_quality |
Which voice preset to use. See the documentation for more information.
|
seed |
integer
|
0
|
Random seed which can be used to reproduce results.
|
cvvp_amount |
number
|
0
Max: 1 |
How much the CVVP model should influence the output. Increasing this can in some cases reduce the likelyhood of multiple speakers. Defaults to 0 (disabled)
|
Output schema
The shape of the response you’ll get when you run this model with an API.
{'format': 'uri', 'title': 'Output', 'type': 'string'}