ttsds / whisperspeech

  • Public
  • 1K runs
  • A100 (80GB)

Input

Video Player is loading.
Current Time 00:00:000
Duration 00:00:000
Loaded: 0%
Stream Type LIVE
Remaining Time 00:00:000
 
1x
*string
Shift + Return to add a new line

Text to synthesize

*file

Reference audio file

string

Version of the model to use

Default: "small"

string

Language of the text

Default: "en"

Output

Video Player is loading.
Current Time 00:00:000
Duration 00:00:000
Loaded: 0%
Stream Type LIVE
Remaining Time 00:00:000
 
1x
Generated in

This example was created by a different version, ttsds/whisperspeech:e7a3743e.

Run time and cost

This model costs approximately $0.020 to run on Replicate, or 50 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.

This model runs on Nvidia A100 (80GB) GPU hardware. Predictions typically complete within 14 seconds.

Readme

This model doesn't have a readme.