camenduru / metavoice

MetaVoice-1B: 1.2B parameter base model trained on 100K hours of speech

  • Public
  • 11.8K runs
  • L40S
  • GitHub
  • License

Input

*file

Input Image

string

Default: "This is a demo of text to speech by MetaVoice-1B, an open-source foundational audio model by MetaVoice."

Output

Run time and cost

This model costs approximately $0.091 per run on Replicate (about 10 runs per $1), though the cost varies with your inputs. It is also open source, and you can run it on your own computer with Docker.

This model runs on Nvidia L40S GPU hardware. Predictions typically complete within 94 seconds. The predict time for this model varies significantly based on the inputs.
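
The arithmetic behind these figures can be sketched as follows. This is only an illustration built from the two approximate numbers quoted above ($0.091 per run and 94 seconds per prediction). The function names are hypothetical, and the per-second rate is an inference that assumes both numbers describe the same typical run; it is not a published price.

```python
import math

# Approximate figures quoted on this page; both vary with inputs.
COST_PER_RUN_USD = 0.091
TYPICAL_RUN_SECONDS = 94


def runs_per_dollar(cost_per_run: float) -> int:
    """Number of whole runs that fit in a $1 budget."""
    return math.floor(1.0 / cost_per_run)


def implied_rate_per_second(cost_per_run: float, seconds: float) -> float:
    """Per-second price implied by one typical run (an assumption, not a quoted rate)."""
    return cost_per_run / seconds


def estimate_cost(num_runs: int, cost_per_run: float = COST_PER_RUN_USD) -> float:
    """Rough total cost for a batch of runs at the typical per-run cost."""
    return num_runs * cost_per_run


if __name__ == "__main__":
    print(runs_per_dollar(COST_PER_RUN_USD))
    print(implied_rate_per_second(COST_PER_RUN_USD, TYPICAL_RUN_SECONDS))
    print(estimate_cost(100))
```

Because predict time varies significantly with the inputs, an estimate like this is best treated as a rough budget figure for typical runs, not a bound.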