cjwbw / melotts

High-quality multilingual text-to-speech library

  • Public
  • 334 runs

Run time and cost

This model runs on NVIDIA T4 GPU hardware. Predictions typically complete within 3 minutes, though prediction time varies significantly with the inputs.

Readme

MeloTTS

MeloTTS is a high-quality multilingual text-to-speech library by MyShell.ai.

License

This library is released under the MIT License, which means it is free for both commercial and non-commercial use.

Acknowledgements

This implementation is based on TTS, VITS, VITS2 and Bert-VITS2. We appreciate their awesome work.