cuuupid / zonos

Zonos-v0.1 beta, a SOTA text-to-speech Transformer model with extraordinary expressive range, built by Zyphra. (Updated 4 months, 1 week ago)

  • Public
  • 260 runs

Create training

Trainings for this model run on NVIDIA L40S GPU hardware, which costs $0.000975 per second (about $3.51 per hour). After you create a training, you are redirected to the training detail page, where you can monitor its progress and, once it finishes, download the weights and run the trained model.
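As a rough illustration of the pricing above, the following sketch converts a training duration into an estimated charge. Only the per-second rate comes from this page. The function name and the assumption that billing is a simple product of seconds and rate are illustrative, and actual billing may differ.

```python
# Per-second rate for L40S hardware, as quoted on this page.
L40S_USD_PER_SECOND = 0.000975


def training_cost_usd(seconds: float, rate: float = L40S_USD_PER_SECOND) -> float:
    """Estimate the cost of a training run, assuming cost = seconds * rate."""
    if seconds < 0:
        raise ValueError("seconds must be non-negative")
    return seconds * rate


# One hour of training at the quoted rate.
hourly = training_cost_usd(3600)
print(f"1 hour: ${hourly:.2f}")  # prints "1 hour: $3.51"
```

The hourly figure is just 3600 × 0.000975, so a run's estimated cost scales linearly with how long it takes.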

Note: versions of this model with fast booting use the hardware set by the base model they were trained from.

If you haven't yet trained a model on Replicate, read Replicate's training guides first.

destination (string, required)

Select the model on Replicate that will be the destination for the trained version. If the model does not exist yet, select the "Create model" option and a field appears where you can enter the name of the new model. The model is then created for you when you create the training.

Audio with voice to mimic (file)

Existing weights to absorb (file)
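The form above maps onto a create-training request against Replicate's HTTP API. The sketch below only builds the URL and JSON body and sends nothing. The input keys `audio` and `weights` are hypothetical stand-ins, because this page shows only each field's type and description, not its name. `VERSION_ID`, the destination name, and the sample URL are placeholders as well.

```python
import json

API_ROOT = "https://api.replicate.com/v1"


def build_training_request(owner, name, version_id, destination,
                           audio_url, weights_url=None):
    """Return (url, json_body) for a create-training call. Nothing is sent."""
    if "/" not in destination:
        raise ValueError("destination must look like 'owner/model-name'")

    # HYPOTHETICAL input keys: the real field names are not shown on this
    # page, so "audio" and "weights" are assumptions for illustration only.
    training_input = {"audio": audio_url}
    if weights_url is not None:
        training_input["weights"] = weights_url

    url = f"{API_ROOT}/models/{owner}/{name}/versions/{version_id}/trainings"
    body = json.dumps({"destination": destination, "input": training_input})
    return url, body


# Placeholder values; replace them with a real version ID and your own model.
url, body = build_training_request(
    "cuuupid", "zonos", "VERSION_ID",
    destination="your-username/your-zonos-voice",
    audio_url="https://example.com/voice-sample.wav",
)
print(url)
print(body)
```

To actually start a training, the body would be sent as a POST to that URL with an `Authorization: Bearer <token>` header. Check the model's training schema for the real input names before doing so.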