lightweight text-to-speech (TTS) model, trained on 10.5K hours of audio data (Updated 1 year, 2 months ago)