SoTA Zero Shot Voice Cloning and TTS model
SoTA depth estimation
Kokoro is a frontier TTS model for its size of 82 million parameters (text in/audio out).