Generate subtitles (
.vtt) from audio files using OpenAI's Whisper models.
Using faster-whisper, a reimplementation of OpenAI's Whisper model using CTranslate2, which is a fast inference engine for Transformer models.
This is a fork of m1guelpf/whisper-subtitles with added support for VAD, selecting a language, use the language specific models and download the
.srt files directly from the result.