ASR with word alignment based on whisperx using whisper medium (769M)
Want to make some of these yourself?