Whisper-Large-V2 + Pyannote 3.0 diarization via WhisperX