Readme
Incredibly Fast Whisper
Powered by 🤗 Transformers, Optimum & flash-attn
TL;DR - Transcribe 150 minutes of audio in 100 seconds - with OpenAI’s Whisper Large v3. Blazingly fast transcription is now a reality!⚡️
| Optimisation type | Time to Transcribe (150 mins of Audio) | 
|---|---|
| Transformers ( fp32) | ~31 (31 min 1 sec) | 
| Transformers ( fp16+batching [24]+bettertransformer) | ~5 (5 min 2 sec) | 
| Transformers ( fp16+batching [24]+Flash Attention 2) | ~2 (1 min 38 sec) | 
| distil-whisper ( fp16+batching [24]+bettertransformer) | ~3 (3 min 16 sec) | 
| distil-whisper ( fp16+batching [24]+Flash Attention 2) | ~1 (1 min 18 sec) | 
| Faster Whisper ( fp16+beam_size [1]) | ~9.23 (9 min 23 sec) | 
| Faster Whisper ( 8-bit+beam_size [1]) | ~8 (8 min 15 sec) | 
