sidedwards / whisperx

WhisperX with accelerated transcription and speaker diarization. It produces fast, accurate transcripts split into speaker-labeled segments.

Run time and cost

This model runs on Nvidia L40S GPU hardware. We don't yet have enough runs of this model to provide performance information.

Readme

Based on: https://github.com/victor-upmeet/whisperx-replicate
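As a rough illustration of what "speaker segments" can look like downstream, here is a minimal Python sketch that merges consecutive segments from the same speaker into turns and prints a readable transcript. The sample data and the field names (`start`, `end`, `text`, `speaker`) are assumptions modeled on the diarized segments produced by the WhisperX library; this model's actual output schema is not documented here and may differ. The helpers `merge_speaker_turns` and `format_transcript` are hypothetical names used only for this example.

```python
from itertools import groupby

# Hypothetical sample output. Field names are an assumption based on the
# WhisperX library's diarized segments; this model's schema may differ.
segments = [
    {"start": 0.0, "end": 2.4, "text": " Hello, thanks for joining.", "speaker": "SPEAKER_00"},
    {"start": 2.4, "end": 4.1, "text": " Let's get started.", "speaker": "SPEAKER_00"},
    {"start": 4.3, "end": 6.0, "text": " Sounds good.", "speaker": "SPEAKER_01"},
]


def merge_speaker_turns(segments):
    """Merge consecutive segments that share a speaker into a single turn."""
    turns = []
    # groupby only groups adjacent items, which is what we want for turns.
    for speaker, group in groupby(segments, key=lambda s: s.get("speaker", "UNKNOWN")):
        group = list(group)
        turns.append({
            "speaker": speaker,
            "start": group[0]["start"],
            "end": group[-1]["end"],
            "text": " ".join(s["text"].strip() for s in group),
        })
    return turns


def format_transcript(turns):
    """Render one line per turn: [start-end] SPEAKER: text."""
    return "\n".join(
        f"[{t['start']:.1f}-{t['end']:.1f}] {t['speaker']}: {t['text']}" for t in turns
    )


turns = merge_speaker_turns(segments)
print(format_transcript(turns))
```

Segments without a `speaker` key fall back to `UNKNOWN`, since diarization may not label every segment.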

Citation

@misc{bain2023whisperx,
      title={WhisperX: Time-Accurate Speech Transcription of Long-Form Audio}, 
      author={Max Bain and Jaesung Huh and Tengda Han and Andrew Zisserman},
      year={2023},
      eprint={2303.00747},
      archivePrefix={arXiv},
      primaryClass={cs.SD}
}

For more information, visit the WhisperX GitHub repository: https://github.com/m-bain/whisperX