erium / whisperx

Automatic Speech Recognition with Word-level Timestamps & Diarization

  • Public
  • 4.2K runs
  • A100 (80GB)
  • GitHub
  • Paper
  • License
Iterate in playground
  • Prediction

    erium/whisperx:73d8393ff3c06a6de2b03634c984912b5981f03b323f13715ceabfba071acd77
    ID
    hduvntbbvadmwa2dcfd3ko5xle
    Status
    Succeeded
    Source
    Web
    Hardware
    T4
    Total duration
    Created

    Input

    audio
    Video Player is loading.
    Current Time 00:00:000
    Duration 00:00:000
    Loaded: 0%
    Stream Type LIVE
    Remaining Time 00:00:000
     
    1x
    debug
    diarize
    language
    de
    batch_size
    32

    Output

    [{"text": " Ihr h\u00f6rt die IRIUM Podcast, der Data Science und Machine Learning Podcast f\u00fcr Young Professionals und Studienabsolventen, die wirklich wissen wollen, was in der Arbeitswelt abgeht.", "start": 0.009, "end": 10.742, "speaker": "SPEAKER_00"}]
    Generated in

Want to make some of these yourself?

Run this model