Readme
This is a re-upload of the model: https://replicate.com/meronym/speaker-diarization however here we use an A100 GPU
Segments an audio recording based on who is speaking (on A100)
This model runs on Nvidia A100 (40GB) GPU hardware. Predictions typically complete within 20 seconds. The predict time for this model varies significantly based on the inputs.
This is a re-upload of the model: https://replicate.com/meronym/speaker-diarization however here we use an A100 GPU