
thomasmol/whisper-diarization:d8bc5908

Input

string

Either provide a Base64-encoded audio file,

string

Or provide a direct audio file URL,

file

Or an audio file

integer
(minimum: 1, maximum: 50)

Number of speakers. Leave empty to autodetect.

boolean

Translate the speech into English.

Default: false

string

Language of the spoken words as a language code such as 'en'. Leave empty to autodetect the language.

string

Vocabulary: provide names, acronyms and loanwords in a list. Use punctuation for best accuracy.
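
The inputs above can be assembled into a single payload before submitting a prediction. The following is a minimal Python sketch, not the model's own client code. This page lists only the types and descriptions of the inputs, so every field name used here (`file_string`, `file_url`, `num_speakers`, `translate`, `language`, `prompt`) is an assumption, as is the rule that exactly one audio source should be set. The uploaded-file option is left out because it depends on the client used to send the request.

```python
import base64
from typing import Any, Dict, Optional


def encode_audio(raw: bytes) -> str:
    """Return the Base64 text form of raw audio bytes."""
    return base64.b64encode(raw).decode("ascii")


def build_input(
    *,
    file_string: Optional[str] = None,   # assumed name: Base64-encoded audio
    file_url: Optional[str] = None,      # assumed name: direct audio file URL
    num_speakers: Optional[int] = None,  # None leaves the field empty (autodetect)
    translate: bool = False,             # matches the documented default
    language: Optional[str] = None,      # language code like 'en'; None to autodetect
    prompt: Optional[str] = None,        # assumed name: vocabulary hint
) -> Dict[str, Any]:
    """Validate the form fields and return a payload dictionary."""
    # Assumption: exactly one audio source is expected.
    sources = [s for s in (file_string, file_url) if s]
    if len(sources) != 1:
        raise ValueError("provide exactly one audio source")

    # The form documents a range of 1 to 50 for the number of speakers.
    if num_speakers is not None and not 1 <= num_speakers <= 50:
        raise ValueError("num_speakers must be between 1 and 50")

    payload: Dict[str, Any] = {"translate": translate}
    optional_fields = {
        "file_string": file_string,
        "file_url": file_url,
        "num_speakers": num_speakers,
        "language": language,
        "prompt": prompt,
    }
    # Omit empty fields so the model can fall back to autodetection.
    payload.update({k: v for k, v in optional_fields.items() if v is not None})
    return payload
```

For example, `build_input(file_url="https://example.com/meeting.wav", num_speakers=2)` returns a dictionary with `translate`, `file_url`, and `num_speakers` only; the URL is a placeholder. Omitting the empty fields mirrors the "leave empty to autodetect" behaviour described for the speaker count and language.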

Output
