thomasmol/whisper-diarization:d606f280

Input

string

Either provide: a Base64-encoded audio file

string

Or provide: A direct audio file URL

file

Or an audio file
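
For the Base64 option, a minimal sketch of producing the encoded string with Python's standard library. The helper name `encode_audio_base64` is hypothetical, and this page does not say whether the model expects a bare Base64 string or a data URI, so treat the bare string here as an assumption.

```python
import base64


def encode_audio_base64(audio_bytes: bytes) -> str:
    """Return the standard Base64 text encoding of raw audio bytes."""
    return base64.b64encode(audio_bytes).decode("ascii")


# Placeholder bytes standing in for the contents of a real audio file.
sample = b"RIFF....WAVEfmt "
encoded = encode_audio_base64(sample)
```

In practice the bytes would come from reading the audio file in binary mode, and the resulting string would be passed as the first input.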

boolean

Group segments from the same speaker that are less than 2 seconds apart

Default: true
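
The grouping rule could look roughly like the sketch below. This is not the model's actual implementation: the segment fields (`speaker`, `start`, `end`, `text`) and the function name `group_segments` are assumptions made for illustration.

```python
def group_segments(segments, max_gap=2.0):
    """Merge consecutive segments from the same speaker when the silence
    between them is shorter than max_gap seconds."""
    merged = []
    for seg in segments:
        prev = merged[-1] if merged else None
        if (
            prev is not None
            and prev["speaker"] == seg["speaker"]
            and seg["start"] - prev["end"] < max_gap
        ):
            # Extend the previous segment instead of starting a new one.
            prev["end"] = seg["end"]
            prev["text"] = prev["text"] + " " + seg["text"]
        else:
            # Copy so the caller's input list is left unmodified.
            merged.append(dict(seg))
    return merged


example = [
    {"speaker": "A", "start": 0.0, "end": 1.0, "text": "Hello"},
    {"speaker": "A", "start": 2.5, "end": 3.0, "text": "there"},
    {"speaker": "B", "start": 3.2, "end": 4.0, "text": "Hi"},
    {"speaker": "A", "start": 4.5, "end": 5.0, "text": "How"},
    {"speaker": "A", "start": 8.0, "end": 9.0, "text": "are you"},
]
grouped = group_segments(example)
```

Here the first two segments merge (gap of 1.5 seconds), while the last two stay separate (gap of 3 seconds).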

integer
(minimum: 1, maximum: 50)

Number of speakers

Default: 2

string

Prompt, to be used as context

Default: "Some people speaking."

integer
(minimum: 0)

Offset in seconds, used for chunked inputs

Default: 0
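
One plausible reading of the offset is that it shifts segment timestamps so that a chunk cut from a longer recording lines up with the original timeline. The sketch below illustrates that reading only. The page does not describe how the model applies the offset, and the segment fields and the name `apply_offset` are hypothetical.

```python
def apply_offset(segments, offset_seconds):
    """Return copies of the segments with start and end shifted by the offset."""
    return [
        {
            **seg,
            "start": seg["start"] + offset_seconds,
            "end": seg["end"] + offset_seconds,
        }
        for seg in segments
    ]


# A chunk that begins 600 seconds into the full recording.
chunk_segments = [{"speaker": "A", "start": 1.5, "end": 4.0, "text": "Welcome back"}]
shifted = apply_offset(chunk_segments, 600)
```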

Output
