You're looking at a specific version of this model. Jump to the model overview.

erium /whisperx:739fba6b

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.

Field Type Default value Description
audio
string
Audio file
batch_size
integer
32

Min: 1

The number of batches that are run in parallel.
diarize
boolean
False
Use this to identify speakers
min_speakers
integer

Min: 1

Minimum number of speakers in case of diarization.
max_speakers
integer

Min: 1

Maximum number of speakers in case of diarization.
language
string (enum)
de

Options:

en, fr, de, es, it, ja, zh, nl, uk, pt

The audio file's language.
debug
boolean
False
Print out memory usage information.

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{'title': 'Output', 'type': 'string'}