viktorfa/whisperx-api:e7079a16
Input schema
The fields you can use to run this model with an API. If you don’t give a value for a field, its default value will be used.
Field | Type | Default value | Description |
---|---|---|---|
audio | string | | Audio file |
model | string (enum) | large-v2 | Whisper model to use. Options: tiny, base, small, medium, large-v2 |
align | boolean | True | Generate word-level timestamps (takes longer) |
num_speakers | integer | 0 | Identify speakers (0 disables diarization) |
lang | string | | Language (empty for automatic detection) |
batch_size | integer | 16 | Batch size |
debug | boolean | False | Print out memory usage information |
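To make the default behaviour concrete, here is a minimal Python sketch that merges caller-supplied values onto the defaults listed in the table. The helper name `build_input` and the example audio URL are hypothetical and not part of the model's API; only the field names, defaults, and model options come from the schema above.

```python
from typing import Any

# Defaults and allowed values copied from the input schema table.
DEFAULTS = {
    "model": "large-v2",
    "align": True,
    "num_speakers": 0,
    "batch_size": 16,
    "debug": False,
}
MODEL_OPTIONS = {"tiny", "base", "small", "medium", "large-v2"}
# "audio" and "lang" have no default value in the table.
NO_DEFAULT_FIELDS = {"lang"}


def build_input(audio: str, **overrides: Any) -> dict:
    """Hypothetical helper: merge overrides onto the documented defaults."""
    unknown = set(overrides) - set(DEFAULTS) - NO_DEFAULT_FIELDS
    if unknown:
        raise ValueError(f"unknown fields: {sorted(unknown)}")
    payload = {"audio": audio, **DEFAULTS, **overrides}
    if payload["model"] not in MODEL_OPTIONS:
        raise ValueError(f"model must be one of {sorted(MODEL_OPTIONS)}")
    return payload


# Placeholder URL; replace it with a real audio file location.
payload = build_input(
    "https://example.com/interview.mp3",
    model="small",
    num_speakers=2,
)
print(payload)
```

A dictionary like `payload` is the kind of value an API client would send as the model input. Leaving `lang` out corresponds to the "empty for automatic" behaviour described in the table.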
Output schema
The shape of the response you’ll get when you run this model with an API.
Schema
{'properties': {'alignResult': {'format': 'uri',
                                'title': 'Alignresult',
                                'type': 'string'},
                'asrResult': {'format': 'uri',
                              'title': 'Asrresult',
                              'type': 'string'},
                'jsonData': {'title': 'Jsondata', 'type': 'string'},
                'text': {'format': 'uri', 'title': 'Text', 'type': 'string'}},
 'required': ['jsonData', 'text', 'asrResult'],
 'title': 'Output',
 'type': 'object'}
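As a rough illustration of how a client might consume a response of this shape, the sketch below checks the three required keys and the URI-formatted fields. The function name `check_output` and the sample values are hypothetical. Note that `alignResult` is not in the `required` list, so the code treats it as optional. Decoding `jsonData` with `json.loads` is an assumption: the schema only says the field is a string.

```python
import json
from urllib.parse import urlparse

# Taken from the 'required' list and the 'format': 'uri' entries in the schema.
REQUIRED = ("jsonData", "text", "asrResult")
URI_FIELDS = ("text", "asrResult", "alignResult")


def check_output(output: dict) -> dict:
    """Hypothetical helper: verify a response matches the output schema."""
    missing = [key for key in REQUIRED if key not in output]
    if missing:
        raise ValueError(f"missing required fields: {missing}")
    for key in URI_FIELDS:
        # alignResult may be absent, so only check URI fields that are present.
        if key in output and not urlparse(output[key]).scheme:
            raise ValueError(f"{key} is not a URI: {output[key]!r}")
    return output


# Made-up sample response; the URLs and the empty JSON body are placeholders.
sample = {
    "jsonData": "{}",
    "text": "https://example.com/output/text.txt",
    "asrResult": "https://example.com/output/asr.json",
}
checked = check_output(sample)

# Assumption: jsonData holds JSON-encoded text.
data = json.loads(checked["jsonData"])
print(type(data).__name__)
```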