victor-upmeet/whisperx:b54d330a
Input schema
The fields you can use to run this model with an API. If you don't give a value for a field, its default value will be used.
Field | Type | Default value | Description |
---|---|---|---|
audio_file | string | | Audio file |
language | string | | ISO code of the language spoken in the audio, specify None to perform language detection |
initial_prompt | string | | Optional text to provide as a prompt for the first window |
batch_size | integer | 64 | Parallelization of input audio transcription |
temperature | number | 0 | Temperature to use for sampling |
debug | boolean | False | Print out compute/inference times and memory usage information. |
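
To make the table concrete, here is a minimal sketch of assembling an input payload in Python. The helper `build_input`, the `DEFAULTS` and `FIELD_TYPES` mappings, and the placeholder URL are hypothetical names introduced for illustration; they are not part of the model or of any client library. The sketch also assumes that omitting `language` is how "None" is expressed, which the table does not confirm.

```python
from typing import Any, Dict

# Defaults copied from the "Default value" column of the table.
DEFAULTS: Dict[str, Any] = {"batch_size": 64, "temperature": 0, "debug": False}

# Accepted Python types per field, mirroring the "Type" column.
FIELD_TYPES: Dict[str, tuple] = {
    "audio_file": (str,),
    "language": (str,),
    "initial_prompt": (str,),
    "batch_size": (int,),
    "temperature": (int, float),
    "debug": (bool,),
}


def build_input(audio_file: str, **fields: Any) -> Dict[str, Any]:
    """Hypothetical helper: merge caller values with the schema defaults."""
    payload: Dict[str, Any] = {"audio_file": audio_file, **DEFAULTS}
    for name, value in fields.items():
        if name not in FIELD_TYPES:
            raise KeyError(f"unknown field: {name}")
        if value is None:
            # Assumption: a field left out of the payload counts as None.
            payload.pop(name, None)
            continue
        payload[name] = value
    for name, value in payload.items():
        expected = FIELD_TYPES[name]
        # bool is a subclass of int in Python, so check it separately.
        bool_mismatch = isinstance(value, bool) != (expected == (bool,))
        if bool_mismatch or not isinstance(value, expected):
            raise TypeError(f"{name} has the wrong type: {type(value).__name__}")
    return payload


# Placeholder URL, not a real audio file.
example = build_input("https://example.com/audio.wav", language="en", batch_size=16)
```

The resulting dictionary is the kind of object you would pass as the model input; fields you leave out fall back to the defaults listed in the table.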
Output schema
The shape of the response you’ll get when you run this model with an API.
Schema
{'properties': {'detected_language': {'title': 'Detected Language',
                                      'type': 'string'},
                'segments': {'title': 'Segments'}},
 'required': ['detected_language'],
 'title': 'ModelOutput',
 'type': 'object'}
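
The schema only guarantees `detected_language` (a required string); `segments` is untyped and may be absent. A minimal sketch of reading a response defensively follows. The function name `parse_output` and the sample dictionaries are hypothetical, and the sketch deliberately treats `segments` as opaque because the schema does not describe its contents.

```python
from typing import Any, Dict


def parse_output(output: Any) -> Dict[str, Any]:
    """Hypothetical helper: check a response against the output schema."""
    if not isinstance(output, dict):
        raise TypeError("output must be an object")
    if "detected_language" not in output:
        raise ValueError("missing required field: detected_language")
    language = output["detected_language"]
    if not isinstance(language, str):
        raise TypeError("detected_language must be a string")
    # 'segments' has no declared type and is not required, so pass it through.
    return {"detected_language": language, "segments": output.get("segments")}


# Made-up sample responses for illustration only.
minimal = parse_output({"detected_language": "en"})
with_segments = parse_output({"detected_language": "fr", "segments": []})
```

Callers should check `segments` for `None` before iterating, since the schema does not require it.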