You're looking at a specific version of this model. Jump to the model overview.
Input schema
The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.
Field | Type | Default value | Description |
---|---|---|---|
audio |
string
|
Audio file
|
|
model |
string
(enum)
|
base
Options: tiny, base, small, medium, large-v1, large-v2 |
Choose a Whisper model.
|
transcription |
string
(enum)
|
plain text
Options: plain text, srt, vtt |
Choose the format for the transcription
|
translate |
boolean
|
False
|
Translate the text to English when set to True
|
Output schema
The shape of the response you’ll get when you run this model with an API.
Schema
{'properties': {'detected_language': {'title': 'Detected Language',
'type': 'string'},
'segments': {'title': 'Segments'},
'srt_file': {'format': 'uri',
'title': 'Srt File',
'type': 'string'},
'transcription': {'title': 'Transcription', 'type': 'string'},
'translation': {'title': 'Translation', 'type': 'string'},
'txt_file': {'format': 'uri',
'title': 'Txt File',
'type': 'string'}},
'required': ['detected_language', 'transcription'],
'title': 'ModelOutput',
'type': 'object'}