You're looking at a specific version of this model. Jump to the model overview.

wordscenes /whisperx:61393f7d

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.

Field Type Default value Description
audio_path
string
Audio to transcribe or align
mode
None
transcribe
Mode: 'transcribe' to generate transcript, 'align' to align provided segments
segments
string
Segments (JSON array with text, start, and end keys) to align with audio (required when mode='align')
language
string
en
Language to transcribe

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{'title': 'Output', 'type': 'string'}