jigsawstack/speech-to-text:b5c38163

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field, its default value will be used.

| Field | Type | Default value | Description |
|---|---|---|---|
| url | string | | The video/audio URL. Not required if file_store_key is specified. |
| file_store_key | string | | The key used to store the video/audio file on JigsawStack File Storage. Not required if url is specified. |
| language | string | | The language to transcribe or translate the file into. If not specified, the model will automatically detect the language. |
| translate | boolean | False | When set to true, translates the content into English (or the specified language if the language parameter is provided). |
| by_speaker | boolean | False | Identifies and separates different speakers in the audio file. |
| webhook_url | string | | Webhook URL to send the result to. When provided, the API will process asynchronously. |
| batch_size | integer | 30 | The batch size to return. This controls how the audio is chunked for processing. Min: 1, Max: 40. |
| api_key | string | | 🔐 Your JigsawStack API Key (required). |
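
The constraints in the input schema (api_key is required, one of url or file_store_key must be present, batch_size stays between 1 and 40) can be checked before any request is sent. The following is a minimal sketch, not part of the model's API: `build_input` is a hypothetical helper name, and it only assembles and validates the input dictionary using the field names and defaults listed above.

```python
from typing import Any, Dict, Optional


def build_input(
    api_key: str,
    url: Optional[str] = None,
    file_store_key: Optional[str] = None,
    language: Optional[str] = None,
    translate: bool = False,
    by_speaker: bool = False,
    webhook_url: Optional[str] = None,
    batch_size: int = 30,
) -> Dict[str, Any]:
    """Assemble an input dict for the model (hypothetical helper)."""
    if not api_key:
        raise ValueError("api_key is required")
    # The schema says each of url / file_store_key is optional only when
    # the other is given, so at least one must be present.
    if not url and not file_store_key:
        raise ValueError("provide either url or file_store_key")
    # Documented bounds: Min 1, Max 40.
    if not 1 <= batch_size <= 40:
        raise ValueError("batch_size must be between 1 and 40")

    payload: Dict[str, Any] = {
        "api_key": api_key,
        "translate": translate,
        "by_speaker": by_speaker,
        "batch_size": batch_size,
    }
    # Leave unset optional fields out so the service applies its own
    # defaults (for example, automatic language detection).
    optional = {
        "url": url,
        "file_store_key": file_store_key,
        "language": language,
        "webhook_url": webhook_url,
    }
    payload.update({k: v for k, v in optional.items() if v is not None})
    return payload
```

For example, `build_input(api_key="...", url="https://example.com/talk.mp3", by_speaker=True)` returns a dictionary that could be passed as the input to whichever client is used to run the model. The URL here is a placeholder.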

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{"title": "Output", "type": "string"}
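
The schema states only that the output is a string. It does not say whether that string is plain transcript text or a serialized JSON document. A defensive sketch that handles both cases is shown here; `parse_output` is a hypothetical helper, and the `"text"` key used for the fallback is an assumption, not a documented field.

```python
import json
from typing import Any, Dict


def parse_output(output: str) -> Dict[str, Any]:
    """Normalize the model's string output into a dict (hypothetical helper)."""
    try:
        parsed = json.loads(output)
    except json.JSONDecodeError:
        # Not JSON: treat the whole string as the transcript.
        return {"text": output}
    # Only a JSON object is treated as structured output; anything else
    # (a bare number, a quoted string, a list) is kept as raw text.
    if isinstance(parsed, dict):
        return parsed
    return {"text": output}
```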