You're looking at a specific version of this model. Jump to the model overview.
Input schema
The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.
| Field | Type | Default value | Description |
|---|---|---|---|
| video |
string
|
Input video file or URL
|
|
| region |
string
|
Subtitle region (format: x_min,y_min,x_max,y_max, e.g., 0,900,1920,1080)
|
|
| center_point |
string
|
Center point coordinate (format: x,y, e.g., 960,1000). Only extract subtitles near this point
|
|
| min_score |
number
|
0.7
Max: 1 |
Minimum OCR confidence threshold
|
| similarity |
number
|
0.7
Max: 1 |
Text similarity threshold for merging subtitles
|
| match_mode |
None
|
contain
|
Region matching mode
|
| target_height |
number
|
Target subtitle font height in pixels (also used as center-radius)
|
|
| height_threshold |
number
|
0.2
Max: 1 |
Font height difference threshold (ratio)
|
| workers |
integer
|
8
Min: 1 Max: 16 |
Number of worker threads for OCR
|
Output schema
The shape of the response you’ll get when you run this model with an API.
Schema
{'title': 'Output', 'type': 'string'}