
aodianyun/ocrsrt:a02ddebf

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field, its default value is used.

| Field | Type | Default value | Range | Description |
| --- | --- | --- | --- | --- |
| video | string | | | Input video file or URL |
| region | string | | | Subtitle region (format: x_min,y_min,x_max,y_max, e.g., 0,900,1920,1080) |
| center_point | string | | | Center point coordinate (format: x,y, e.g., 960,1000). Only subtitles near this point are extracted |
| min_score | number | 0.7 | Max: 1 | Minimum OCR confidence threshold |
| similarity | number | 0.7 | Max: 1 | Text similarity threshold for merging subtitles |
| match_mode | None | contain | | Region matching mode |
| target_height | number | | | Target subtitle font height in pixels (also used as the center radius) |
| height_threshold | number | 0.2 | Max: 1 | Font height difference threshold (ratio) |
| workers | integer | 8 | Min: 1, Max: 16 | Number of worker threads for OCR |
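As a rough illustration of how these fields fit together, the sketch below assembles an input payload and checks it against the limits listed above before anything is sent. The helper names `parse_ints` and `build_input` are hypothetical and not part of the model or of any client library. The integer-coordinate parsing and the `x_min < x_max`, `y_min < y_max` check are assumptions drawn from the example values, not documented rules.

```python
from typing import Any, Dict, Optional, Tuple

# Numeric limits copied from the input schema; None means no bound is listed.
LIMITS = {
    "min_score": (None, 1),
    "similarity": (None, 1),
    "height_threshold": (None, 1),
    "workers": (1, 16),
}
# Optional fields from the schema that carry no listed numeric limit.
OTHER_FIELDS = {"match_mode", "target_height"}


def parse_ints(value: str, count: int) -> Tuple[int, ...]:
    """Parse a comma-separated string such as "0,900,1920,1080" into ints."""
    parts = tuple(int(p) for p in value.split(","))
    if len(parts) != count:
        raise ValueError(f"expected {count} comma-separated integers, got {value!r}")
    return parts


def build_input(video: str, region: Optional[str] = None,
                center_point: Optional[str] = None, **options: Any) -> Dict[str, Any]:
    """Build the input payload; omitted fields fall back to the model defaults."""
    payload: Dict[str, Any] = {"video": video}
    if region is not None:
        x_min, y_min, x_max, y_max = parse_ints(region, 4)
        # Assumption: a usable region has positive width and height.
        if x_min >= x_max or y_min >= y_max:
            raise ValueError("region must satisfy x_min < x_max and y_min < y_max")
        payload["region"] = region
    if center_point is not None:
        parse_ints(center_point, 2)  # format check only
        payload["center_point"] = center_point
    for name, value in options.items():
        if name in LIMITS:
            low, high = LIMITS[name]
            if (low is not None and value < low) or (high is not None and value > high):
                raise ValueError(f"{name}={value} is outside the range listed in the schema")
        elif name not in OTHER_FIELDS:
            raise ValueError(f"unknown field: {name}")
        payload[name] = value
    return payload


payload = build_input(
    "https://example.com/clip.mp4",  # placeholder URL
    region="0,900,1920,1080",
    min_score=0.8,
    workers=4,
)

# Hypothetical call, assuming the official `replicate` Python client is installed
# and REPLICATE_API_TOKEN is set. The version identifier is copied as shown on
# this page and may be abbreviated:
#   import replicate
#   output = replicate.run("aodianyun/ocrsrt:a02ddebf", input=payload)
```

Validating on the client side is a design choice rather than a requirement: the API presumably rejects out-of-range values as well, but a local check fails faster and without spending a prediction.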

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{'title': 'Output', 'type': 'string'}
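Since the schema only promises a single string, a caller can treat the result as opaque text. The sketch below writes that string to a file. The helper `save_output` and the `.srt` file name are assumptions: the model name suggests SRT subtitles, but the schema does not say whether the string is subtitle text, a URL, or something else.

```python
from pathlib import Path


def save_output(output: str, path: str = "subtitles.srt") -> Path:
    """Write the model's string output to a UTF-8 file and return its path."""
    if not isinstance(output, str):
        raise TypeError("the output schema declares a string")
    target = Path(path)
    target.write_text(output, encoding="utf-8")
    return target
```

If the string turns out to be a URL rather than subtitle text, the file will contain only that URL and would need to be downloaded separately.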