
ghostljj/deepseek-ocr:037d8e61

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field, its default value is used.

| Field | Type | Default value | Description |
| --- | --- | --- | --- |
| mode | None | gundam | Processing mode: tiny (512×512), small (640×640), base (1024×1024), large (1280×1280), or gundam (adaptive 640+1024) |
| task_type | None | markdown | Task type: markdown (convert to markdown), ocr (standard OCR), free_ocr (OCR without layout), parse_figure (parse figures/diagrams), describe (detailed description), or locate (locate objects by reference) |
| image | string | | Input image URL or path for OCR |
| custom_prompt | string | | Custom prompt (optional, overrides task_type if provided) |
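As a rough illustration of how these fields fit together, the sketch below assembles an input payload in Python and checks `mode` and `task_type` against the values listed in the descriptions above. The helper `build_input` is hypothetical and not part of any client library. Treating `image` as required is an assumption based only on it having no default value. The example URL is a placeholder.

```python
from typing import Optional

# Allowed values copied from the field descriptions above.
MODES = {"tiny", "small", "base", "large", "gundam"}
TASK_TYPES = {"markdown", "ocr", "free_ocr", "parse_figure", "describe", "locate"}


def build_input(
    image: str,
    mode: str = "gundam",
    task_type: str = "markdown",
    custom_prompt: Optional[str] = None,
) -> dict:
    """Build an input payload (hypothetical helper, for illustration only)."""
    # Assumption: image is required because the schema lists no default for it.
    if not image:
        raise ValueError("image must be a non-empty URL or path")
    if mode not in MODES:
        raise ValueError(f"mode must be one of {sorted(MODES)}")
    if task_type not in TASK_TYPES:
        raise ValueError(f"task_type must be one of {sorted(TASK_TYPES)}")

    payload = {"image": image, "mode": mode, "task_type": task_type}
    # custom_prompt is optional; per the schema it overrides task_type when provided.
    if custom_prompt:
        payload["custom_prompt"] = custom_prompt
    return payload


if __name__ == "__main__":
    # Placeholder URL; replace with a real image location.
    print(build_input("https://example.com/scan.png", mode="base", task_type="ocr"))
```

The resulting dictionary would typically be passed as the input object of whichever API client you use to run the model. Fields left out of the call fall back to the defaults shown in the table.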

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{'title': 'Output', 'type': 'string'}
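
Since the schema declares a single string, a caller might guard against unexpected response types before using the result. The following is a minimal sketch of such a check. The name `check_output` is hypothetical, and the assumption that the string holds the recognized text (for example markdown when `task_type` is markdown) is inferred from the input descriptions rather than stated by the schema.

```python
# Output schema as shown above.
OUTPUT_SCHEMA = {"title": "Output", "type": "string"}


def check_output(output):
    """Return the output unchanged if it matches the schema's declared type."""
    # Hypothetical helper: only the "string" type from this schema is handled.
    if OUTPUT_SCHEMA["type"] == "string" and not isinstance(output, str):
        raise TypeError(f"expected a string, got {type(output).__name__}")
    return output


if __name__ == "__main__":
    # Stand-in value; a real response would come from running the model.
    print(check_output("# Sample heading\n\nSample text."))
```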