These models perform optical character recognition, extracting text from images. They can help digitize text from scanned documents, photos, and other visual media.
For most OCR tasks, we recommend the abiruyt/text-extract-ocr model. This versatile tool makes it simple to extract plain text from a wide variety of images.
To get clean markdown or JSON from PDF, epub, or other document formats, use Marker. It's a pipeline of models that supports all languages, removes headers and footers, formats equations and code blocks, and more. It can also OCR text from PDFs saved in image format.
Some other useful models for your text extraction pipeline:
Featured models
Recommended Models
If you want to extract text from images as quickly as possible, abiruyt/text-extract-ocr is a great starting point. It handles a wide range of image types—from clean scans to photos—and delivers text output in seconds.
For document-level extraction, cuuupid/marker is optimized for batch processing and converts files rapidly to structured formats like Markdown or JSON.
abiruyt/text-extract-ocr provides a strong balance between speed, accuracy, and simplicity. It’s a reliable default for most OCR pipelines.
If you need higher precision and structured results (like clean markdown output or proper formatting of code blocks), cuuupid/marker is a step up in quality and formatting control.
When you’re working with scanned PDFs, books, or reports, cuuupid/marker and lucataco/deepseek-ocr excel. They handle multi-column layouts, headings, and embedded media while removing unwanted artifacts like headers and footers.
These models also support non-Latin scripts and preserve structured information, making them suitable for academic or archival work.
If your goal is to recover LaTeX or math expressions, mickeybeurskens/latex-ocr specializes in converting equation images directly into LaTeX code.
For receipts, forms, or invoices, willywongi/donut extracts structured JSON with fields like item names, prices, and totals—perfect for automation or data pipelines.
OCR models here generally fall into two groups:
If you need layout or structure, use a document-understanding model; if you only need raw text, use a plain-text OCR model.
Most models output plain text or structured text files (Markdown, JSON, or LaTeX).
For instance, cuuupid/marker outputs Markdown or JSON with headings and code blocks preserved, while willywongi/donut returns structured data fields extracted from images.
Many OCR models are open source and can be self-hosted with Cog or Docker.
To publish your own version, include a replicate.yaml file defining inputs (like image or pdf_file) and outputs (text or json_output), then push it to your Replicate account to run on managed GPUs.
Yes—most OCR models are approved for commercial use. Always review the License section on the model’s page, since some models (especially those trained on academic datasets) may restrict redistribution or require attribution.
Go to a model’s page on Replicate, upload your image or PDF, and run it.
For example, abiruyt/text-extract-ocr will return plain text immediately, while cuuupid/marker lets you choose between output types like Markdown or JSON. Some models also support batch uploads for larger jobs.
Recommended Models

bytedance/dolphin
Document Image Parsing via Heterogeneous Anchor Prompting
Updated 3 weeks, 2 days ago
1.1K runs


lucataco/deepseek-ocr
Convert documents to markdown, extract raw text, and locate specific content
Updated 3 weeks, 4 days ago
29K runs

datalab-to/ocr
Detect and transcribe text in images with accurate bounding boxes, layout analysis, reding order, and table recognition, in 90 languages
Updated 3 weeks, 6 days ago
1.1K runs


pbevan1/llama-3.1-8b-ocr-correction
LLaMA 3.1-8B, finetuned on a synthetic OCR dataset for superior OCR correction.
Updated 1 year, 3 months ago
52 runs


cuuupid/glm-4v-9b
GLM-4V is a multimodal model released by Tsinghua University that is competitive with GPT-4o and establishes a new SOTA on several benchmarks, including OCR.
Updated 1 year, 4 months ago
92.4K runs


cudanexus/ocr-surya
Surya is a document OCR toolkit that does:
Updated 1 year, 8 months ago
6.6K runs


cuuupid/marker
Convert scanned or electronic documents to markdown, very very very fast
Updated 1 year, 11 months ago
2.9K runs


mickeybeurskens/latex-ocr
Optical character recognition to turn images of latex equations into latex format.
Updated 2 years ago
872 runs


abiruyt/text-extract-ocr
A simple OCR Model that can easily extract text from an image.
Updated 2 years ago
90M runs


awilliamson10/meta-nougat
Nougat: Neural Optical Understanding for Academic Documents
Updated 2 years, 2 months ago
4.9K runs


kshitijagrwl/pii-extractor-llm
PII Data Extraction from Text
Updated 2 years, 3 months ago
167 runs


willywongi/donut
Extract structured data from receipt images using Donut 🍩 (Document Understanding Transformer)
Updated 2 years, 6 months ago
2.2K runs


cjwbw/docentr
End-to-End Document Image Enhancement Transformer
Updated 3 years, 2 months ago
4.7K runs