lucataco/deepseek-ocr

Convert documents to markdown, extract raw text, and locate specific content

Public
161 runs

DeepSeek AI


Homepage Hugging Face

Discord Twitter Follow

🌟 Github | πŸ“₯ Model Download | πŸ“„ Paper Link | πŸ“„ Arxiv Paper Link |

DeepSeek-OCR: Contexts Optical Compression

Explore the boundaries of visual-text compression.

Usage

Inference using Huggingface transformers on NVIDIA GPUs. Requirements tested on python 3.12.9 + CUDA11.8:

vLLM

Refer to 🌟GitHub for guidance on model inference acceleration and PDF processing, etc.

Visualizations

Acknowledgement

We would like to thank Vary, GOT-OCR2.0, MinerU, PaddleOCR, OneChart, Slow Perception for their valuable models and ideas.

We also appreciate the benchmarks: Fox, OminiDocBench.