lucataco/deepseek-ocr

Convert documents to markdown, extract raw text, and locate specific content

Public
132 runs

Run time and cost

This model costs approximately $0.029 to run on Replicate, or 34 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.

This model runs on Nvidia L40S GPU hardware. Predictions typically complete within 30 seconds. The predict time for this model varies significantly based on the inputs.

Readme

DeepSeek AI


Homepage Hugging Face

Discord Twitter Follow

🌟 Github | πŸ“₯ Model Download | πŸ“„ Paper Link | πŸ“„ Arxiv Paper Link |

DeepSeek-OCR: Contexts Optical Compression

Explore the boundaries of visual-text compression.

Usage

Inference using Huggingface transformers on NVIDIA GPUs. Requirements tested on python 3.12.9 + CUDA11.8:

vLLM

Refer to 🌟GitHub for guidance on model inference acceleration and PDF processing, etc.

Visualizations

Acknowledgement

We would like to thank Vary, GOT-OCR2.0, MinerU, PaddleOCR, OneChart, Slow Perception for their valuable models and ideas.

We also appreciate the benchmarks: Fox, OminiDocBench.