deepseek-ai / janus-pro-1b

Janus-Pro is a novel autoregressive framework for multimodal understanding

6.7K runs
Public

deepseek-ai / deepseek-vl2

DeepSeek-VL2, an advanced series of large Mixture-of-Experts (MoE) Vision-Language Models that significantly improves upon its predecessor, DeepSeek-VL

52.2K runs
Public

deepseek-ai / deepseek-vl2-small

DeepSeek-VL2-small, an advanced series of large Mixture-of-Experts (MoE) Vision-Language Models that significantly improves upon its predecessor, DeepSeek-VL

675 runs
Public

deepseek-ai / janus-pro-7b

Janus-Pro is a novel autoregressive framework for multimodal understanding

10.7K runs
Public

deepseek-ai / deepseek-r1

A reasoning model trained with reinforcement learning, on par with OpenAI o1

1.1M runs
Public

deepseek-ai / deepseek-coder-v2-lite-instruct

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

426 runs
Public

deepseek-ai / deepseek-67b-base

DeepSeek LLM, an advanced language model comprising 67 billion parameters. Trained from scratch on a vast dataset of 2 trillion tokens in both English and Chinese

456 runs
Public

deepseek-ai / deepseek-vl-7b-base

DeepSeek-VL: An open-source Vision-Language Model designed for real-world vision and language understanding applications

3.5K runs
Public

deepseek-ai / deepseek-math-7b-instruct

Pushing the Limits of Mathematical Reasoning in Open Language Models - Instruct model

1.6K runs
Public

deepseek-ai / deepseek-math-7b-base

Pushing the Limits of Mathematical Reasoning in Open Language Models - Base model

2.1K runs
Public

deepseek-ai / deepseek-v3

DeepSeek-V3-0324 is the leading non-reasoning model, a milestone for open source

1.4M runs
Public