DeepSeek-V3-0324 is the leading non-reasoning model, a milestone for open source
DeepSeek LLM, an advanced language model comprising 67 billion parameters. Trained from scratch on a vast dataset of 2 trillion tokens in both English and Chinese
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
Pushing the Limits of Mathematical Reasoning in Open Language Models - Base model
Pushing the Limits of Mathematical Reasoning in Open Language Models - Instruct model
A reasoning model trained with reinforcement learning, on par with OpenAI o1
DeepSeek-VL2, an advanced series of large Mixture-of-Experts (MoE) Vision-Language Models that significantly improves upon its predecessor, DeepSeek-VL
DeepSeek-VL2-small, an advanced series of large Mixture-of-Experts (MoE) Vision-Language Models that significantly improves upon its predecessor, DeepSeek-VL
DeepSeek-VL: An open-source Vision-Language Model designed for real-world vision and language understanding applications
Janus-Pro is a novel autoregressive framework for multimodal understanding
This model is warm. You'll get a fast response if the model is warm and already running, and a slower response if the model is cold and starting up.