Use LLMs
These large language models understand and generate natural language. They power chatbots, search engines, writing aids, and more.
Use these for:
- Conversational AI: Chat and engage in natural dialogue. Get an AI assistant.
- Question answering: Provide informative answers to questions. Build a knowledge base.
- Text generation: Generate fluent continuations of text. Autocomplete your writing.
- Summarization: Summarize long passages of text. Get key points quickly.
- Translation: Translate between languages. Communicate across language barriers.
Language models keep getting bigger and better at these tasks. The largest models today exhibit impressive reasoning skills. But you can get great results from smaller, faster, cheaper models too.
Featured models

anthropic / claude-4-sonnet
Claude Sonnet 4 is a significant upgrade to 3.7, delivering superior coding and reasoning while responding more precisely to your instructions
Updated 1 month ago

openai / o4-mini
OpenAI's fast, lightweight reasoning model
Updated 1 month ago

deepseek-ai / deepseek-r1
A reasoning model trained with reinforcement learning, on par with OpenAI o1
Updated 5 months, 2 weeks ago
Recommended models

openai / gpt-4.1-nano
Fastest, most cost-effective GPT-4.1 model from OpenAI
Updated 3 weeks, 4 days ago

openai / gpt-4.1-mini
Fast, affordable version of GPT-4.1
Updated 3 weeks, 4 days ago

openai / gpt-4o
OpenAI's high-intelligence chat model
Updated 3 weeks, 4 days ago

openai / gpt-4o-mini
Low latency, low cost version of OpenAI's GPT-4o model
Updated 3 weeks, 4 days ago

openai / o1
OpenAI's first o-series reasoning model
Updated 1 month ago

openai / gpt-4.1
OpenAI's Flagship GPT model for complex tasks.
Updated 1 month ago

openai / o1-mini
A small model alternative to o1
Updated 1 month, 4 weeks ago

ibm-granite / granite-3.3-8b-instruct
Granite-3.3-8B-Instruct is a 8-billion parameter 128K context length language model fine-tuned for improved reasoning and instruction-following capabilities.
Updated 3 months ago

deepseek-ai / deepseek-v3
DeepSeek-V3-0324 is the leading non-reasoning model, a milestone for open source
Updated 3 months, 3 weeks ago

anthropic / claude-3.7-sonnet
The most intelligent Claude model and the first hybrid reasoning model on the market (claude-3-7-sonnet-20250219)
Updated 4 months, 3 weeks ago

anthropic / claude-3.5-haiku
Anthropic's fastest, most cost-effective model, with a 200K token context window (claude-3-5-haiku-20241022)
Updated 5 months ago

anthropic / claude-3.5-sonnet
Anthropic's most intelligent language model to date, with a 200K token context window and image understanding (claude-3-5-sonnet-20241022)
Updated 5 months ago

meta / meta-llama-3.1-405b-instruct
Meta's flagship 405 billion parameter language model, fine-tuned for chat completions
Updated 11 months, 3 weeks ago

yorickvp / llava-13b
Visual instruction tuning towards large language and vision models with GPT-4 level capabilities
Updated 1 year ago

meta / meta-llama-3-70b
Base version of Llama 3, a 70 billion parameter language model from Meta.
Updated 1 year, 3 months ago

meta / meta-llama-3-70b-instruct
A 70 billion parameter language model from Meta, fine tuned for chat completions
Updated 1 year, 3 months ago

meta / meta-llama-3-8b-instruct
An 8 billion parameter language model from Meta, fine tuned for chat completions
Updated 1 year, 3 months ago

meta / meta-llama-3-8b
Base version of Llama 3, an 8 billion parameter language model from Meta.
Updated 1 year, 3 months ago

google-deepmind / gemma-2b-it
2B instruct version of Google’s Gemma model
Updated 1 year, 4 months ago

yorickvp / llava-v1.6-vicuna-13b
LLaVA v1.6: Large Language and Vision Assistant (Vicuna-13B)
Updated 1 year, 5 months ago

yorickvp / llava-v1.6-mistral-7b
LLaVA v1.6: Large Language and Vision Assistant (Mistral-7B)
Updated 1 year, 5 months ago

stability-ai / stablelm-tuned-alpha-7b
7 billion parameter version of Stability AI's language model
Updated 2 years, 2 months ago

replicate / flan-t5-xl
A language model by Google for tasks like classification, summarization, and more
Updated 2 years, 3 months ago

replicate / llama-7b
Transformers implementation of the LLaMA language model
Updated 2 years, 4 months ago