Collections

Generate text

These large language models understand and generate natural language. They power chatbots, search engines, writing aids, and more.

Use these for:

  • Conversational AI: Chat and engage in natural dialogue. Get an AI assistant.
  • Question answering: Provide informative answers to questions. Build a knowledge base.
  • Text generation: Generate fluent continuations of text. Autocomplete your writing.
  • Summarization: Summarize long passages of text. Get key points quickly.
  • Translation: Translate between languages. Communicate across language barriers.

Language models keep getting bigger and better at these tasks. The largest models today exhibit impressive reasoning skills. But you can get great results from smaller, faster, cheaper models too.

Recommended models

openai / o1-mini

A small model alternative to o1

6 runs

openai / gpt-4o

OpenAI's high-intelligence chat model

13.6K runs

openai / gpt-4o-mini

Low latency, low cost version of OpenAI's GPT-4o model

1.4K runs

openai / gpt-4.1-nano

Fastest, most cost-effective GPT-4.1 model from OpenAI

151 runs

openai / gpt-4.1-mini

Fast, affordable version of GPT-4.1

101 runs

anthropic / claude-3.5-sonnet

Anthropic's most intelligent language model to date, with a 200K token context window and image understanding (claude-3-5-sonnet-20241022)

479.7K runs

meta / meta-llama-3-70b

Base version of Llama 3, a 70 billion parameter language model from Meta.

829.8K runs

meta / meta-llama-3-70b-instruct

A 70 billion parameter language model from Meta, fine tuned for chat completions

152.7M runs

meta / meta-llama-3-8b-instruct

An 8 billion parameter language model from Meta, fine tuned for chat completions

360.6M runs

meta / meta-llama-3-8b

Base version of Llama 3, an 8 billion parameter language model from Meta.

50.9M runs

google-deepmind / gemma-7b

7B base version of Google’s Gemma model

7.7K runs

google-deepmind / gemma-2b

2B base version of Google’s Gemma model

2.4K runs

google-deepmind / gemma-7b-it

7B instruct version of Google’s Gemma model

88.5K runs

google-deepmind / gemma-2b-it

2B instruct version of Google’s Gemma model

133.5K runs

lucataco / phixtral-2x2_​8

phixtral-2x2_8 is the first Mixure of Experts (MoE) made with two microsoft/phi-2 models, inspired by the mistralai/Mixtral-8x7B-v0.1 architecture

1.5K runs

lucataco / qwen1.5-72b

Qwen1.5 is the beta version of Qwen2, a transformer-based decoder-only language model pretrained on a large amount of data

4.1K runs

lucataco / qwen1.5-7b

Qwen1.5 is the beta version of Qwen2, a transformer-based decoder-only language model pretrained on a large amount of data

3.5K runs

adirik / mamba-2.8b

Base version of Mamba 2.8B, a 2.8 billion parameter state space language model

834 runs

adirik / mamba-130m

Base version of Mamba 130M, a 130 million parameter state space language model

142 runs

adirik / mamba-370m

Base version of Mamba 370M, a 370 million parameter state space language model

53 runs

adirik / mamba-790m

Base version of Mamba 790M, a 790 million parameter state space language model

51 runs

adirik / mamba-2.8b-slimpj

Base version of Mamba 2.8B Slim Pyjama, a 2.8 billion parameter state space language model

75 runs

adirik / mamba-1.4b

Base version of Mamba 1.4B, a 1.4 billion parameter state space language model

104 runs

lucataco / phi-2

Phi-2 by Microsoft

3.6K runs

nateraw / nous-hermes-2-solar-10.7b

Nous Hermes 2 - SOLAR 10.7B is the flagship Nous Research model on the SOLAR 10.7B base model..

70.6K runs

kcaverly / nous-hermes-2-yi-34b-gguf

Nous Hermes 2 - Yi-34B is a state of the art Yi Fine-tune, fine tuned on GPT-4 generated synthetic data

11.6K runs

01-ai / yi-34b-chat

The Yi series models are large language models trained from scratch by developers at 01.AI.

320.2K runs

01-ai / yi-6b-chat

The Yi series models are large language models trained from scratch by developers at 01.AI.

8.2K runs

01-ai / yi-6b

The Yi series models are large language models trained from scratch by developers at 01.AI.

161.1K runs

nateraw / nous-hermes-llama2-awq

TheBloke/Nous-Hermes-Llama2-AWQ served with vLLM

7.3K runs

stability-ai / stablelm-tuned-alpha-7b

7 billion parameter version of Stability AI's language model

140.5K runs

replicate / flan-t5-xl

A language model by Google for tasks like classification, summarization, and more

150.8K runs

replicate / gpt-j-6b

A large language model by EleutherAI

9.6K runs

replicate / llama-7b

Transformers implementation of the LLaMA language model

99.2K runs