Streaming language models

Language models that support streaming responses. See https://replicate.com/docs/streaming
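As a rough illustration of what consuming a streamed response involves, the sketch below parses a server-sent events (SSE) text stream and joins the text chunks. It assumes the stream is delivered as SSE and that chunks arrive under an `output` event with a final `done` event. Those event names, and the helper names `parse_sse` and `collect_output`, are assumptions made for this example and should be checked against the linked documentation.

```python
from typing import Iterable, Iterator, List, Tuple


def parse_sse(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (event, data) pairs from an iterable of SSE text lines."""
    event = "message"          # default event type when none is given
    data: List[str] = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":
            # A blank line terminates the current event.
            if data:
                yield event, "\n".join(data)
            event, data = "message", []
        elif line.startswith(":"):
            # Comment line, often used as a keep-alive; ignore it.
            continue
        elif line.startswith("event:"):
            event = line[len("event:"):].strip()
        elif line.startswith("data:"):
            value = line[len("data:"):]
            # Only a single leading space is stripped from the value.
            if value.startswith(" "):
                value = value[1:]
            data.append(value)


def collect_output(lines: Iterable[str]) -> str:
    """Join the data of 'output' events until a 'done' event (assumed names)."""
    parts: List[str] = []
    for event, data in parse_sse(lines):
        if event == "done":
            break
        if event == "output":
            parts.append(data)
    return "".join(parts)


# Hypothetical stream contents, for illustration only.
sample = [
    "event: output\n", "data: Hello\n", "\n",
    ": keep-alive\n",
    "event: output\n", "data:  world\n", "\n",
    "event: done\n", "data: {}\n", "\n",
]
print(collect_output(sample))
```

In practice the lines would come from an HTTP response read incrementally rather than from a list, so each chunk can be shown as soon as it arrives.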

meta / llama-2-13b-chat

A 13 billion parameter language model from Meta, fine-tuned for chat completions

1.5M runs

meta / llama-2-70b-chat

A 70 billion parameter language model from Meta, fine-tuned for chat completions

1.3M runs

meta / llama-2-7b-chat

A 7 billion parameter language model from Meta, fine-tuned for chat completions

282.2K runs

replicate / dolly-v2-12b

An open source instruction-tuned large language model developed by Databricks

202.1K runs

replicate / vicuna-13b

A large language model that's been fine-tuned on ChatGPT interactions

183.5K runs

joehoover / instructblip-vicuna13b

An instruction-tuned multi-modal model based on BLIP-2 and Vicuna-13B

178.9K runs

meta / llama-2-7b

Base version of Llama 2 7B, a 7 billion parameter language model

132K runs

replicate / flan-t5-xl

A language model by Google for tasks like classification, summarization, and more

85.3K runs

stability-ai / stablelm-tuned-alpha-7b

7 billion parameter version of Stability AI's language model

84.5K runs

replicate / llama-7b

Transformers implementation of the LLaMA language model

80.7K runs

fofr / prompt-classifier

Determines the toxicity of text-to-image prompts; a llama-13b fine-tune. Returns a [SAFETY_RANKING] between 0 (safe) and 10 (toxic)

57K runs

meta / llama-2-70b

Base version of Llama 2, a 70 billion parameter language model from Meta.

44.3K runs

meta / codellama-13b

A 13 billion parameter Llama tuned for code completion

40.2K runs

joehoover / mplug-owl

An instruction-tuned multimodal large language model that generates text based on user-provided prompts and images

38K runs

replicate / oasst-sft-1-pythia-12b

An open source instruction-tuned large language model developed by Open-Assistant

22.4K runs

joehoover / falcon-40b-instruct

A 40 billion parameter language model trained to follow human instructions.

21.7K runs

fofr / image-prompts

Generate image prompts for Midjourney. Prefix inputs with "Image: "

14.7K runs

uwulewd / airoboros-llama-2-70b

Inference for Airoboros L2 70B 2.1 (GPTQ) using ExLlama.

12.8K runs

meta / llama-2-13b

Base version of Llama 2 13B, a 13 billion parameter language model

8.2K runs

meta / codellama-7b-instruct

A 7 billion parameter Llama tuned for coding and conversation

8.2K runs

meta / codellama-34b-instruct

A 34 billion parameter Llama tuned for coding and conversation

7.3K runs

replicate / mpt-7b-storywriter

A 7B parameter LLM fine-tuned to support contexts with more than 65K tokens

6.8K runs

meta / codellama-13b-instruct

A 13 billion parameter Llama tuned for coding and conversation

6.4K runs

meta / codellama-34b

A 34 billion parameter Llama tuned for coding and conversation

5.2K runs

gregwdata / defog-sqlcoder-q8

Defog's SQLCoder is a state-of-the-art LLM for converting natural language questions to SQL queries. SQLCoder is a 15B parameter model fine-tuned on a base StarCoder model.

4.6K runs

meta / codellama-7b

A 7 billion parameter Llama tuned for coding and conversation

4.2K runs

joehoover / sql-generator

3.5K runs

nomagick / chatglm2-6b

ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型

3.1K runs

replicate / llama-13b-lora

Transformers implementation of the LLaMA 13B language model

2.8K runs

replicate / gpt-j-6b

A large language model by EleutherAI

2.7K runs

replit / replit-code-v1-3b

Generate code with Replit's replit-code-v1-3b large language model

1.6K runs

a16z-infra / mistral-7b-instruct-v0.1

An instruction-tuned 7 billion parameter language model from Mistral

1K runs

daanelson / flan-t5-large

A language model for tasks like classification, summarization, and more.

855 runs

nateraw / samsum-llama-2-13b

818 runs

meta / codellama-34b-python

A 34 billion parameter Llama tuned for coding with Python

711 runs

niron1 / qwen-7b-chat

Qwen-7B is the 7B-parameter version of the large language model series, Qwen (abbr. Tongyi Qianwen), proposed by Alibaba Cloud. Qwen-7B is a Transformer-based large language model, which is pretrained on a large volume of data, including web texts, books,

644 runs

ruben-svensson / llama2-aqua-test1

605 runs

stability-ai / stablelm-base-alpha-7b

7B parameter base version of Stability AI's language model

519 runs

niron1 / openorca-platypus2-13b

OpenOrca-Platypus2-13B is a merge of garage-bAInd/Platypus2-13B and Open-Orca/OpenOrcaxOpenChat-Preview2-13B.

463 runs

fofr / star-trek-gpt-j-6b

gpt-j-6b trained on the Memory Alpha Star Trek Wiki

398 runs

fofr / llama2-prompter

Llama 2 13B base model fine-tuned on text-to-image prompts

397 runs

replicate-internal / staging-llama-2-7b

378 runs

meta / codellama-7b-python

A 7 billion parameter Llama tuned for coding with Python

311 runs

a16z-infra / mistral-7b-v0.1

A 7 billion parameter language model from Mistral.

240 runs

niron1 / llama-2-7b-chat

LLAMA-2 7b chat version by Meta. Stream support. Unaltered prompt. Temperature working properly. Economical hardware.

208 runs

cbh123 / dylan-lyrics

Llama 2 13B fine-tuned on Bob Dylan lyrics

205 runs

stability-ai / stablelm-base-alpha-3b

3B parameter base version of Stability AI's language model

170 runs

nomagick / chatglm2-6b-int4

ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型 (int4)

152 runs

fofr / star-trek-adventure

148 runs

nateraw / stablecode-completion-alpha-3b-4k

142 runs

m1guelpf / mario-gpt

Using language models to generate Super Mario Bros levels

132 runs

moinnadeem / fastervicuna_13b

Re-implements LLaMA using a higher-MFU implementation

119 runs

fofr / star-trek-flan

flan-t5-xl trained on the Memory Alpha Star Trek Wiki

118 runs

fofr / neuromancer-13b

llama-13b-base fine-tuned on Neuromancer style

115 runs

xrunda / med

114 runs

andreasjansson / wizardcoder-python-34b-v1-gguf

WizardCoder-python-34B-v1.0 with support for grammars and jsonschema

112 runs

fofr / star-trek-llama

llama-7b trained on the Memory Alpha Star Trek Wiki

110 runs

andreasjansson / llama-2-13b-gguf

Llama-2 13B with support for grammars and jsonschema

104 runs

nateraw / samsum-llama-7b

llama-2-7b fine-tuned on the samsum dataset for dialogue summarization

103 runs

meta / codellama-13b-python

A 13 billion parameter Llama tuned for coding with Python

96 runs

zeke / nyu-llama-2-7b-chat-training-test

A test model for fine-tuning Llama 2

94 runs

charles-dyfis-net / llama-2-13b-hf--lmtp-8bit

83 runs

andreasjansson / llama-2-13b-chat-gguf

Llama-2 13B chat with support for grammars and jsonschema

80 runs

andreasjansson / llama-2-70b-chat-gguf

Llama-2 70B chat with support for grammars and jsonschema

77 runs

tanzir11 / merge

73 runs

nateraw / llama-2-7b-chat-hf

60 runs

moinnadeem / codellama-34b-instruct-vllm

59 runs

andreasjansson / codellama-34b-instruct-gguf

CodeLlama-34B-instruct with support for grammars and jsonschema

55 runs

crowdy / line-lang-3.6b

An implementation of a 3.6B parameter Japanese large language model

51 runs

nateraw / wizardcoder-python-34b-v1.0

48 runs

replicate / elixir-gen

Fine-tuned Llama 13b on Elixir docstrings (WIP)

45 runs

cbh123 / homerbot

45 runs

cbh123 / samsum

43 runs

nateraw / codellama-7b-instruct-hf

36 runs

juanjaragavi / abby-llama-2-7b-chat

Abby is a stoic philosopher and a loving and caring mature woman.

36 runs

nateraw / aidc-ai-business-marcoroni-13b

35 runs

zallesov / super-real-llama2

29 runs

sruthiselvaraj / finetuned-llama2

29 runs

seanoliver / bob-dylan-fun-tuning

Llama fine-tune-athon project training Llama 2 on Bob Dylan lyrics.

24 runs

nwhitehead / llama2-7b-chat-gptq

23 runs

moinnadeem / vllm-engine-llama-7b

16 runs

charles-dyfis-net / llama-2-7b-hf--lmtp-4bit

15 runs

charles-dyfis-net / llama-2-13b-hf--lmtp

11 runs

divyavanmahajan / my-pet-llama

11 runs

juanjaragavi / abbot-llama-2-7b-chat

Abbot is a brutally honest stoic philosopher. He is here to help the 'User' be their best self, no coddling.

11 runs

nateraw / codellama-7b

9 runs

nateraw / codellama-34b

8 runs

charles-dyfis-net / llama-2-13b-hf--lmtp-4bit

7 runs

nateraw / codellama-13b

6 runs

andreasjansson / codellama-7b-instruct-gguf

CodeLlama-7B-instruct with support for grammars and jsonschema

4 runs

nateraw / gairmath-abel-7b

4 runs

nateraw / codellama-7b-instruct

3 runs

nateraw / codellama-13b-instruct

2 runs