Streaming language models
Language models that support streaming responses. See https://replicate.com/docs/streaming

meta / llama-2-13b-chat
A 13 billion parameter language model from Meta, fine tuned for chat completions

meta / llama-2-70b-chat
A 70 billion parameter language model from Meta, fine tuned for chat completions

meta / llama-2-7b-chat
A 7 billion parameter language model from Meta, fine tuned for chat completions

replicate / dolly-v2-12b
An open source instruction-tuned large language model developed by Databricks

replicate / vicuna-13b
A large language model that's been fine-tuned on ChatGPT interactions

joehoover / instructblip-vicuna13b
An instruction-tuned multi-modal model based on BLIP-2 and Vicuna-13B

meta / llama-2-7b
Base version of Llama 2 7B, a 7 billion parameter language model

replicate / flan-t5-xl
A language model by Google for tasks like classification, summarization, and more

stability-ai / stablelm-tuned-alpha-7b
7 billion parameter version of Stability AI's language model

replicate / llama-7b
Transformers implementation of the LLaMA language model

fofr / prompt-classifier
Determines the toxicity of text to image prompts, llama-13b fine-tune. [SAFETY_RANKING] between 0 (safe) and 10 (toxic)

meta / llama-2-70b
Base version of Llama 2, a 70 billion parameter language model from Meta.

meta / codellama-13b
A 13 billion parameter Llama tuned for code completion

joehoover / mplug-owl
An instruction-tuned multimodal large language model that generates text based on user-provided prompts and images

replicate / oasst-sft-1-pythia-12b
An open source instruction-tuned large language model developed by Open-Assistant

joehoover / falcon-40b-instruct
A 40 billion parameter language model trained to follow human instructions.

fofr / image-prompts
Generate image prompts for Midjourney. Prefix inputs with "Image: "

uwulewd / airoboros-llama-2-70b
Inference Airoboros L2 70B 2.1 - GPTQ using ExLlama.

meta / llama-2-13b
Base version of Llama 2 13B, a 13 billion parameter language model

meta / codellama-7b-instruct
A 7 billion parameter Llama tuned for coding and conversation

meta / codellama-34b-instruct
A 34 billion parameter Llama tuned for coding and conversation

replicate / mpt-7b-storywriter
A 7B parameter LLM fine-tuned to support contexts with more than 65K tokens

meta / codellama-13b-instruct
A 13 billion parameter Llama tuned for coding and conversation

meta / codellama-34b
A 34 billion parameter Llama tuned for coding and conversation

gregwdata / defog-sqlcoder-q8
Defog's SQLCoder is a state-of-the-art LLM for converting natural language questions to SQL queries. SQLCoder is a 15B parameter fine-tuned on a base StarCoder model.

meta / codellama-7b
A 7 billion parameter Llama tuned for coding and conversation
joehoover / sql-generator

nomagick / chatglm2-6b
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型

replicate / llama-13b-lora
Transformers implementation of the LLaMA 13B language model

replicate / gpt-j-6b
A large language model by EleutherAI

replit / replit-code-v1-3b
Generate code with Replit's replit-code-v1-3b large language model

a16z-infra / mistral-7b-instruct-v0.1
An instruction-tuned 7 billion parameter language model from Mistral

daanelson / flan-t5-large
A language model for tasks like classification, summarization, and more.
nateraw / samsum-llama-2-13b

meta / codellama-34b-python
A 34 billion parameter Llama tuned for coding with Python

niron1 / qwen-7b-chat
Qwen-7B is the 7B-parameter version of the large language model series, Qwen (abbr. Tongyi Qianwen), proposed by Aibaba Cloud. Qwen-7B`is a Transformer-based large language model, which is pretrained on a large volume of data, including web texts, books,
ruben-svensson / llama2-aqua-test1

stability-ai / stablelm-base-alpha-7b
7B parameter base version of Stability AI's language model

niron1 / openorca-platypus2-13b
OpenOrca-Platypus2-13B is a merge of garage-bAInd/Platypus2-13B and Open-Orca/OpenOrcaxOpenChat-Preview2-13B.

fofr / star-trek-gpt-j-6b
gpt-j-6b trained on the Memory Alpha Star Trek Wiki

fofr / llama2-prompter
Llama2 13b base model fine-tuned on text to image prompts
replicate-internal / staging-llama-2-7b

meta / codellama-7b-python
A 7 billion parameter Llama tuned for coding with Python

a16z-infra / mistral-7b-v0.1
A 7 billion parameter language model from Mistral.

niron1 / llama-2-7b-chat
LLAMA-2 7b chat version by Meta. Stream support. Unaltered prompt. Temperature working properly. Economical hardware.
cbh123 / dylan-lyrics
Llama 2 13B fine-tuned on Bob Dylan lyrics

stability-ai / stablelm-base-alpha-3b
3B parameter base version of Stability AI's language model

nomagick / chatglm2-6b-int4
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型 (int4)
fofr / star-trek-adventure
nateraw / stablecode-completion-alpha-3b-4k
m1guelpf / mario-gpt
Using language models to generate Super Mario Bros levels
moinnadeem / fastervicuna_13b
Re-implements LLaMa using a higher MFU implementation

fofr / star-trek-flan
flan-t5-xl trained on the Memory Alpha Star Trek Wiki

fofr / neuromancer-13b
llama-13b-base fine-tuned on Neuromancer style

xrunda / med

andreasjansson / wizardcoder-python-34b-v1-gguf
WizardCoder-python-34B-v1.0 with support for grammars and jsonschema

fofr / star-trek-llama
llama-7b trained on the Memory Alpha Star Trek Wiki

andreasjansson / llama-2-13b-gguf
Llama-2 13B with support for grammars and jsonschema

nateraw / samsum-llama-7b
llama-2-7b fine-tuned on the samsum dataset for dialogue summarization

meta / codellama-13b-python
A 13 billion parameter Llama tuned for coding with Python
zeke / nyu-llama-2-7b-chat-training-test
A test model for fine-tuning Llama 2
charles-dyfis-net / llama-2-13b-hf--lmtp-8bit

andreasjansson / llama-2-13b-chat-gguf
Llama-2 13B chat with support for grammars and jsonschema

andreasjansson / llama-2-70b-chat-gguf
Llama-2 70B chat with support for grammars and jsonschema
tanzir11 / merge
nateraw / llama-2-7b-chat-hf
moinnadeem / codellama-34b-instruct-vllm

andreasjansson / codellama-34b-instruct-gguf
CodeLlama-34B-instruct with support for grammars and jsonschema

crowdy / line-lang-3.6b
an implementation of 3.6b Japanese large language model
nateraw / wizardcoder-python-34b-v1.0
replicate / elixir-gen
Fine-tuned Llama 13b on Elixir docstrings (WIP)
cbh123 / homerbot
cbh123 / samsum
nateraw / codellama-7b-instruct-hf

juanjaragavi / abby-llama-2-7b-chat
Abby is a stoic philosopher and a loving and caring mature woman.
nateraw / aidc-ai-business-marcoroni-13b
zallesov / super-real-llama2
sruthiselvaraj / finetuned-llama2
seanoliver / bob-dylan-fun-tuning
Llama fine-tune-athon project training llama2 on bob dylan lyrics.
nwhitehead / llama2-7b-chat-gptq
moinnadeem / vllm-engine-llama-7b
charles-dyfis-net / llama-2-7b-hf--lmtp-4bit
charles-dyfis-net / llama-2-13b-hf--lmtp
divyavanmahajan / my-pet-llama

juanjaragavi / abbot-llama-2-7b-chat
Abbot is brutally honest stoic philosopher. He is here to help the 'User' be their best self, no coddling.
nateraw / codellama-7b
nateraw / codellama-34b
charles-dyfis-net / llama-2-13b-hf--lmtp-4bit
nateraw / codellama-13b

andreasjansson / codellama-7b-instruct-gguf
CodeLlama-7B-instruct with support for grammars and jsonschema
nateraw / gairmath-abel-7b
nateraw / codellama-7b-instruct
nateraw / codellama-13b-instruct