Streaming language models
Language models that support streaming responses. See https://replicate.com/docs/streaming
Recommended models

lucataco / qwq-32b
QwQ is the reasoning model of the Qwen series. Compared with conventional instruction-tuned models, QwQ, which is capable of thinking and reasoning
Updated 6 months, 1 week ago

ibm-granite / granite-vision-3.2-2b
Granite-Vision-3.2-2B is a compact and efficient vision-language model, specifically designed for visual document understanding.
Updated 6 months, 1 week ago

deep-sphere-ai / expert-research-ai
No pages por DeepResearch. Este nuevo agente de investigación viene a resolver todo por ti, solo dale tiempo :D
Updated 6 months, 1 week ago

hayooucom / vision-model2
welcome to contact us. youkpan@gmail.com
Updated 6 months, 2 weeks ago

anthropic / claude-3.7-sonnet
The most intelligent Claude model and the first hybrid reasoning model on the market (claude-3-7-sonnet-20250219)
Updated 6 months, 2 weeks ago

lucataco / r1-1776-70b
A version of the DeepSeek-R1 model that has been post trained to provide unbiased, accurate, and factual information by Perplexity
Updated 6 months, 3 weeks ago

anthropic / claude-3.5-haiku
Anthropic's fastest, most cost-effective model, with a 200K token context window (claude-3-5-haiku-20241022)
Updated 7 months ago

anthropic / claude-3.5-sonnet
Anthropic's most intelligent language model to date, with a 200K token context window and image understanding (claude-3-5-sonnet-20241022)
Updated 7 months ago

edoproch / deepseekr1-distilled-llama-70b-ollama
DeepSeek-R1 distilled on LLaMA3.3 70B and quantized by ollama
Updated 7 months, 2 weeks ago

edoproch / deepseekr1-distilled-llama-8b-ollama
DeepSeek-R1 distilled on LLaMA 8B
Updated 7 months, 2 weeks ago

deepseek-ai / deepseek-r1
A reasoning model trained with reinforcement learning, on par with OpenAI o1
Updated 7 months, 2 weeks ago

lucataco / deepseek-r1-70b
DeepSeek's first generation reasoning models with comparable performance to OpenAI-o1
Updated 7 months, 3 weeks ago

ibm-granite / granite-3.1-8b-instruct
Granite-3.1-8B-Instruct is a lightweight and open-source 8B parameter model is designed to excel in instruction following tasks such as summarization, problem-solving, text translation, reasoning, code tasks, function-calling, and more.
Updated 8 months, 3 weeks ago

ibm-granite / granite-3.1-2b-instruct
Granite-3.1-2B-Instruct is a lightweight and open-source 2B parameter model designed to excel in instruction following tasks such as summarization, problem-solving, text translation, reasoning, code tasks, function-calling, and more.
Updated 8 months, 3 weeks ago

lucataco / ollama-llama3.2-vision-90b
Ollama Llama 3.2 Vision 90B
Updated 8 months, 4 weeks ago

lucataco / ollama-llama3.2-vision-11b
Ollama Llama 3.2 Vision 11B
Updated 8 months, 4 weeks ago

lucataco / ollama-qwq
Ollama QwQ 32B
Updated 8 months, 4 weeks ago

lucataco / ollama-llama3.3-70b
Ollama Llama 3.3 70B
Updated 8 months, 4 weeks ago

pku-yuangroup / llava-cot
Let Vision Language Models Reason Step-by-Step
Updated 9 months, 1 week ago

zhouhaojiang / qwen_32b
without examination qwen2.5 32b
Updated 9 months, 3 weeks ago

lucataco / llama-3-vision-alpha
Projection module trained to add vision capabilties to Llama 3 using SigLIP
Updated 10 months, 1 week ago

lucataco / ollama-nemotron-70b
Ollama Nemotron 70b
Updated 10 months, 4 weeks ago

ibm-granite / granite-3.0-8b-instruct
Granite-3.0-8B-Instruct is a lightweight and open-source 8B parameter model is designed to excel in instruction following tasks such as summarization, problem-solving, text translation, reasoning, code tasks, function-calling, and more.
Updated 10 months, 4 weeks ago

ibm-granite / granite-3.0-2b-instruct
Granite-3.0-2B-Instruct is a lightweight and open-source 2B parameter model designed to excel in instruction following tasks such as summarization, problem-solving, text translation, reasoning, code tasks, function-calling, and more.
Updated 10 months, 4 weeks ago

justmalhar / reader-lm
Reader-LM is a series of models that convert HTML content to Markdown content
Updated 11 months ago

nousresearch / hermes-2-theta-llama-8b
Hermes-2 Θ (Theta) is the first experimental merged model released by Nous Research, in collaboration with Charles Goddard at Arcee, the team behind MergeKit.
Updated 11 months ago

justmalhar / meta-llama-3.2-3b
Meta Llama 3.2 1B
Updated 11 months, 2 weeks ago

justmalhar / meta-llama-3.2-1b
Meta Llama 3.2 1B
Updated 11 months, 2 weeks ago

lucataco / ollama-qwen2.5-72b
Ollama Qwen2.5 72b
Updated 11 months, 3 weeks ago

aodianyun / minicpm-v-26
Updated 1 year ago

aodianyun / minicpm-v-26-int4
Updated 1 year ago

lucataco / ollama-reflection-70b
Ollama Reflection 70b
Updated 1 year ago

ibm-granite / granite-8b-code-instruct-128k
Join the Granite community where you can find numerous recipe workbooks to help you get started with a wide variety of use cases using this model. https://github.com/ibm-granite-community
Updated 1 year ago

ibm-granite / granite-20b-code-instruct-8k
Join the Granite community where you can find numerous recipe workbooks to help you get started with a wide variety of use cases using this model. https://github.com/ibm-granite-community
Updated 1 year ago

interact-brands / llava-13b-spotter-creator
Fine-tuned LLaVa model for youtube thumbnail classification
Updated 1 year, 1 month ago

google-deepmind / gemma-2-2b-it
Gemma2 2b Instruction-tuned variant by Google
Updated 1 year, 1 month ago

google-deepmind / gemma-2-2b
Gemma2 2b by Google
Updated 1 year, 1 month ago

ydideh810 / cosmo-speak
A chat-bot that specialises in Space/Aeronautics knowledge.
Updated 1 year, 1 month ago

lucataco / moondream2
moondream2 is a small vision language model designed to run efficiently on edge devices
Updated 1 year, 1 month ago

meta / meta-llama-3.1-405b-instruct
Meta's flagship 405 billion parameter language model, fine-tuned for chat completions
Updated 1 year, 1 month ago

deniyes / dolly-v2-12b-demo
dolly-v2-12b, just for testing
Updated 1 year, 1 month ago

microsoft / phi-3-medium-4k-instruct
A 14B parameter, lightweight, state-of-the-art open model trained with the Phi-3 datasets that includes both synthetic data and the filtered publicly available websites data with a focus on high-quality and reasoning dense pro
Updated 1 year, 1 month ago

yorickvp / llava-13b
Visual instruction tuning towards large language and vision models with GPT-4 level capabilities
Updated 1 year, 1 month ago

lucataco / numinamath-7b-tir
NuminaMath is a series of language models that are trained to solve math problems using tool-integrated reasoning (TIR)
Updated 1 year, 2 months ago

lucataco / ollama-llama3-70b
Cog wrapper for Ollama llama3:70b
Updated 1 year, 2 months ago

lucataco / ollama-llama3-8b
Cog wrapper for Ollama llama3:8b
Updated 1 year, 2 months ago

deepseek-ai / deepseek-coder-v2-lite-instruct
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
Updated 1 year, 2 months ago

lucataco / internlm2_5-7b-chat
InternLM2.5 has open-sourced a 7 billion parameter base model and a chat model tailored for practical scenarios.
Updated 1 year, 2 months ago

lorenzomarines / nucleum-nano-30b
Updated 1 year, 2 months ago

microsoft / phi-3-mini-4k-instruct
Phi-3-Mini-4K-Instruct is a 3.8B parameters, lightweight, state-of-the-art open model trained with the Phi-3 datasets
Updated 1 year, 2 months ago

lucataco / qwen2-57b-a14b-instruct
Qwen2 57 billion parameter language model from Alibaba Cloud, fine tuned for chat completions
Updated 1 year, 2 months ago

cuuupid / glm-4v-9b
GLM-4V is a multimodal model released by Tsinghua University that is competitive with GPT-4o and establishes a new SOTA on several benchmarks, including OCR.
Updated 1 year, 2 months ago

lucataco / dolphin-2.9-llama3-8b
Dolphin-2.9 has a variety of instruction, conversational, and coding skills. It also has initial agentic abilities and supports function calling
Updated 1 year, 2 months ago

lucataco / hermes-2-pro-llama-3-70b
Hermes 2 Pro is an updated and cleaned version of the OpenHermes 2.5 Dataset, as well as a newly introduced Function Calling and JSON Mode dataset developed in-house
Updated 1 year, 2 months ago

lucataco / hermes-2-theta-llama-3-8b
Hermes-2 Θ (Theta) is the first experimental merged model released by Nous Research
Updated 1 year, 2 months ago

lucataco / hermes-2-pro-llama-3-8b
Hermes 2 Pro is an updated and cleaned version of the OpenHermes 2.5 Dataset, as well as a newly introduced Function Calling and JSON Mode dataset developed in-house
Updated 1 year, 2 months ago

google-deepmind / gemma2-27b-it
Google's Gemma2 27b instruct model
Updated 1 year, 2 months ago

google-deepmind / gemma2-9b-it
Google's Gemma2 9b instruct model
Updated 1 year, 2 months ago

zsxkib / qwen2-7b-instruct
Qwen 2: A 7 billion parameter language model from Alibaba Cloud, fine tuned for chat completions
Updated 1 year, 2 months ago

zsxkib / qwen2-1.5b-instruct
Qwen 2: A 1.5 billion parameter language model from Alibaba Cloud, fine tuned for chat completions
Updated 1 year, 2 months ago

zsxkib / qwen2-0.5b-instruct
Qwen 2: A 0.5 billion parameter language model from Alibaba Cloud, fine tuned for chat completions
Updated 1 year, 2 months ago

hayooucom / vision-llama3
for test
Updated 1 year, 3 months ago

hayooucom / vision-model
This is phi-3-vision model , cost by time ,have fun~
Updated 1 year, 3 months ago

johnnyoshika / llama2-combine-numbers
Updated 1 year, 3 months ago

lucataco / yi-1.5-6b
Yi-1.5 is continuously pre-trained on Yi with a high-quality corpus of 500B tokens and fine-tuned on 3M diverse fine-tuning samples
Updated 1 year, 4 months ago

mikeei / dolphin-2.9.1-llama3-8b-gguf
Dolphin is uncensored. I have filtered the dataset to remove alignment and bias. This makes the model more compliant.
Updated 1 year, 4 months ago

mikeei / dolphin-2.9-llama3-70b-gguf
Dolphin is uncensored. I have filtered the dataset to remove alignment and bias. This makes the model more compliant.
Updated 1 year, 4 months ago

mikeei / dolphin-2.9-llama3-8b-gguf
Dolphin is uncensored. I have filtered the dataset to remove alignment and bias. This makes the model more compliant.
Updated 1 year, 4 months ago

deepseek-ai / deepseek-67b-base
DeepSeek LLM, an advanced language model comprising 67 billion parameters. Trained from scratch on a vast dataset of 2 trillion tokens in both English and Chinese
Updated 1 year, 4 months ago

lucataco / qwen1.5-110b
Qwen1.5 is the beta version of Qwen2, a transformer-based decoder-only language model pretrained on a large amount of data
Updated 1 year, 4 months ago

hayooucom / llm-60k
llm model ,for CN
Updated 1 year, 4 months ago

microsoft / phi-3-mini-128k-instruct
Phi-3-Mini-128K-Instruct is a 3.8 billion-parameter, lightweight, state-of-the-art open model trained using the Phi-3 datasets
Updated 1 year, 4 months ago

snowflake / snowflake-arctic-instruct
An efficient, intelligent, and truly open-source language model
Updated 1 year, 4 months ago

meta / meta-llama-3-70b
Base version of Llama 3, a 70 billion parameter language model from Meta.
Updated 1 year, 4 months ago

meta / meta-llama-3-70b-instruct
A 70 billion parameter language model from Meta, fine tuned for chat completions
Updated 1 year, 4 months ago

meta / meta-llama-3-8b-instruct
An 8 billion parameter language model from Meta, fine tuned for chat completions
Updated 1 year, 4 months ago

meta / meta-llama-3-8b
Base version of Llama 3, an 8 billion parameter language model from Meta.
Updated 1 year, 4 months ago

camenduru / zephyr-orpo-141b-a35b-v0.1
Mixtral 8x22b v0.1 Zephyr Orpo 141b A35b v0.1
Updated 1 year, 4 months ago

spuuntries / erosumika-7b-v3-0.2-gguf
localfultonextractor's Erosumika 7B Mistral Merge, GGUF Q4_K_S-imat quantized by Lewdiculous.
Updated 1 year, 5 months ago

hikikomori-haven / solar-uncensored
Updated 1 year, 5 months ago

cjwbw / starcoder2-15b
Language Models for Code
Updated 1 year, 5 months ago

martintmv-git / moondream2
small vision language model
Updated 1 year, 5 months ago

deepseek-ai / deepseek-vl-7b-base
DeepSeek-VL: An open-source Vision-Language Model designed for real-world vision and language understanding applications
Updated 1 year, 6 months ago

halevi / sandbox1
Updated 1 year, 6 months ago

ignaciosgithub / pllava
Updated 1 year, 6 months ago

cjwbw / opencodeinterpreter-ds-6.7b
OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement
Updated 1 year, 6 months ago

google-deepmind / gemma-7b
7B base version of Google’s Gemma model
Updated 1 year, 6 months ago

google-deepmind / gemma-2b
2B base version of Google’s Gemma model
Updated 1 year, 6 months ago

google-deepmind / gemma-7b-it
7B instruct version of Google’s Gemma model
Updated 1 year, 6 months ago

google-deepmind / gemma-2b-it
2B instruct version of Google’s Gemma model
Updated 1 year, 6 months ago

spuuntries / miqumaid-v2-2x70b-dpo-gguf
NeverSleep's MiquMaid v2 2x70B Miqu-Mixtral MoE DPO Finetune, GGUF Q2_K quantized by NeverSleep.
Updated 1 year, 6 months ago

deepseek-ai / deepseek-math-7b-instruct
Pushing the Limits of Mathematical Reasoning in Open Language Models - Instruct model
Updated 1 year, 7 months ago

deepseek-ai / deepseek-math-7b-base
Pushing the Limits of Mathematical Reasoning in Open Language Models - Base model
Updated 1 year, 7 months ago

nateraw / defog-sqlcoder-7b-2
A capable large language model for natural language to SQL generation.
Updated 1 year, 7 months ago

lucataco / phixtral-2x2_8
phixtral-2x2_8 is the first Mixure of Experts (MoE) made with two microsoft/phi-2 models, inspired by the mistralai/Mixtral-8x7B-v0.1 architecture
Updated 1 year, 7 months ago

nateraw / sqlcoder-70b-alpha
Updated 1 year, 7 months ago

lucataco / qwen1.5-72b
Qwen1.5 is the beta version of Qwen2, a transformer-based decoder-only language model pretrained on a large amount of data
Updated 1 year, 7 months ago

lucataco / qwen1.5-7b
Qwen1.5 is the beta version of Qwen2, a transformer-based decoder-only language model pretrained on a large amount of data
Updated 1 year, 7 months ago

lucataco / qwen1.5-4b
Qwen1.5 is the beta version of Qwen2, a transformer-based decoder-only language model pretrained on a large amount of data
Updated 1 year, 7 months ago

lucataco / qwen1.5-1.8b
Qwen1.5 is the beta version of Qwen2, a transformer-based decoder-only language model pretrained on a large amount of data
Updated 1 year, 7 months ago

lucataco / qwen1.5-0.5b
Qwen1.5 is the beta version of Qwen2, a transformer-based decoder-only language model pretrained on a large amount of data
Updated 1 year, 7 months ago

spuuntries / miqumaid-v1-70b-gguf
NeverSleep's MiquMaid v1 70B Miqu Finetune, GGUF Q3_K_M quantized by NeverSleep.
Updated 1 year, 7 months ago

adirik / mamba-2.8b
Base version of Mamba 2.8B, a 2.8 billion parameter state space language model
Updated 1 year, 7 months ago

adirik / mamba-130m
Base version of Mamba 130M, a 130 million parameter state space language model
Updated 1 year, 7 months ago

adirik / mamba-370m
Base version of Mamba 370M, a 370 million parameter state space language model
Updated 1 year, 7 months ago

adirik / mamba-790m
Base version of Mamba 790M, a 790 million parameter state space language model
Updated 1 year, 7 months ago

adirik / mamba-2.8b-slimpj
Base version of Mamba 2.8B Slim Pyjama, a 2.8 billion parameter state space language model
Updated 1 year, 7 months ago

adirik / mamba-1.4b
Base version of Mamba 1.4B, a 1.4 billion parameter state space language model
Updated 1 year, 7 months ago

yorickvp / llava-v1.6-vicuna-13b
LLaVA v1.6: Large Language and Vision Assistant (Vicuna-13B)
Updated 1 year, 7 months ago

yorickvp / llava-v1.6-mistral-7b
LLaVA v1.6: Large Language and Vision Assistant (Mistral-7B)
Updated 1 year, 7 months ago

meta / codellama-70b-instruct
A 70 billion parameter Llama tuned for coding and conversation
Updated 1 year, 7 months ago

meta / codellama-70b-python
A 70 billion parameter Llama tuned for coding with Python
Updated 1 year, 7 months ago

spuuntries / flatdolphinmaid-8x7b-gguf
Undi95's FlatDolphinMaid 8x7B Mixtral Merge, GGUF Q5_K_M quantized by TheBloke.
Updated 1 year, 7 months ago

msamogh / iiu-generator-llama2-7b-2
Updated 1 year, 7 months ago

dsingal0 / mixtral-single-gpu
Runs Mixtral 8x7B on a single A40 GPU
Updated 1 year, 7 months ago

lucataco / moondream1
(Research only) Moondream1 is a vision language model that performs on par with models twice its size
Updated 1 year, 7 months ago

spuuntries / borealis-10.7b-dpo-gguf
Undi95's Borealis 10.7B Mistral DPO Finetune, GGUF Q5_K_M quantized by Undi95.
Updated 1 year, 7 months ago

lucataco / wizardcoder-33b-v1.1-gguf
WizardCoder: Empowering Code Large Language Models with Evol-Instruct
Updated 1 year, 7 months ago

kcaverly / neuralbeagle14-7b-gguf
NeuralBeagle14-7B is (probably) the best 7B model you can find!
Updated 1 year, 7 months ago

organisciak / ocsai-llama2-7b
Updated 1 year, 8 months ago

kcaverly / nous-capybara-34b-gguf
A SOTA Nous Research finetune of 200k Yi-34B fine tuned on the Capybara dataset.
Updated 1 year, 8 months ago

meta / codellama-34b-instruct
A 34 billion parameter Llama tuned for coding and conversation
Updated 1 year, 8 months ago

meta / codellama-7b-instruct
A 7 billion parameter Llama tuned for coding and conversation
Updated 1 year, 8 months ago

nateraw / axolotl-llama-2-7b-english-to-hinglish
Updated 1 year, 8 months ago

hamelsmu / honeycomb
Honeycomb NLQ Generator
Updated 1 year, 8 months ago

lucataco / tinyllama-1.1b-chat-v1.0
This is the chat model finetuned on top of TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T
Updated 1 year, 8 months ago

nateraw / nous-hermes-2-solar-10.7b
Nous Hermes 2 - SOLAR 10.7B is the flagship Nous Research model on the SOLAR 10.7B base model..
Updated 1 year, 8 months ago

kcaverly / nous-hermes-2-solar-10.7b-gguf
Nous Hermes 2 - SOLAR 10.7B is the flagship Nous Research model on the SOLAR 10.7B base model.
Updated 1 year, 8 months ago

kcaverly / nous-hermes-2-yi-34b-gguf
Nous Hermes 2 - Yi-34B is a state of the art Yi Fine-tune, fine tuned on GPT-4 generated synthetic data
Updated 1 year, 8 months ago

intentface / poro-34b-gguf-checkpoint
Try out akx/Poro-34B-gguf, Q5_K, This is 1000B checkpoint model
Updated 1 year, 8 months ago

kcaverly / openchat-3.5-1210-gguf
The "Overall Best Performing Open Source 7B Model" for Coding + Generalization or Mathematical Reasoning
Updated 1 year, 8 months ago

lidarbtc / kollava-v1.5
korean version of llava-v1.5
Updated 1 year, 8 months ago

kcaverly / phind-codellama-34b-v2-gguf
A quantized 34B parameter language model from Phind for code completion
Updated 1 year, 9 months ago

kcaverly / nexus-raven-v2-13b-gguf
A quantized 13B parameter language model from NexusFlow for SOTA zero-shot function calling
Updated 1 year, 9 months ago

kcaverly / deepseek-coder-33b-instruct-gguf
A quantized 33B parameter language model from Deepseek for SOTA repository level code completion
Updated 1 year, 9 months ago

rybens92 / una-cybertron-7b-v2--lmtp-8bit
Updated 1 year, 9 months ago

kcaverly / deepseek-coder-6.7b-instruct
A ~7B parameter language model from Deepseek for SOTA repository level code completion
Updated 1 year, 9 months ago

andreasjansson / plasma
Generate plasma shader equations
Updated 1 year, 9 months ago

titocosta / notus-7b-v1
Notus-7b-v1 model
Updated 1 year, 9 months ago

titocosta / starling
Starling-LM-7B-alpha
Updated 1 year, 9 months ago

chigozienri / llava-birds
Updated 1 year, 9 months ago

nateraw / llama-2-7b-samsum
Updated 1 year, 9 months ago

mattt / orca-2-13b
Updated 1 year, 9 months ago

01-ai / yi-34b-chat
The Yi series models are large language models trained from scratch by developers at 01.AI.
Updated 1 year, 9 months ago

01-ai / yi-6b-chat
The Yi series models are large language models trained from scratch by developers at 01.AI.
Updated 1 year, 9 months ago

antoinelyset / openhermes-2-mistral-7b
Simple version of https://huggingface.co/teknium/OpenHermes-2-Mistral-7B
Updated 1 year, 9 months ago

nateraw / goliath-120b
An auto-regressive causal LM created by combining 2x finetuned Llama-2 70B into one.
Updated 1 year, 9 months ago

nateraw / llama-2-7b-paraphrase-v1
Updated 1 year, 9 months ago

01-ai / yi-34b-200k
The Yi series models are large language models trained from scratch by developers at 01.AI.
Updated 1 year, 9 months ago

01-ai / yi-34b
The Yi series models are large language models trained from scratch by developers at 01.AI.
Updated 1 year, 9 months ago

01-ai / yi-6b
The Yi series models are large language models trained from scratch by developers at 01.AI.
Updated 1 year, 9 months ago

nateraw / openchat_3.5-awq
OpenChat: Advancing Open-source Language Models with Mixed-Quality Data
Updated 1 year, 10 months ago

antoinelyset / openhermes-2-mistral-7b-awq
Updated 1 year, 10 months ago

antoinelyset / openhermes-2.5-mistral-7b-awq
Updated 1 year, 10 months ago

antoinelyset / openhermes-2.5-mistral-7b
Updated 1 year, 10 months ago

meta / llama-2-7b-chat
A 7 billion parameter language model from Meta, fine tuned for chat completions
Updated 1 year, 10 months ago

nateraw / zephyr-7b-beta
Zephyr-7B-beta, an LLM trained to act as a helpful assistant.
Updated 1 year, 10 months ago

lucataco / dolphin-2.2.1-mistral-7b
Mistral-7B-v0.1 fine tuned for chat with the Dolphin dataset (an open-source implementation of Microsoft's Orca)
Updated 1 year, 10 months ago

lucataco / dolphin-2.1-mistral-7b
Mistral-7B-v0.1 fine tuned for chat with the Dolphin dataset (an open-source implementation of Microsoft's Orca)
Updated 1 year, 10 months ago

joehoover / falcon-40b-instruct
A 40 billion parameter language model trained to follow human instructions.
Updated 1 year, 10 months ago

peter65374 / openbuddy-llemma-34b-gguf
This is a cog implementation of "openbuddy-llemma-34b" 4-bit quantization model.
Updated 1 year, 10 months ago

nomagick / chatglm3-6b
A 6B parameter open bilingual chat LLM | 开源双语对话语言模型
Updated 1 year, 10 months ago

nomagick / chatglm3-6b-32k
A 6B parameter open bilingual chat LLM (optimized for 8k+ context) | 开源双语对话语言模型
Updated 1 year, 10 months ago

peter65374 / openbuddy-mistral-7b
Openbuddy finetuned mistral-7b in GPTQ quantization in 4bits by TheBloke
Updated 1 year, 10 months ago

nomagick / qwen-vl-chat
Qwen-VL-Chat but with raw ChatML prompt interface and streaming
Updated 1 year, 10 months ago

nateraw / nous-hermes-llama2-awq
TheBloke/Nous-Hermes-Llama2-AWQ served with vLLM
Updated 1 year, 10 months ago

nateraw / mistral-7b-openorca
Mistral-7B-v0.1 fine tuned for chat with the OpenOrca dataset.
Updated 1 year, 11 months ago

joehoover / zephyr-7b-alpha
A high-performing language model trained to act as a helpful assistant
Updated 1 year, 11 months ago

nomagick / qwen-14b-chat
Qwen-14B-Chat is a Transformer-based large language model, which is pretrained on a large volume of data, including web texts, books, codes, etc.
Updated 1 year, 11 months ago

papermoose / llama-pajama
Updated 1 year, 11 months ago

mistralai / mistral-7b-v0.1
A 7 billion parameter language model from Mistral.
Updated 1 year, 11 months ago

meta / codellama-13b
A 13 billion parameter Llama tuned for code completion
Updated 1 year, 11 months ago

nateraw / codellama-34b
Updated 1 year, 11 months ago

nateraw / codellama-13b
Updated 1 year, 11 months ago

nateraw / codellama-7b
Updated 1 year, 11 months ago

nateraw / codellama-13b-instruct
Updated 1 year, 11 months ago

nateraw / codellama-7b-instruct
Updated 1 year, 11 months ago

moinnadeem / codellama-34b-instruct-vllm
Updated 1 year, 11 months ago

moinnadeem / vllm-engine-llama-7b
Updated 1 year, 11 months ago

nateraw / gairmath-abel-7b
Updated 1 year, 11 months ago

andreasjansson / wizardcoder-python-34b-v1-gguf
WizardCoder-python-34B-v1.0 with support for grammars and jsonschema
Updated 1 year, 11 months ago

andreasjansson / codellama-34b-instruct-gguf
CodeLlama-34B-instruct with support for grammars and jsonschema
Updated 1 year, 11 months ago

xrunda / med
Updated 1 year, 11 months ago

andreasjansson / llama-2-13b-gguf
Llama-2 13B with support for grammars and jsonschema
Updated 1 year, 11 months ago

nateraw / codellama-7b-instruct-hf
Updated 1 year, 11 months ago

nateraw / aidc-ai-business-marcoroni-13b
Updated 1 year, 11 months ago

nwhitehead / llama2-7b-chat-gptq
Updated 1 year, 11 months ago

joehoover / sql-generator
Updated 1 year, 11 months ago

meta / llama-2-70b-chat
A 70 billion parameter language model from Meta, fine tuned for chat completions
Updated 2 years ago

meta / llama-2-70b
Base version of Llama 2, a 70 billion parameter language model from Meta.
Updated 2 years ago

meta / llama-2-13b-chat
A 13 billion parameter language model from Meta, fine tuned for chat completions
Updated 2 years ago

meta / llama-2-13b
Base version of Llama 2 13B, a 13 billion parameter language model
Updated 2 years ago

meta / llama-2-7b
Base version of Llama 2 7B, a 7 billion parameter language model
Updated 2 years ago

fofr / neuromancer-13b
llama-13b-base fine-tuned on Neuromancer style
Updated 2 years ago

cbh123 / dylan-lyrics
Llama 2 13B fine-tuned on Bob Dylan lyrics
Updated 2 years ago

fofr / prompt-classifier
Determines the toxicity of text to image prompts, llama-13b fine-tune. [SAFETY_RANKING] between 0 (safe) and 10 (toxic)
Updated 2 years ago

niron1 / llama-2-7b-chat
LLAMA-2 7b chat version by Meta. Stream support. Unaltered prompt. Temperature working properly. Economical hardware.
Updated 2 years ago

meta / codellama-13b-instruct
A 13 billion parameter Llama tuned for coding and conversation
Updated 2 years ago

meta / codellama-13b-python
A 13 billion parameter Llama tuned for coding with Python
Updated 2 years ago

meta / codellama-7b-python
A 7 billion parameter Llama tuned for coding with Python
Updated 2 years ago

meta / codellama-7b
A 7 billion parameter Llama tuned for coding and conversation
Updated 2 years ago

nateraw / samsum-llama-7b
llama-2-7b fine-tuned on the samsum dataset for dialogue summarization
Updated 2 years ago

uwulewd / airoboros-llama-2-70b
Inference Airoboros L2 70B 2.1 - GPTQ using ExLlama.
Updated 2 years ago

cbh123 / samsum
Updated 2 years ago

fofr / llama2-prompter
Llama2 13b base model fine-tuned on text to image prompts
Updated 2 years ago

nateraw / samsum-llama-2-13b
Updated 2 years ago

cbh123 / homerbot
Updated 2 years ago

divyavanmahajan / my-pet-llama
Updated 2 years ago

zallesov / super-real-llama2
Updated 2 years ago

nateraw / wizardcoder-python-34b-v1.0
Updated 2 years ago

seanoliver / bob-dylan-fun-tuning
Llama fine-tune-athon project training llama2 on bob dylan lyrics.
Updated 2 years ago

meta / codellama-34b
A 34 billion parameter Llama tuned for coding and conversation
Updated 2 years ago

gregwdata / defog-sqlcoder-q8
Defog's SQLCoder is a state-of-the-art LLM for converting natural language questions to SQL queries. SQLCoder is a 15B parameter fine-tuned on a base StarCoder model.
Updated 2 years ago

nateraw / llama-2-7b-chat-hf
Updated 2 years ago

niron1 / openorca-platypus2-13b
OpenOrca-Platypus2-13B is a merge of garage-bAInd/Platypus2-13B and Open-Orca/OpenOrcaxOpenChat-Preview2-13B.
Updated 2 years ago

niron1 / qwen-7b-chat
Qwen-7B is the 7B-parameter version of the large language model series, Qwen (abbr. Tongyi Qianwen), proposed by Aibaba Cloud. Qwen-7B`is a Transformer-based large language model, which is pretrained on a large volume of data, including web texts, books,
Updated 2 years ago

crowdy / line-lang-3.6b
an implementation of 3.6b Japanese large language model
Updated 2 years ago

charles-dyfis-net / llama-2-13b-hf--lmtp-8bit
Updated 2 years ago

charles-dyfis-net / llama-2-13b-hf--lmtp-4bit
Updated 2 years ago

charles-dyfis-net / llama-2-13b-hf--lmtp
Updated 2 years ago

charles-dyfis-net / llama-2-7b-hf--lmtp-4bit
Updated 2 years ago

nateraw / stablecode-completion-alpha-3b-4k
Updated 2 years, 1 month ago

glavin001 / exllama-airoboros-7b-gpt4-1.4-gptq
Test out fast inference with ExLlama and 4bit quantization!
Updated 2 years, 2 months ago

nomagick / chatglm2-6b
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
Updated 2 years, 2 months ago

nomagick / chatglm2-6b-int4
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型 (int4)
Updated 2 years, 2 months ago

stability-ai / stablelm-base-alpha-3b
3B parameter base version of Stability AI's language model
Updated 2 years, 4 months ago

fofr / image-prompts
Generate image prompts for Midjourney. Prefix inputs with "Image: "
Updated 2 years, 4 months ago

fofr / star-trek-adventure
Updated 2 years, 4 months ago

replicate / flan-t5-xl
A language model by Google for tasks like classification, summarization, and more
Updated 2 years, 4 months ago

fofr / star-trek-llama
llama-7b trained on the Memory Alpha Star Trek Wiki
Updated 2 years, 5 months ago

fofr / star-trek-gpt-j-6b
gpt-j-6b trained on the Memory Alpha Star Trek Wiki
Updated 2 years, 5 months ago

fofr / star-trek-flan
flan-t5-xl trained on the Memory Alpha Star Trek Wiki
Updated 2 years, 5 months ago

replicate / gpt-j-6b
A large language model by EleutherAI
Updated 2 years, 5 months ago

daanelson / flan-t5-large
A language model for tasks like classification, summarization, and more.
Updated 2 years, 5 months ago