glavin001/exllama-airoboros-7b-gpt4-1.4-gptq

Test out fast inference with ExLlama and 4bit quantization!

Public
1.7K runs

Want to make some of these yourself?

Run this model