Examples – glavin001/exllama-airoboros-7b-gpt4-1.4-gptq | Replicate

Test out fast inference with ExLlama and 4bit quantization!

Public

1.7K runs

Playground API Examples README Versions

Want to make some of these yourself?

Run this model