glavin001/exllama-airoboros-7b-gpt4-1.4-gptq
Test out fast inference with ExLlama and 4-bit quantization!
Public, 1.7K runs