Readme
Read the official model card here: https://huggingface.co/TheBloke/wizard-mega-13B-AWQ
Thank you to TheBloke for sharing this model!
This is wizard-mega-13b, quantized with AWQ and served with vLLM.
This model runs on Nvidia A40 (Large) GPU hardware. Predictions typically complete within 3 seconds.
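To try the same AWQ checkpoint locally, the following is a minimal sketch using vLLM's offline Python API. It assumes a CUDA-capable GPU with enough memory and that `vllm` is installed. The helper names `engine_kwargs` and `generate` and the sampling values are illustrative and do not come from the model card. This is not necessarily how the hosted deployment is configured.

```python
# Repository named in the model card link above.
MODEL_ID = "TheBloke/wizard-mega-13B-AWQ"


def engine_kwargs(model_id: str = MODEL_ID) -> dict:
    """Keyword arguments for vllm.LLM when loading an AWQ-quantized checkpoint."""
    return {"model": model_id, "quantization": "awq"}


def generate(prompt: str, max_tokens: int = 128) -> str:
    """Run a single completion locally (hypothetical helper).

    Requires a CUDA GPU and `pip install vllm`.
    """
    # Imported lazily so this module can be loaded without vLLM installed.
    from vllm import LLM, SamplingParams

    llm = LLM(**engine_kwargs())
    # Sampling values are illustrative assumptions, not recommended settings.
    params = SamplingParams(temperature=0.7, max_tokens=max_tokens)
    outputs = llm.generate([prompt], params)
    # One prompt in, so take the first request and its first candidate.
    return outputs[0].outputs[0].text
```

On a machine with a suitable GPU, calling `generate("Explain AWQ quantization in one sentence.")` returns the completion text. Check the model card for the prompt template the model expects before relying on the output.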