nateraw/nous-hermes-llama2-awq – Run with an API on Replicate

TheBloke/Nous-Hermes-Llama2-AWQ served with vLLM

This model runs on Nvidia L40S GPU hardware. We don't yet have enough runs of this model to provide performance information.

See the official model card for more information about this model: https://huggingface.co/TheBloke/Nous-Hermes-Llama2-AWQ

Thank you to TheBloke for sharing this model!