lucataco / llama-2-13b-chat

Meta's Llama 2 13b Chat - GPTQ

Demo API Examples README Versions (18f253bf)

Run time and cost

This model runs on Nvidia A40 (Large) GPU hardware. Predictions typically complete within 7 seconds.