tomasmcm / neural-chat-7b-v3-1

Source: Intel/neural-chat-7b-v3-1 ✦ Quant: TheBloke/neural-chat-7B-v3-1-AWQ ✦ Fine-tuned model based on mistralai/Mistral-7B-v0.1

Demo API Examples README Versions (acb45049)

Run time and cost

This model runs on Nvidia A40 GPU hardware. Predictions typically complete within 3 seconds. The predict time for this model varies significantly based on the inputs.