tomasmcm / neural-chat-7b-v3-1
Source: Intel/neural-chat-7b-v3-1 ✦ Quant: TheBloke/neural-chat-7B-v3-1-AWQ ✦ Fine-tuned model based on mistralai/Mistral-7B-v0.1
Run time and cost
This model runs on Nvidia A40 GPU hardware. Predictions typically complete within 3 seconds, though the predict time varies significantly with the inputs.