(Updated 1 year, 10 months ago)
Runs Mixtral 8x7B on a single A40 GPU
This model is cold. You'll get a fast response if the model is warm and already running, and a slower response if the model is cold and starting up.