Readme
This model doesn't have a readme.
Llama-2 13B with support for grammars and jsonschema
This model runs on Nvidia A40 (Large) GPU hardware. Predictions typically complete within 6 seconds. The predict time for this model varies significantly based on the inputs.
This model doesn't have a readme.