nateraw / goliath-120b

An autoregressive causal language model created by merging two fine-tuned Llama-2 70B models into one.

  • Public
  • 175.8K runs

Run time and cost

This model runs on NVIDIA A100 (80 GB) GPU hardware. Predictions typically complete within 8 seconds.

Readme

See the full model card here

This deployment runs the AWQ-quantized version of the model, which can be found here.
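As a rough illustration of why a quantized build may be used on a single 80 GB GPU, the sketch below estimates the memory needed for the weights alone. The parameter count (about 120 billion, inferred from the model name) and the 4-bit weight width (typical for AWQ, but not stated on this page) are assumptions, and the helper `weight_memory_gb` is a hypothetical name introduced only for this example. The estimate ignores the KV cache, activations, and quantization metadata such as scales and zero points, so real usage will be higher.

```python
def weight_memory_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate memory for model weights only, in GB (1 GB = 1e9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


# Assumptions (not confirmed by this page):
N_PARAMS = 120e9        # ~120B parameters, inferred from the model name
AWQ_BITS = 4            # AWQ is commonly applied at 4 bits per weight
GPU_MEMORY_GB = 80      # A100 (80 GB), as stated in "Run time and cost"

fp16_gb = weight_memory_gb(N_PARAMS, 16)
awq_gb = weight_memory_gb(N_PARAMS, AWQ_BITS)

print(f"fp16 weights: ~{fp16_gb:.0f} GB, fits on one GPU: {fp16_gb < GPU_MEMORY_GB}")
print(f"4-bit weights: ~{awq_gb:.0f} GB, fits on one GPU: {awq_gb < GPU_MEMORY_GB}")
```

Under these assumptions the 16-bit weights would not fit on one A100 (80 GB), while the 4-bit weights leave some headroom for the cache and activations.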