meta / llama-2-70b

Base version of Llama 2, a 70 billion parameter language model from Meta.

Run time and cost

Predictions run on Nvidia A100 (80GB) GPU hardware. Predictions typically complete within 6 seconds.

Llama 2 is a collection of pretrained and fine-tuned generative text models ranging in scale from 7 billion to 70 billion parameters. This is the repository for the 70 billion parameter base model, which has not been fine-tuned.

Learn more about running Llama 2 with an API and about the differences between the Llama 2 models.
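As a rough illustration of calling this model through an API, the sketch below uses the `replicate` Python client. It makes several assumptions that are not stated on this page: that the client is installed, that a `REPLICATE_API_TOKEN` environment variable is set, that the model can be addressed as `meta/llama-2-70b` without a pinned version, and that the input accepts `max_new_tokens` and `temperature` alongside `prompt`. The helpers `build_input` and `complete` are hypothetical names introduced here for illustration only.

```python
import os


def build_input(prompt: str, max_new_tokens: int = 64, temperature: float = 0.7) -> dict:
    """Assemble the input payload for a prediction.

    `max_new_tokens` and `temperature` are assumed parameter names;
    check the model's input schema before relying on them.
    """
    if not prompt:
        raise ValueError("prompt must be a non-empty string")
    return {
        "prompt": prompt,
        "max_new_tokens": max_new_tokens,
        "temperature": temperature,
    }


def complete(prompt: str, **options) -> str:
    """Run one prediction and return the generated text as a single string."""
    # Imported lazily so build_input works even without the client installed.
    import replicate

    # Assumption: the model is addressable by this name without a version hash.
    output = replicate.run("meta/llama-2-70b", input=build_input(prompt, **options))
    # Text models typically yield the output as a sequence of string chunks.
    return "".join(str(chunk) for chunk in output)


if __name__ == "__main__":
    if os.environ.get("REPLICATE_API_TOKEN"):
        # A base model continues text, so the prompt is written as an
        # unfinished passage rather than as a chat-style instruction.
        print(complete("The three primary colors are"))
    else:
        print("Set REPLICATE_API_TOKEN to run a prediction.")
```

Because this is the base model and has not been fine-tuned, it tends to work best with prompts phrased as the beginning of a document to be continued, not as questions or instructions.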

For more information about the model, licensing, and acceptable use, please see Meta's official Llama 2 resources.