Official

meta / llama-2-70b

Base version of Llama 2, a 70 billion parameter language model from Meta.

  • Public
  • 351.8K runs

Pricing

Official model
Official models are priced differently from other models. Instead of being billed by run time, you’re billed by input and output, which makes pricing more predictable.

This language model is priced by the number of input tokens sent and the number of output tokens generated.
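As a rough illustration of per-token billing, the sketch below estimates the cost of one request from its token counts. The function name `estimate_cost`, the per-million-token unit, and the rates used in the example are all assumptions made for illustration; this page does not list actual rates, so substitute the published ones before relying on the result.

```python
from decimal import Decimal


def estimate_cost(
    input_tokens: int,
    output_tokens: int,
    input_price_per_million: Decimal,
    output_price_per_million: Decimal,
) -> Decimal:
    """Estimate the cost of one request under per-token pricing.

    Prices are expressed per one million tokens (an assumed unit),
    with separate rates for input and output tokens.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    million = Decimal(1_000_000)
    input_cost = Decimal(input_tokens) * input_price_per_million
    output_cost = Decimal(output_tokens) * output_price_per_million
    return (input_cost + output_cost) / million


# Placeholder rates, NOT the real prices for this model.
example_cost = estimate_cost(
    input_tokens=1200,
    output_tokens=300,
    input_price_per_million=Decimal("1.00"),
    output_price_per_million=Decimal("2.00"),
)
print(f"Estimated cost: {example_cost}")
```

Because the cost depends only on token counts and not on how long the model takes to run, the same prompt and response length always cost the same, which is what makes this scheme more predictable than time-based billing.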

Check out our docs for more information about how per-token pricing works on Replicate.