charles-dyfis-net / llama-2-7b-hf--lmtp-8bit

  • Public
  • 0 runs
  • L40S
Iterate in playground

Input

Set the REPLICATE_API_TOKEN environment variable:
export REPLICATE_API_TOKEN=<paste-your-token-here>

Find your API token in your account settings.

Run charles-dyfis-net/llama-2-7b-hf--lmtp-8bit using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.

curl -s -X POST \
  -H "Authorization: Bearer $REPLICATE_API_TOKEN" \
  -H "Content-Type: application/json" \
  -H "Prefer: wait" \
  -d $'{
    "version": "charles-dyfis-net/llama-2-7b-hf--lmtp-8bit:4d752beccdc9a6de67229d983598ea62833caeef855d3de18d563aa2f111dd3c",
    "input": {}
  }' \
  https://api.replicate.com/v1/predictions

To learn more, take a look at Replicate’s HTTP API reference docs.

Output

No output yet! Press "Submit" to start a prediction.

Run time and cost

This model runs on Nvidia L40S GPU hardware. We don't yet have enough runs of this model to provide performance information.

Readme

This model doesn't have a readme.