Readme
This model doesn't have a readme.
Set the REPLICATE_API_TOKEN environment variable:
export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Run moinnadeem/llama-2-7b-mlc using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \
  -H "Authorization: Bearer $REPLICATE_API_TOKEN" \
  -H "Content-Type: application/json" \
  -H "Prefer: wait" \
  -d $'{
    "version": "moinnadeem/llama-2-7b-mlc:1872efa73aedb56644f6c0649d5d8d0fe4fa1c65979806f3c987935c2e14ab97",
    "input": {
      "debug": false,
      "top_k": 0,
      "top_p": 0.95,
      "temperature": 0.7,
      "return_logits": false,
      "max_new_tokens": 128,
      "min_new_tokens": -1
    }
  }' \
  https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
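The same request can also be sketched in Python using only the standard library. This is a minimal illustration, not an official client: the helper names build_request and run_prediction and the DEFAULT_INPUT constant are hypothetical, and the input values simply mirror the curl example. The response is returned as parsed JSON without assuming any particular fields.

```python
import json
import urllib.request

API_URL = "https://api.replicate.com/v1/predictions"
MODEL_VERSION = (
    "moinnadeem/llama-2-7b-mlc:"
    "1872efa73aedb56644f6c0649d5d8d0fe4fa1c65979806f3c987935c2e14ab97"
)

# Input values copied from the curl example; adjust them per the model's schema.
DEFAULT_INPUT = {
    "debug": False,
    "top_k": 0,
    "top_p": 0.95,
    "temperature": 0.7,
    "return_logits": False,
    "max_new_tokens": 128,
    "min_new_tokens": -1,
}


def build_request(token, model_input):
    """Build (but do not send) the POST request shown in the curl example."""
    body = json.dumps({"version": MODEL_VERSION, "input": model_input})
    return urllib.request.Request(
        API_URL,
        data=body.encode("utf-8"),
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
            "Prefer": "wait",
        },
    )


def run_prediction(token, model_input, timeout=120):
    """Send the request and return the parsed JSON response body."""
    request = build_request(token, model_input)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

With the token exported as above, calling run_prediction(os.environ["REPLICATE_API_TOKEN"], DEFAULT_INPUT) (after import os) would send the request. Splitting construction from sending keeps the headers and payload easy to inspect without making a network call.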
This model runs on Nvidia A100 (80GB) GPU hardware. We don't yet have enough runs of this model to provide performance information.