Readme
This model doesn't have a readme.
Run this model in Node.js with one line of code:
npm install replicate
REPLICATE_API_TOKEN
environment variable:export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
import Replicate from "replicate";
const replicate = new Replicate({
auth: process.env.REPLICATE_API_TOKEN,
});
Run technillogue/llama-2-7b-mlc-nix using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run(
"technillogue/llama-2-7b-mlc-nix:17afdccdc015160912760cdc3ab417d35df91da7a291f2232d52680cd3b78d98",
{
input: {
debug: false,
top_p: 0.95,
temperature: 0.7,
max_new_tokens: 128,
min_new_tokens: -1,
repetition_penalty: 1.15
}
}
);
console.log(output);
To learn more, take a look at the guide on getting started with Node.js.
No output yet! Press "Submit" to start a prediction.
This model runs on Nvidia A100 (80GB) GPU hardware. We don't yet have enough runs of this model to provide performance information.
This model doesn't have a readme.