Readme
This model doesn't have a readme.
Run this model in Node.js with one line of code:
npm install replicate
REPLICATE_API_TOKEN
environment variable:export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
import Replicate from "replicate";
const replicate = new Replicate({
auth: process.env.REPLICATE_API_TOKEN,
});
Run moinnadeem/fasterllama using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run(
"moinnadeem/fasterllama:4eae7c9b8ff26bb88844aa494b7318b46eeee6187f66bad7217c95f05a1466bf",
{
input: {
top_p: 1,
max_length: 500,
temperature: 0.75,
repetition_penalty: 1
}
}
);
console.log(output);
To learn more, take a look at the guide on getting started with Node.js.
No output yet! Press "Submit" to start a prediction.
This model runs on Nvidia A100 (80GB) GPU hardware. We don't yet have enough runs of this model to provide performance information.
This model doesn't have a readme.