Failed to load versions. Head to the versions page to see all versions for this model.
You're looking at a specific version of this model. Jump to the model overview.
kcaverly /openchat-3.5-1210-gguf:0d142640
Input
Run this model in Node.js with one line of code:
npm install replicate
REPLICATE_API_TOKEN
environment variableexport REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
import Replicate from "replicate";
const replicate = new Replicate({
auth: process.env.REPLICATE_API_TOKEN,
});
Run kcaverly/openchat-3.5-1210-gguf using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run(
"kcaverly/openchat-3.5-1210-gguf:0d1426400ae23540eef130c0cd6cbd7184ac47cffee9dfd16fdf02d065df123b",
{
input: {
prompt: "Sally (a girl) has 3 brothers. Each brother has 2 sisters. How many sisters does Sally have?",
temperature: 0.7,
max_new_tokens: -1,
repeat_penalty: 1.1,
prompt_template: "GPT Correct User: {prompt}<|end_of_turn|>GPT4 Correct Assistant: "
}
}
);
console.log(output);
To learn more, take a look at the guide on getting started with Node.js.
pip install replicate
REPLICATE_API_TOKEN
environment variableexport REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
import replicate
Run kcaverly/openchat-3.5-1210-gguf using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run(
"kcaverly/openchat-3.5-1210-gguf:0d1426400ae23540eef130c0cd6cbd7184ac47cffee9dfd16fdf02d065df123b",
input={
"prompt": "Sally (a girl) has 3 brothers. Each brother has 2 sisters. How many sisters does Sally have?",
"temperature": 0.7,
"max_new_tokens": -1,
"repeat_penalty": 1.1,
"prompt_template": "GPT Correct User: {prompt}<|end_of_turn|>GPT4 Correct Assistant: "
}
)
# The kcaverly/openchat-3.5-1210-gguf model can stream output as it's running.
# The predict method returns an iterator, and you can iterate over that output.
for item in output:
# https://replicate.com/kcaverly/openchat-3.5-1210-gguf/api#output-schema
print(item, end="")
To learn more, take a look at the guide on getting started with Python.
REPLICATE_API_TOKEN
environment variableexport REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Run kcaverly/openchat-3.5-1210-gguf using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \
-H "Authorization: Bearer $REPLICATE_API_TOKEN" \
-H "Content-Type: application/json" \
-H "Prefer: wait" \
-d $'{
"version": "0d1426400ae23540eef130c0cd6cbd7184ac47cffee9dfd16fdf02d065df123b",
"input": {
"prompt": "Sally (a girl) has 3 brothers. Each brother has 2 sisters. How many sisters does Sally have?",
"temperature": 0.7,
"max_new_tokens": -1,
"repeat_penalty": 1.1,
"prompt_template": "GPT Correct User: {prompt}<|end_of_turn|>GPT4 Correct Assistant: "
}
}' \
https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
Add a payment method to run this model.
By signing in, you agree to our
terms of service and privacy policy
Output
{
"completed_at": "2023-12-19T04:30:46.777184Z",
"created_at": "2023-12-19T04:29:21.632749Z",
"data_removed": false,
"error": null,
"id": "ty6uaftbtcbfqavo6yydnpw3py",
"input": {
"prompt": "Sally (a girl) has 3 brothers. Each brother has 2 sisters. How many sisters does Sally have?",
"temperature": 0.7,
"max_new_tokens": -1,
"repeat_penalty": 1.1,
"prompt_template": "GPT Correct User: {prompt}<|end_of_turn|>GPT4 Correct Assistant: "
},
"logs": "llama_print_timings: load time = 155.21 ms\nllama_print_timings: sample time = 5.26 ms / 38 runs ( 0.14 ms per token, 7218.84 tokens per second)\nllama_print_timings: prompt eval time = 155.11 ms / 40 tokens ( 3.88 ms per token, 257.88 tokens per second)\nllama_print_timings: eval time = 570.10 ms / 37 runs ( 15.41 ms per token, 64.90 tokens per second)\nllama_print_timings: total time = 794.99 ms",
"metrics": {
"predict_time": 0.859201,
"total_time": 85.144435
},
"output": [
"\n",
"Since",
" Sally",
" is",
" the",
" only",
" girl",
" in",
" her",
" family",
",",
" she",
" must",
" be",
" considered",
" as",
" one",
" of",
" the",
" \"",
"s",
"ister",
"s",
"\"",
" mentioned",
".",
" Therefore",
",",
" Sally",
" has",
" ",
"2",
" sisters",
" (",
"including",
" herself",
").",
""
],
"started_at": "2023-12-19T04:30:45.917983Z",
"status": "succeeded",
"urls": {
"stream": "https://streaming-api.svc.us.c.replicate.net/v1/predictions/ty6uaftbtcbfqavo6yydnpw3py",
"get": "https://api.replicate.com/v1/predictions/ty6uaftbtcbfqavo6yydnpw3py",
"cancel": "https://api.replicate.com/v1/predictions/ty6uaftbtcbfqavo6yydnpw3py/cancel"
},
"version": "0d1426400ae23540eef130c0cd6cbd7184ac47cffee9dfd16fdf02d065df123b"
}
llama_print_timings: load time = 155.21 ms
llama_print_timings: sample time = 5.26 ms / 38 runs ( 0.14 ms per token, 7218.84 tokens per second)
llama_print_timings: prompt eval time = 155.11 ms / 40 tokens ( 3.88 ms per token, 257.88 tokens per second)
llama_print_timings: eval time = 570.10 ms / 37 runs ( 15.41 ms per token, 64.90 tokens per second)
llama_print_timings: total time = 794.99 ms