Failed to load versions. Head to the versions page to see all versions for this model.
You're looking at a specific version of this model. Jump to the model overview.
nomagick /chatglm2-6b-int4:ea3715b3
Input
Run this model in Node.js with one line of code:
npm install replicate
REPLICATE_API_TOKEN
environment variable:export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
import Replicate from "replicate";
const replicate = new Replicate({
auth: process.env.REPLICATE_API_TOKEN,
});
Run nomagick/chatglm2-6b-int4 using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run(
"nomagick/chatglm2-6b-int4:ea3715b3c4561f1e7c2a7db5873cf9831a7a6c56a6910f7276d17e56b08ef4a9",
{
input: {
top_p: 0.8,
prompt: "[Round 1]\n\n问:请使用英文重复这段话:\"为了使模型生成最优输出,当使用 ChatGLM2-6B 时需要使用特定的输入格式,请按照示例格式组织输入。\"\n\n答:",
max_tokens: 2048,
temperature: 0.75
}
}
);
console.log(output);
To learn more, take a look at the guide on getting started with Node.js.
pip install replicate
REPLICATE_API_TOKEN
environment variable:export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
import replicate
Run nomagick/chatglm2-6b-int4 using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run(
"nomagick/chatglm2-6b-int4:ea3715b3c4561f1e7c2a7db5873cf9831a7a6c56a6910f7276d17e56b08ef4a9",
input={
"top_p": 0.8,
"prompt": "[Round 1]\n\n问:请使用英文重复这段话:\"为了使模型生成最优输出,当使用 ChatGLM2-6B 时需要使用特定的输入格式,请按照示例格式组织输入。\"\n\n答:",
"max_tokens": 2048,
"temperature": 0.75
}
)
# The nomagick/chatglm2-6b-int4 model can stream output as it's running.
# The predict method returns an iterator, and you can iterate over that output.
for item in output:
# https://replicate.com/nomagick/chatglm2-6b-int4/api#output-schema
print(item, end="")
To learn more, take a look at the guide on getting started with Python.
REPLICATE_API_TOKEN
environment variable:export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Run nomagick/chatglm2-6b-int4 using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \
-H "Authorization: Bearer $REPLICATE_API_TOKEN" \
-H "Content-Type: application/json" \
-H "Prefer: wait" \
-d $'{
"version": "ea3715b3c4561f1e7c2a7db5873cf9831a7a6c56a6910f7276d17e56b08ef4a9",
"input": {
"top_p": 0.8,
"prompt": "[Round 1]\\n\\n问:请使用英文重复这段话:\\"为了使模型生成最优输出,当使用 ChatGLM2-6B 时需要使用特定的输入格式,请按照示例格式组织输入。\\"\\n\\n答:",
"max_tokens": 2048,
"temperature": 0.75
}
}' \
https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
Add a payment method to run this model.
By signing in, you agree to our
terms of service and privacy policy
Output
{
"completed_at": "2023-07-12T15:52:14.515793Z",
"created_at": "2023-07-12T15:52:08.478189Z",
"data_removed": false,
"error": null,
"id": "f4wseybbl6gqhkhfeoubln3bhe",
"input": {
"top_p": 0.8,
"prompt": "[Round 1]\n\n问:请使用英文重复这段话:\"为了使模型生成最优输出,当使用 ChatGLM2-6B 时需要使用特定的输入格式,请按照示例格式组织输入。\"\n\n答:",
"max_tokens": 2048,
"temperature": 0.75
},
"logs": null,
"metrics": {
"predict_time": 6.06164,
"total_time": 6.037604
},
"output": [
"To",
" achieve",
" the",
" best",
" output",
" from",
" the",
" model",
",",
" when",
" using",
" Chat",
"GL",
"M",
"2",
"-",
"6",
"B",
",",
" specific",
" input",
" format",
"ting",
" is",
" required",
".",
" Please",
" organize",
" the",
" input",
" according",
" to",
" the",
" example",
" format",
".",
""
],
"started_at": "2023-07-12T15:52:08.454153Z",
"status": "succeeded",
"urls": {
"get": "https://api.replicate.com/v1/predictions/f4wseybbl6gqhkhfeoubln3bhe",
"cancel": "https://api.replicate.com/v1/predictions/f4wseybbl6gqhkhfeoubln3bhe/cancel"
},
"version": "ea3715b3c4561f1e7c2a7db5873cf9831a7a6c56a6910f7276d17e56b08ef4a9"
}