typetext
{
"image": "https://raw.githubusercontent.com/zsxkib/cog-kimi-vl-a3b-thinking/main/images/demo1.jpeg",
"max_length_tokens": 2048,
"prompt": "Where am I?",
"temperature": 0.6,
"top_p": 1
}npm install replicate
REPLICATE_API_TOKEN environment variable:export REPLICATE_API_TOKEN=r8_JHk**********************************
This is your API token. Keep it to yourself.
import Replicate from "replicate";
const replicate = new Replicate({
auth: process.env.REPLICATE_API_TOKEN,
});
Run zsxkib/kimi-vl-a3b-thinking using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run(
"zsxkib/kimi-vl-a3b-thinking:c44e6d137b421d27afa9bf4f2360833d4f05a697b549356d44c3a56760a3164d",
{
input: {
image: "https://raw.githubusercontent.com/zsxkib/cog-kimi-vl-a3b-thinking/main/images/demo1.jpeg",
max_length_tokens: 2048,
prompt: "Where am I?",
temperature: 0.6,
top_p: 1
}
}
);
console.log(output);
To learn more, take a look at the guide on getting started with Node.js.
pip install replicate
REPLICATE_API_TOKEN environment variable:export REPLICATE_API_TOKEN=r8_JHk**********************************
This is your API token. Keep it to yourself.
import replicate
Run zsxkib/kimi-vl-a3b-thinking using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run(
"zsxkib/kimi-vl-a3b-thinking:c44e6d137b421d27afa9bf4f2360833d4f05a697b549356d44c3a56760a3164d",
input={
"image": "https://raw.githubusercontent.com/zsxkib/cog-kimi-vl-a3b-thinking/main/images/demo1.jpeg",
"max_length_tokens": 2048,
"prompt": "Where am I?",
"temperature": 0.6,
"top_p": 1
}
)
# The zsxkib/kimi-vl-a3b-thinking model can stream output as it's running.
# The predict method returns an iterator, and you can iterate over that output.
for item in output:
# https://replicate.com/zsxkib/kimi-vl-a3b-thinking/api#output-schema
print(item, end="")
To learn more, take a look at the guide on getting started with Python.
REPLICATE_API_TOKEN environment variable:export REPLICATE_API_TOKEN=r8_JHk**********************************
This is your API token. Keep it to yourself.
Run zsxkib/kimi-vl-a3b-thinking using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \
-H "Authorization: Bearer $REPLICATE_API_TOKEN" \
-H "Content-Type: application/json" \
-H "Prefer: wait" \
-d $'{
"version": "zsxkib/kimi-vl-a3b-thinking:c44e6d137b421d27afa9bf4f2360833d4f05a697b549356d44c3a56760a3164d",
"input": {
"image": "https://raw.githubusercontent.com/zsxkib/cog-kimi-vl-a3b-thinking/main/images/demo1.jpeg",
"max_length_tokens": 2048,
"prompt": "Where am I?",
"temperature": 0.6,
"top_p": 1
}
}' \
https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
◁think▷Okay, let's see. The user is asking "Where am I?" and provided an image of a cyberpunk bar scene. The image has a lot of neon lights, futuristic elements, and people sitting at a bar. There's a sign that says "The Heist" and some text about sitting next to Jackie. Also, there's a mini-map in the corner, which might indicate it's from a video game. First, I need to figure out which game this is. The cyberpunk aesthetic with the bar setting and the mini-map suggests it could be Cyberpunk 2077. The Heist as a location is a known area in that game. The characters' outfits and the overall environment match the game's style. The user might be in-game, so the answer should confirm that they're in Cyberpunk 2077, specifically at The Heist bar. I should mention the game's title and the location's name. Also, note the HUD elements as part of the game's interface. Make sure to keep it concise but informative.◁/think▷You are in **Cyberpunk 2077**, a role-playing video game set in a dystopian future version of Night City. The image shows you inside **The Heist**, a bar or nightclub popular in the game. The cyberpunk aesthetic, neon lighting, and HUD elements (mini-map, objectives like "Sit next to Jackie") align with the game's signature style. This location is known for its vibrant atmosphere and interactions with characters like Jackie Welles, a key figure in the early story.<|im_end|>
{
"id": "0b9g4m3fxxrme0cphf2ajakdq0",
"model": "zsxkib/kimi-vl-a3b-thinking",
"version": "c44e6d137b421d27afa9bf4f2360833d4f05a697b549356d44c3a56760a3164d",
"input": {
"image": "https://raw.githubusercontent.com/zsxkib/cog-kimi-vl-a3b-thinking/main/images/demo1.jpeg",
"max_length_tokens": 2048,
"prompt": "Where am I?",
"temperature": 0.6,
"top_p": 1
},
"logs": "Loaded image: /tmp/tmplw0gw75ldemo1.jpeg\nImage token added to prompt and image object included.\nProcessing conversation and image (if any)...\nStop words: ['<|im_end|>']\nStarting generation...\nconversations = [{'role': 'user', 'content': '<image>\\nWhere am I?', 'images': [<PIL.Image.Image image mode=RGB size=2516x1410 at 0x7BA35CF966D0>]}, {'role': 'assistant', 'content': None}]\nraw text = <|im_system|>system<|im_middle|>You are a helpful assistant<|im_end|><|im_user|>user<|im_middle|><|media_start|>image<|media_content|><|media_pad|><|media_end|><image>\nWhere am I?<|im_end|><|im_assistant|>assistant<|im_middle|>\nGeneration complete.\nFinished prediction. Params: temp=0.6, top_p=1.0, max_len=2048",
"output": [
"",
"",
"",
"",
"",
"",
"",
"◁think▷Okay, ",
"let's ",
"",
"see. ",
"The ",
"user ",
"is ",
"asking ",
"",
"\"Where ",
"am ",
"",
"I?\" ",
"and ",
"provided ",
"an ",
"image ",
"of ",
"a ",
"",
"",
"cyberpunk ",
"bar ",
"",
"scene. ",
"The ",
"image ",
"has ",
"a ",
"lot ",
"of ",
"neon ",
"",
"lights, ",
"futuristic ",
"",
"elements, ",
"and ",
"people ",
"sitting ",
"at ",
"a ",
"",
"bar. ",
"There's ",
"a ",
"sign ",
"that ",
"says ",
"",
"\"The ",
"",
"",
"Heist\" ",
"and ",
"some ",
"text ",
"about ",
"sitting ",
"next ",
"to ",
"",
"Jackie. ",
"",
"Also, ",
"there's ",
"a ",
"",
"mini-map ",
"in ",
"the ",
"",
"corner, ",
"which ",
"might ",
"indicate ",
"it's ",
"from ",
"a ",
"video ",
"game.\n\n",
"",
"",
"First, ",
"I ",
"need ",
"to ",
"figure ",
"out ",
"which ",
"game ",
"this ",
"",
"is. ",
"The ",
"",
"",
"cyberpunk ",
"aesthetic ",
"with ",
"the ",
"bar ",
"setting ",
"and ",
"the ",
"",
"mini-map ",
"suggests ",
"it ",
"could ",
"be ",
"",
"",
"Cyberpunk ",
"",
"",
"",
"2077. ",
"The ",
"",
"Heist ",
"as ",
"a ",
"location ",
"is ",
"a ",
"known ",
"area ",
"in ",
"that ",
"",
"game. ",
"The ",
"",
"characters' ",
"outfits ",
"and ",
"the ",
"overall ",
"environment ",
"match ",
"the ",
"game's ",
"",
"style. ",
"The ",
"user ",
"might ",
"be ",
"",
"",
"in-game, ",
"so ",
"the ",
"answer ",
"should ",
"confirm ",
"that ",
"they're ",
"in ",
"",
"",
"Cyberpunk ",
"",
"",
"",
"2077, ",
"specifically ",
"at ",
"The ",
"",
"Heist ",
"",
"bar. ",
"I ",
"should ",
"mention ",
"the ",
"game's ",
"title ",
"and ",
"the ",
"",
"location's ",
"",
"name. ",
"",
"Also, ",
"note ",
"the ",
"HUD ",
"elements ",
"as ",
"part ",
"of ",
"the ",
"game's ",
"",
"interface. ",
"Make ",
"sure ",
"to ",
"keep ",
"it ",
"concise ",
"but ",
"",
"",
"",
"",
"",
"",
"",
"",
"informative.◁/think▷You ",
"are ",
"in ",
"",
"",
"",
"**Cyberpunk ",
"",
"",
"",
"2077**, ",
"a ",
"",
"role-playing ",
"video ",
"game ",
"set ",
"in ",
"a ",
"",
"dystopian ",
"future ",
"version ",
"of ",
"Night ",
"",
"City. ",
"The ",
"image ",
"shows ",
"you ",
"inside ",
"",
"**The ",
"",
"",
"Heist**, ",
"a ",
"bar ",
"or ",
"nightclub ",
"popular ",
"in ",
"the ",
"",
"game. ",
"The ",
"",
"",
"cyberpunk ",
"",
"aesthetic, ",
"neon ",
"",
"lighting, ",
"and ",
"HUD ",
"elements ",
"",
"",
"",
"(mini-map, ",
"objectives ",
"like ",
"",
"\"Sit ",
"next ",
"to ",
"",
"Jackie\") ",
"align ",
"with ",
"the ",
"game's ",
"signature ",
"",
"style. ",
"This ",
"location ",
"is ",
"known ",
"for ",
"its ",
"vibrant ",
"atmosphere ",
"and ",
"interactions ",
"with ",
"characters ",
"like ",
"Jackie ",
"",
"",
"Welles, ",
"a ",
"key ",
"figure ",
"in ",
"the ",
"early ",
"",
"",
"story.<|im_end|>"
],
"data_removed": false,
"error": null,
"source": "web",
"status": "succeeded",
"created_at": "2025-05-01T12:13:48.911Z",
"started_at": "2025-05-01T12:15:29.472849Z",
"completed_at": "2025-05-01T12:15:44.716399Z",
"urls": {
"cancel": "https://api.replicate.com/v1/predictions/0b9g4m3fxxrme0cphf2ajakdq0/cancel",
"get": "https://api.replicate.com/v1/predictions/0b9g4m3fxxrme0cphf2ajakdq0",
"stream": "https://stream-b.svc.ric1.c.replicate.net/v1/streams/6vxkxygbil3befnlyhlj6f27bp5b2olb5ohqmxetaiwbqh5ytugq",
"web": "https://replicate.com/p/0b9g4m3fxxrme0cphf2ajakdq0"
},
"metrics": {
"predict_time": 15.243550127,
"total_time": 115.805399
}
}