nomagick / qwen-vl-chat

Qwen-VL-Chat but with raw ChatML prompt interface and streaming

  • Public
  • 1.1K runs
  • L40S
  • GitHub
  • Paper
  • License
  • Prediction

    nomagick/qwen-vl-chat:e48b4267
    ID
    ya7n4q3bvboqtab5oarylfquo4
    Status
    Succeeded
    Source
    Web
    Hardware
    A40 (Large)
    Total duration
    Created

    Input

    top_p
    0.8
    image1
    image1
    prompt
    <|im_start|>system You are a helpful assistant<|im_end|> <|im_start|>user Given this image: <img>image1</img>, point out where the dog is<|im_end|> <|im_start|>assistant
    max_tokens
    2048
    temperature
    0.75

    Output

    <ref> the dog</ref><box>(215,422),(578,892)</box>
    Generated in
  • Prediction

    nomagick/qwen-vl-chat:e48b4267
    ID
    pswfzhtb5evjp5vo4jjwym7fgm
    Status
    Succeeded
    Source
    Web
    Hardware
    A40 (Large)
    Total duration
    Created

    Input

    top_p
    0.8
    image1
    null
    prompt
    <|im_start|>system You are a helpful assistant<|im_end|> <|im_start|>user Given this two images: Image1: <img>test1.png</img>, Image2: <img>qwen.png</img>, infer what these image are used for. <|im_end|> <|im_start|>assistant The image is used for a logo or symbol.<|im_end|> <|im_start|>user What is Image2 all about?<|im_end|> <|im_start|>assistant Image2 is a 3D logo of a blue star on a black background.<|im_end|> <|im_start|>user What is Image1 all about?<|im_end|> <|im_start|>assistant
    max_tokens
    2048
    temperature
    0.75
    files_archive
    archive.zip

    Output

    Image1 is a black and white image of a landscape with a mountain, a pavilion, and a river. The scene is set at sunset.
    Generated in
  • Prediction

    nomagick/qwen-vl-chat:e48b4267
    ID
    hz3qw6lb4cgjlijac7r5mx7vai
    Status
    Succeeded
    Source
    Web
    Hardware
    A40 (Large)
    Total duration
    Created

    Input

    top_p
    0.8
    image1
    image1
    image2
    image2
    prompt
    <|im_start|>system You are a helpful assistant<|im_end|> <|im_start|>user Given these two images: Image1: <img>image1</img>, Image2: <img>image2</img>, infer what these image are used for. <|im_end|> <|im_start|>assistant The image is used for a logo or symbol.<|im_end|> <|im_start|>user What is Image2 all about?<|im_end|> <|im_start|>assistant Image2 is a 3D logo of a blue star on a black background.<|im_end|> <|im_start|>user What is Image1 all about?<|im_end|> <|im_start|>assistant
    max_tokens
    2048
    temperature
    0.75
    files_archive
    null

    Output

    Image1 is a picture of a beautiful landscape with a mountain and a temple in the background.
    Generated in

Want to make some of these yourself?

Run this model