chenxwh / cogvlm2-video

CogVLM2: Visual Language Models for Image and Video Understanding

  • Public
  • 655.9K runs
  • L40S
  • GitHub
  • Weights
  • Paper
  • License
Iterate in playground
  • Prediction

    chenxwh/cogvlm2-video:c435b0a753cd8f179afdcb2a4058767c3b9fd4139b04b7c60a609472ff276654
    ID
    0s7bfy5ffsrgj0cj46cr13fvhg
    Status
    Succeeded
    Source
    Web
    Hardware
    A40 (Large)
    Total duration
    Created

    Input

    prompt
    hen did the banquet scene appear in the video?
    input_video

    Output

    The banquet scene appeared at the 11th second in the video.
    Generated in
  • Prediction

    chenxwh/cogvlm2-video:9da7e9a554d36bb7b5fec36b43b00e4616dc1e819bc963ded8e053d8d8196cb5
    ID
    3qzkhp04p1rgg0cj4p79ctnrwc
    Status
    Succeeded
    Source
    Web
    Hardware
    A40 (Large)
    Total duration
    Created

    Input

    top_p
    0.1
    prompt
    When did the hand appear?
    input_video
    temperature
    0.1
    max_new_tokens
    2048

    Output

    The hand appeared at the 6th second in the video.
    Generated in
  • Prediction

    chenxwh/cogvlm2-video:9da7e9a554d36bb7b5fec36b43b00e4616dc1e819bc963ded8e053d8d8196cb5
    ID
    p5vgfjja2drgg0cj4p98kfg364
    Status
    Succeeded
    Source
    Web
    Hardware
    A40 (Large)
    Total duration
    Created
    by @chenxwh

    Input

    top_p
    0.1
    prompt
    请仔细描述这个视频
    input_video
    temperature
    0.1
    max_new_tokens
    2048

    Output

    In the video, we see a large elephant walking across a dry grassland. The elephant's skin is covered in a vibrant, rainbow-colored pattern. The elephant's ears are large and floppy, and it has a long, curved trunk. The elephant's eyes are visible, and it appears to be moving purposefully. The background is a clear blue sky, and there are no other objects or creatures in sight. The elephant's colorful skin stands out against the natural surroundings, creating a striking visual contrast.
    Generated in

Want to make some of these yourself?

Run this model