You're looking at a specific version of this model. Jump to the model overview.

chenxwh /cogvlm2-video:9da7e9a5

Input

*file

Input video

string
Shift + Return to add a new line

Input prompt

Default: "Describe this video."

number
(minimum: 0, maximum: 1)

When decoding text, samples from the top p percentage of most likely tokens; lower to ignore less likely tokens

Default: 0.1

number
(minimum: 0)

Adjusts randomness of outputs, greater than 1 is random and 0 is deterministic

Default: 0.1

integer
(minimum: 0)

Maximum number of tokens to generate. A word is generally 2-3 tokens

Default: 2048

Output

No output yet! Press "Submit" to start a prediction.