chenxwh/cogvlm2

CogVLM2: Visual Language Models for Image and Video Understanding

Public
3K runs

Want to make some of these yourself?

Run this model