CogVLM2: Visual Language Models for Image and Video Understanding