nateraw/video-llava – API reference

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection