Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
This model has no enabled versions.
Refer to the GitHub repository for more information.