nateraw / video-llava

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

  • Public
  • 481.5K runs
  • GitHub
  • Paper
  • License
  1. 26387f81

    Latest

    Release notes

    Updated to work with Video-LLaVA-7B commit b1d6a63f98cc93153d3e9ff295bd6dee5ffafe4c