yorickvp / llava-13b

Visual instruction tuning towards large language and vision models with GPT-4 level capabilities

  • Public
  • 9.1M runs
  • GitHub
  • Paper
  • License