yorickvp / llava-13b

Visual instruction tuning towards large language and vision models with GPT-4 level capabilities

  • Public
  • 24.6M runs
  • Hardware: L40S GPU