yorickvp / llava-13b
Visual instruction tuning towards large language and vision models with GPT-4 level capabilities
Run time and cost
This model runs on Nvidia A40 (Large) GPU hardware. Predictions typically complete within 8 seconds.