replicate / vicuna-13b

A large language model that's been fine-tuned on ChatGPT interactions

Demo API Examples README Versions (6282abe6)

Run time and cost

This model runs on Nvidia A100 (40GB) GPU hardware. Predictions typically complete within 2 seconds.