The Yi series models are large language models trained from scratch by developers at 01.AI.

This model runs on Nvidia A100 (80GB) GPU hardware. Predictions typically complete within 1 second.


See the full model card here. The model served here uses the original, unquantized weights.

NOTE: As per the license, Replicate was granted permission to share the model here.