The Yi series models are large language models trained from scratch by developers at 01.AI. Today, they've released two new models: Yi-6B-Chat and Yi-34B-Chat. These models extend the base models, Yi-6B and Yi-34B, and are fine-tuned for chat completion.
Yi-34B currently holds the state-of-the-art for most benchmarks, beating larger models like Llama-70B.
Yi-34B-Chat is on Replicate and you can run it in the cloud with a few lines of code.
You can run it with our JavaScript client:
Or, our Python client:
Or, you can call the HTTP API directly with tools like cURL:
You can also run Yi chat models using other client libraries for Go, Swift, Elixir, and others.