tomasmcm/pandalyst-7b-v1.2 – Run with an API on Replicate

Readme

Pandalyst: A large language model for mastering data analysis using pandas

What is Pandalyst - Pandalyst is a general large language model specifically trained to process and analyze data using the pandas library.

How is Pandalyst - Pandalyst has strong generalization capabilities for data tables in different fields and different data analysis needs.

Why is Pandalyst - Pandalyst is open source and free to use, and its small parameter size (7B/13B) allows us to easily deploy it on local PC. - Pandalyst can handle complex data tables (multiple columns and multiple rows), allowing us to enter enough context to describe our table in detail. - Pandalyst has very competitive performance, significantly outperforming models of the same size and even outperforming some of the strongest closed-source models.

News

🔥[2023/10/15] Now we can plot 📈! and much more powerful! We released Pandalyst-7B-V1.2, which was trained on CodeLlama-7b-Python and it surpasses ChatGPT-3.5 (2023/06/13), Pandalyst-7B-V1.1 and WizardCoder-Python-13B-V1.0 in our PandaTest_V1.0.
🤖️[2023/09/30] We released Pandalyst-7B-V1.1 , which was trained on CodeLlama-7b-Python and achieves the 76.1 exec@1 in our PandaTest_V1.0 and surpasses WizardCoder-Python-13B-V1.0 and ChatGPT-3.5 (2023/06/13).

Model	Checkpoint	Support plot	License
🔥Pandalyst-7B-V1.2	🤗 HF Link	✅	Llama2
Pandalyst-7B-V1.1	🤗 HF Link	❌	Llama2

Usage and Human evaluation

Please refer to Github.