tomasmcm / pandalyst-7b-v1.2

Source: pipizhao/Pandalyst-7B-V1.2 ✦ Quant: TheBloke/Pandalyst-7B-v1.2-AWQ ✦ Pandalyst: A large language model for mastering data analysis using pandas

  • Public
  • 16 runs
  • Paper
  • License

Input

Output

Run time and cost

This model runs on Nvidia A40 GPU hardware.

Readme

Pandalyst: A large language model for mastering data analysis using pandas

https://github.com/pipizhaoa/Pandalyst

What is Pandalyst - Pandalyst is a general large language model specifically trained to process and analyze data using the pandas library.

How is Pandalyst - Pandalyst has strong generalization capabilities for data tables in different fields and different data analysis needs.

Why is Pandalyst - Pandalyst is open source and free to use, and its small parameter size (7B/13B) allows us to easily deploy it on local PC. - Pandalyst can handle complex data tables (multiple columns and multiple rows), allowing us to enter enough context to describe our table in detail. - Pandalyst has very competitive performance, significantly outperforming models of the same size and even outperforming some of the strongest closed-source models.

News

  • 🔥[2023/10/15] Now we can plot 📈! and much more powerful! We released Pandalyst-7B-V1.2, which was trained on CodeLlama-7b-Python and it surpasses ChatGPT-3.5 (2023/06/13), Pandalyst-7B-V1.1 and WizardCoder-Python-13B-V1.0 in our PandaTest_V1.0.
  • 🤖️[2023/09/30] We released Pandalyst-7B-V1.1 , which was trained on CodeLlama-7b-Python and achieves the 76.1 exec@1 in our PandaTest_V1.0 and surpasses WizardCoder-Python-13B-V1.0 and ChatGPT-3.5 (2023/06/13).
Model Checkpoint Support plot License
🔥Pandalyst-7B-V1.2 🤗 HF Link Llama2
Pandalyst-7B-V1.1 🤗 HF Link Llama2

Usage and Human evaluation

Please refer to Github.