Run time and cost

This model costs approximately $0.0090 to run on Replicate, or 111 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.

This model runs on Nvidia A100 (80GB) GPU hardware. Predictions typically complete within 7 seconds.

Readme

CodeGen

This is an unofficial implementation not affiliated with Salesforce.

Sample from the codegen-6B-mono checkpoint trained on python.

Arxiv: A Conversational Paradigm for Program Synthesis

Authors: Erik Nijkamp*, Bo Pang*, Hiroaki Hayashi*, Lifu Tu, Huan Wang, Yingbo Zhou, Silvio Savarese, and Caiming Xiong (* indicates equal contribution)

Released models

The models are named in the following format:

codegen-{model-size}-{data}

model-size has 4 options: 350M, 2B, 6B, 16B, which represent the number of parameters in each model.

data has 3 options: nl, multi, mono.

nl models are randomly initialized and trained on The Pile, a 825.18 GB English text corpous.
multi models are initialized from nl models and then trained on a corpus with code data consisting of multiple programming languages.
mono models are initialized from multi models and then trained on a corpus with Python code data.

Download the model parameters

Citation

If you find our code or paper useful, please cite the paper:

@article{Nijkamp2022ACP,
  title={A Conversational Paradigm for Program Synthesis},
  author={Nijkamp, Erik and Pang, Bo and Hayashi, Hiroaki and Tu, Lifu and Wang, Huan and Zhou, Yingbo and Savarese, Silvio and Xiong, Caiming},
  journal={arXiv preprint},
  year={2022}
}

License

Our code is BSD-3 licensed. See LICENSE.txt for details.

Run time and cost

Readme

CodeGen

Released models

Download the model parameters

codegen-350M-nl,multi,mono

codegen-2B-nl,multi,mono

codegen-6B-nl,multi,mono

codegen-16B-nl,multi,mono

Citation

License