✋ This model is not published yet.

You can claim this model if you're @j-min on GitHub. Contact us.

j-min / vl-t5

Unifying Vision-and-Language Tasks via Text Generation

  • Public
  • 685 runs
  • GitHub
  • Paper
  • License

Readme

Unifying Vision-and-Language Tasks via Text Generation

Reference

Please cite our paper if you use our models in your works:

@inproceedings{cho2021vlt5,
  title     = {Unifying Vision-and-Language Tasks via Text Generation},
  author    = {Jaemin Cho and Jie Lei and Hao Tan and Mohit Bansal},
  booktitle = {ICML},
  year      = {2021}
}