✋ This model is not published yet.

You can claim this model if you're @j-min on GitHub. Contact us.

j-min/vl-t5

Unifying Vision-and-Language Tasks via Text Generation

Public
714 runs

Unifying Vision-and-Language Tasks via Text Generation

Reference

Please cite our paper if you use our models in your works:

@inproceedings{cho2021vlt5,
  title     = {Unifying Vision-and-Language Tasks via Text Generation},
  author    = {Jaemin Cho and Jie Lei and Hao Tan and Mohit Bansal},
  booktitle = {ICML},
  year      = {2021}
}