✋ This model is not published yet.
You can claim this model if you're @j-min on GitHub. Contact us.
j-min
/
vl-t5
Unifying Vision-and-Language Tasks via Text Generation
Want to make some of these yourself?
Run this model