✋ This model is not published yet.

You can claim this model if you're @j-min on GitHub. Contact us.

j-min/vl-t5

Unifying Vision-and-Language Tasks via Text Generation

Public
714 runs
  • Prediction

    j-min/vl-t5:560b0f84
    ID
    dyrpwgljczhabhnrwfviv633qi
    Status
    Succeeded
    Source
    Web
    Hardware
    Total duration
    Created

    Input

    image
    image
    question
    what is in the picture?

    Output

    dog
    Generated in
  • Prediction

    j-min/vl-t5:560b0f84
    ID
    cfdj2xlfwraxdjyivkbwa3b3jq
    Status
    Succeeded
    Source
    Web
    Hardware
    Total duration
    Created

    Input

    image
    image
    question
    where are they?

    Output

    beach
    Generated in

Want to make some of these yourself?

Run this model