zsxkib / uform-gen

🖼️ Super fast 1.5B Image Captioning/VQA Multimodal LLM (Image-to-Text) 🖋️

  • Public
  • 2.3K runs
  • L40S
  • GitHub
  • License
Iterate in playground
  • Prediction

    zsxkib/uform-gen:e6fa8e2d076907b45a0b535a14ddb22402548c2e478310cd18daa1c4c01f422b
    ID
    w4toeq3bcj23ksbqmsgjzhmblq
    Status
    Succeeded
    Source
    Web
    Hardware
    A40 (Large)
    Total duration
    Created

    Input

    image
    image
    prompt
    Describe the image in great detail

    Output

    A cat in a suit and bow tie stands in front of a gray background, looking at the camera with a curious expression. The suit and bow tie convey a formal and stylish appearance. The cat's attire and the suit's design imply it may be a pet or a formal event.
    Generated in

Want to make some of these yourself?

Run this model