lucataco / florence-2-base

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

  • Public
  • 84.6K runs
  • GitHub
  • Paper
  • License
  • Prediction

    lucataco/florence-2-base:c81609117f666d3a86b262447f80d41ac5158a76adb56893301843a23165eaf8
    ID
    kxppc64b19rgm0cga53bw8ztmm
    Status
    Succeeded
    Source
    Web
    Hardware
    A40
    Total duration
    Created

    Input

    image
    image
    task_input
    Caption

    Output

    {'<CAPTION>': 'A green car parked in front of a yellow building.'}
    Generated in
  • Prediction

    lucataco/florence-2-base:c81609117f666d3a86b262447f80d41ac5158a76adb56893301843a23165eaf8
    ID
    8134v1pw5xrgj0cga53s9s8vh8
    Status
    Succeeded
    Source
    Web
    Hardware
    A40
    Total duration
    Created

    Input

    image
    image
    task_input
    Detailed Caption

    Output

    {'<DETAILED_CAPTION>': 'The image shows a green Volkswagen Beetle parked in front of a yellow building with two brown doors, surrounded by trees and a clear blue sky.'}
    Generated in
  • Prediction

    lucataco/florence-2-base:c81609117f666d3a86b262447f80d41ac5158a76adb56893301843a23165eaf8
    ID
    f0mfxk7w8hrgj0cga53vfm18sg
    Status
    Succeeded
    Source
    Web
    Hardware
    A40
    Total duration
    Created
    by @lucataco

    Input

    image
    image
    task_input
    Object Detection

    Output

    img

    img

    text

    {'<OD>': {'bboxes': [[34.23999786376953, 160.0800018310547, 597.4400024414062, 371.7599792480469], [272.32000732421875, 241.67999267578125, 303.67999267578125, 247.4399871826172], [454.0799865722656, 276.7200012207031, 553.9199829101562, 370.79998779296875], [96.31999969482422, 280.55999755859375, 198.0800018310547, 371.2799987792969]], 'labels': ['car', 'door handle', 'wheel', 'wheel']}}
    Generated in
  • Prediction

    lucataco/florence-2-base:c81609117f666d3a86b262447f80d41ac5158a76adb56893301843a23165eaf8
    ID
    2p2sw6v2x5rgj0cga55bx9280r
    Status
    Succeeded
    Source
    Web
    Hardware
    A40
    Total duration
    Created

    Input

    image
    image
    task_input
    Caption to Phrase Grounding
    text_input
    A green car parked in front of a yellow building

    Output

    img

    img

    text

    {'<CAPTION_TO_PHRASE_GROUNDING>': {'bboxes': [[35.52000045776367, 159.1199951171875, 582.719970703125, 375.1199951171875], [0.3199999928474426, 0.23999999463558197, 639.0399780273438, 305.03997802734375]], 'labels': ['A green car', 'a yellow building']}}
    Generated in
  • Prediction

    lucataco/florence-2-base:c81609117f666d3a86b262447f80d41ac5158a76adb56893301843a23165eaf8
    ID
    9p9f26zz4srgp0cga55see8vtw
    Status
    Succeeded
    Source
    Web
    Hardware
    A40
    Total duration
    Created

    Input

    image
    image
    task_input
    Region Proposal

    Output

    img

    img

    text

    {'<REGION_PROPOSAL>': {'bboxes': [[34.23999786376953, 160.0800018310547, 597.4400024414062, 371.7599792480469], [455.3599853515625, 97.19999694824219, 580.1599731445312, 261.8399963378906], [95.68000030517578, 280.55999755859375, 198.0800018310547, 371.2799987792969], [454.0799865722656, 276.7200012207031, 553.9199829101562, 370.79998779296875], [66.87999725341797, 267.1199951171875, 87.36000061035156, 295.44000244140625], [438.7200012207031, 239.75999450683594, 463.03997802734375, 257.0400085449219], [272.32000732421875, 241.67999267578125, 303.67999267578125, 247.4399871826172], [491.8399963378906, 183.59999084472656, 520.0, 187.9199981689453]], 'labels': ['', '', '', '', '', '', '', '']}}
    Generated in

Want to make some of these yourself?

Run this model