lucataco / florence-2-large

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks (Updated 11 months, 3 weeks ago)

  • Public
  • 155.6K runs
  • GitHub
  • Paper
  • License
  • Prediction

    lucataco/florence-2-large:da53547e17d45b9cfb48174b2f18af8b83ca020fa76db62136bf9c6616762595
    ID
    apkjznnwgsrgp0cga58b2rgpgm
    Status
    Succeeded
    Source
    Web
    Hardware
    A40
    Total duration
    Created

    Input

    image
    image
    task_input
    Object Detection

    Output

    img

    img

    text

    {'<OD>': {'bboxes': [[34.23999786376953, 160.0800018310547, 597.4400024414062, 371.7599792480469], [456.0, 97.68000030517578, 580.1599731445312, 261.8399963378906], [450.8800048828125, 276.7200012207031, 554.5599975585938, 370.79998779296875], [95.68000030517578, 280.55999755859375, 198.72000122070312, 371.2799987792969]], 'labels': ['car', 'door', 'wheel', 'wheel']}}
    Generated in
  • Prediction

    lucataco/florence-2-large:da53547e17d45b9cfb48174b2f18af8b83ca020fa76db62136bf9c6616762595
    ID
    p9vn496sqnrgg0cga58b20bmk4
    Status
    Succeeded
    Source
    Web
    Hardware
    A40
    Total duration
    Created

    Input

    image
    image
    task_input
    Caption

    Output

    {'<CAPTION>': 'A green car parked in front of a yellow building.'}
    Generated in
  • Prediction

    lucataco/florence-2-large:da53547e17d45b9cfb48174b2f18af8b83ca020fa76db62136bf9c6616762595
    ID
    vyzgpdzjz5rgp0cga588t5d61w
    Status
    Succeeded
    Source
    Web
    Hardware
    A40
    Total duration
    Created

    Input

    image
    image
    task_input
    Detailed Caption

    Output

    {'<DETAILED_CAPTION>': 'The image shows a blue Volkswagen Beetle parked in front of a yellow building with two brown doors, surrounded by trees and a clear blue sky.'}
    Generated in
  • Prediction

    lucataco/florence-2-large:da53547e17d45b9cfb48174b2f18af8b83ca020fa76db62136bf9c6616762595
    ID
    28dsk3g8a5rgp0cga58tbe5vzm
    Status
    Succeeded
    Source
    Web
    Hardware
    A40
    Total duration
    Created

    Input

    image
    image
    task_input
    More Detailed Caption

    Output

    {'<MORE_DETAILED_CAPTION>': 'The image shows a vintage Volkswagen Beetle car parked on a cobblestone street in front of a yellow building with two wooden doors. The car is painted in a bright turquoise color and has a white stripe running along the side. It has two doors on either side of the car, one on top of the other, and a small window on the front. The building appears to be old and dilapidated, with peeling paint and crumbling walls. The sky is blue and there are trees in the background.'}
    Generated in
  • Prediction

    lucataco/florence-2-large:da53547e17d45b9cfb48174b2f18af8b83ca020fa76db62136bf9c6616762595
    ID
    mkv9v3tg8nrgj0cga58t9tewcg
    Status
    Succeeded
    Source
    Web
    Hardware
    A40
    Total duration
    Created

    Input

    image
    image
    task_input
    Caption to Phrase Grounding
    text_input
    A green car parked in front of a yellow building.

    Output

    img

    img

    text

    {'<CAPTION_TO_PHRASE_GROUNDING>': {'bboxes': [[34.880001068115234, 158.63999938964844, 583.3599853515625, 374.6399841308594], [0.3199999928474426, 4.079999923706055, 639.0399780273438, 305.03997802734375]], 'labels': ['A green car', 'a yellow building']}}
    Generated in
  • Prediction

    lucataco/florence-2-large:da53547e17d45b9cfb48174b2f18af8b83ca020fa76db62136bf9c6616762595
    ID
    31xdc72d4srgm0cga59b26fvwg
    Status
    Succeeded
    Source
    Web
    Hardware
    A40
    Total duration
    Created

    Input

    image
    image
    task_input
    Region Proposal

    Output

    img

    img

    text

    {'<REGION_PROPOSAL>': {'bboxes': [[33.599998474121094, 160.0800018310547, 596.7999877929688, 371.7599792480469], [455.3599853515625, 97.68000030517578, 579.5199584960938, 261.8399963378906], [450.8800048828125, 276.7200012207031, 553.2799682617188, 370.79998779296875], [95.04000091552734, 280.55999755859375, 198.0800018310547, 371.2799987792969], [226.87998962402344, 88.55999755859375, 332.47998046875, 164.39999389648438], [65.5999984741211, 266.6399841308594, 86.72000122070312, 295.91998291015625], [271.67999267578125, 241.67999267578125, 302.3999938964844, 246.95999145507812], [408.0, 308.3999938964844, 413.7599792480469, 320.8800048828125]], 'labels': ['', '', '', '', '', '', '', '']}}
    Generated in
  • Prediction

    lucataco/florence-2-large:da53547e17d45b9cfb48174b2f18af8b83ca020fa76db62136bf9c6616762595
    ID
    dtm0z83cr9rgg0cga599v4793r
    Status
    Succeeded
    Source
    Web
    Hardware
    A40
    Total duration
    Created

    Input

    image
    image
    task_input
    Dense Region Caption

    Output

    img

    img

    text

    {'<DENSE_REGION_CAPTION>': {'bboxes': [[33.599998474121094, 160.0800018310547, 596.7999877929688, 371.7599792480469], [450.8800048828125, 276.7200012207031, 553.2799682617188, 370.79998779296875], [95.04000091552734, 280.55999755859375, 197.44000244140625, 371.2799987792969]], 'labels': ['turquoise Volkswagen Beetle', 'wheel', 'wheel']}}
    Generated in

Want to make some of these yourself?

Run this model