adirik / grounding-dino

Detect everything with language!

  • Public
  • 7.3M runs
  • L40S
  • GitHub
  • Paper
  • License
Run with an API
  • Prediction

    adirik/grounding-dino:efd10a8ddc57ea28773327e881ce95e20cc1d734c589f7dd01d2036921ed78aa
    ID
    57jvoqzc4xbh2in6gow2oq7hci
    Status
    Succeeded
    Source
    Web
    Hardware
    T4
    Total duration
    Created

    Input

    image
    image
    query
    pink mug
    box_threshold
    0.2
    text_threshold
    0.2
    show_visualisation

    Output

    detections

    { "bbox": [ 19, 204, 408, 563 ], "label": "pink mug", "confidence": 0.8077122569084167 }
    { "bbox": [ 545, 263, 952, 650 ], "label": "pink mug", "confidence": 0.7644544839859009 }
    { "bbox": [ 416, 60, 764, 380 ], "label": "pink mug", "confidence": 0.4754282832145691 }
    { "bbox": [ 909, 161, 1078, 487 ], "label": "pink mug", "confidence": 0.43150201439857483 }

    result_image

    result_image
    Generated in
  • Prediction

    adirik/grounding-dino:efd10a8ddc57ea28773327e881ce95e20cc1d734c589f7dd01d2036921ed78aa
    ID
    24j3axzc4tmw2gpcgudgd6j7ru
    Status
    Succeeded
    Source
    Web
    Hardware
    T4
    Total duration
    Created

    Input

    image
    image
    query
    a cat, a dog
    box_threshold
    0.25
    text_threshold
    0.2
    show_visualisation

    Output

    detections

    { "bbox": [ 449, 363, 748, 624 ], "label": "a cat", "confidence": 0.5377296805381775 }
    { "bbox": [ 116, 200, 427, 644 ], "label": "a dog", "confidence": 0.4842470586299896 }
    { "bbox": [ 829, 236, 1113, 617 ], "label": "a dog", "confidence": 0.523659348487854 }

    result_image

    result_image
    Generated in

Want to make some of these yourself?

Run this model