nvidia / prismer

A Vision-Language Model with An Ensemble of Experts

  • Public
  • 1.7K runs
  • T4
  • Prediction

    nvidia/prismer:e604611d
    ID: 4x5slvr76fbmhbb5nsuwxau7ri
    Status: Succeeded
    Source: Web

    Input

    task: vqa
    question: what is the man sitting down doing?
    input_image: (image)
    use_experts: (value not shown)
    output_expert_labels: (value not shown)

    Output

    riding bike
  • Prediction

    nvidia/prismer:e604611d
    ID: gchbvguxovbkjhb7rgd74psnce
    Status: Succeeded
    Source: Web

    Input

    task: caption
    model_size: base
    input_image: (image)
    use_experts: (value not shown)
    output_expert_labels: (value not shown)

    Output

    edge: (image)
    depth: (image)
    answer: a baseball player swinging a bat at a ball
    ocr_pt: ocr.pt
    ocr_png: (image)
    object_png: (image)
    segmentation: (image)
    object_labels: { "0": 134, "1": 10, "2": 10, "3": 129, "4": 175, "5": 204, "6": 134, "7": 189, "8": 196, "9": 196 }
    surface_normal: (image)
  • Prediction

    nvidia/prismer:e604611d
    ID: gfzyqg7ubjfmvkqfx7ccs6lhfy
    Status: Succeeded
    Source: Web

    Input

    task: caption
    model_size: base
    input_image: (image)
    use_experts: (value not shown)
    output_expert_labels: (value not shown)

    Output

    edge: (image)
    depth: (image)
    answer: a man riding a skateboard across a cross walk
    ocr_pt: ocr.pt
    ocr_png: (image)
    object_png: (image)
    segmentation: (image)
    object_labels: { "0": 134, "1": 151, "2": 607, "3": 607, "4": 134, "5": 607, "6": 136, "7": 607, "8": 15, "9": 176, "10": 15, "11": 15, "12": 219, "13": 201 }
    surface_normal: (image)
  • Prediction

    nvidia/prismer:e604611d
    ID: 6ied42qfhjhwbhsiffcxehc54e
    Status: Succeeded
    Source: Web

    Input

    task: vqa
    question: what is origin of the animal in the picture?
    model_size: large
    input_image: (image)
    use_experts: (value not shown)
    output_expert_labels: (value not shown)

    Output

    edge: (image)
    depth: (image)
    answer: african elephant
    object_png: (image)
    segmentation: (image)
    object_labels: { "0": 173, "1": 209 }
    surface_normal: (image)
  • Prediction

    nvidia/prismer:e604611d
    ID: clyqudkpy5fzric3mbmcnyyysa
    Status: Succeeded
    Source: Web
    Created by: @chenxwh

    Input

    task: caption
    question: (empty)
    model_size: base
    input_image: (image)
    use_experts: (value not shown)
    output_expert_labels: (value not shown)

    Output

    a man riding a skateboard across a cross walk

Want to make some of these yourself?

Run this model
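
The predictions listed on this page share a small set of input fields: task, question, model_size, input_image, use_experts and output_expert_labels. As a rough sketch of how such a payload might be assembled before sending it to the model, the helper below builds a plain dictionary with those field names. The function name build_prismer_input, the default values, the image file names, and the rule that only "vqa" needs a question are all assumptions made for illustration; the page does not state them. The allowed values for task and model_size are limited to the ones that appear in the examples here.

```python
from typing import Any, Dict, Optional

# Values observed in the example predictions; the model may accept others.
TASKS = {"caption", "vqa"}
MODEL_SIZES = {"base", "large"}


def build_prismer_input(
    task: str,
    input_image: str,
    question: Optional[str] = None,
    model_size: str = "base",          # assumed default
    use_experts: bool = True,          # assumed default
    output_expert_labels: bool = False,  # assumed default
) -> Dict[str, Any]:
    """Assemble an input payload (hypothetical helper, not part of any library)."""
    if task not in TASKS:
        raise ValueError(f"unknown task: {task!r}")
    if model_size not in MODEL_SIZES:
        raise ValueError(f"unknown model_size: {model_size!r}")
    # Assumption: a question is only meaningful for the vqa task.
    if task == "vqa" and not (question and question.strip()):
        raise ValueError("the vqa task needs a non-empty question")

    payload: Dict[str, Any] = {
        "task": task,
        "model_size": model_size,
        "input_image": input_image,
        "use_experts": use_experts,
        "output_expert_labels": output_expert_labels,
    }
    if task == "vqa":
        payload["question"] = question.strip()
    return payload


# Payloads mirroring two of the examples; the image paths are placeholders.
vqa_input = build_prismer_input(
    "vqa",
    "elephant.jpg",
    question="what is origin of the animal in the picture?",
    model_size="large",
)
caption_input = build_prismer_input("caption", "skateboard.jpg")
```

A dictionary like this could then be passed as the input argument of the Replicate Python client's replicate.run call, together with the full model version identifier, which is shown only in shortened form on this page.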