nvlabs/prismer:308eb68a

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field, its default value is used.

| Field | Type | Default value | Description |
|---|---|---|---|
| image | string | | Input image |
| model_size | string (enum) | base | Choose a model. Options: base, large |
| use_experts | boolean | True | Load the experts if set to True |
| task | string (enum) | caption | Choose between Visual Question Answering and Image Captioning. Options: vqa, caption |
| question | string | | Provide your question for Visual Question Answering task |
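
The fields above can be assembled into a request payload before calling the API. The following is a minimal sketch using only the Python standard library; the helper name `build_prismer_input` is hypothetical and not part of any client library. It assumes that `image` must be non-empty and that `question` must be supplied when `task` is `vqa`; the schema itself does not state either requirement.

```python
from typing import Any, Dict, Optional

# Enum values copied from the input schema.
MODEL_SIZES = ("base", "large")
TASKS = ("vqa", "caption")


def build_prismer_input(
    image: str,
    model_size: str = "base",
    use_experts: bool = True,
    task: str = "caption",
    question: Optional[str] = None,
) -> Dict[str, Any]:
    """Hypothetical helper: validate fields and return an input payload."""
    # Assumption: an empty image reference is never valid.
    if not image:
        raise ValueError("image must be a non-empty string")
    if model_size not in MODEL_SIZES:
        raise ValueError(f"model_size must be one of {MODEL_SIZES}")
    if task not in TASKS:
        raise ValueError(f"task must be one of {TASKS}")

    payload: Dict[str, Any] = {
        "image": image,
        "model_size": model_size,
        "use_experts": use_experts,
        "task": task,
    }

    # Assumption: a VQA request without a question is treated as an error.
    if task == "vqa" and not question:
        raise ValueError("question is needed when task is 'vqa'")
    if question is not None:
        payload["question"] = question
    return payload
```

Calling `build_prismer_input("https://example.com/cat.png")` (a placeholder URL) returns a payload with the defaults from the table filled in; passing `task="vqa"` together with a `question` adds the question field.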

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{'properties': {'answer': {'title': 'Answer', 'type': 'string'},
                'depth': {'format': 'uri', 'title': 'Depth', 'type': 'string'},
                'edge': {'format': 'uri', 'title': 'Edge', 'type': 'string'},
                'obejct_labels': {'title': 'Obejct Labels'},
                'ocr': {'format': 'uri', 'title': 'Ocr', 'type': 'string'},
                'segmentation': {'format': 'uri',
                                 'title': 'Segmentation',
                                 'type': 'string'},
                'surface_normal': {'format': 'uri',
                                   'title': 'Surface Normal',
                                   'type': 'string'}},
 'required': ['answer'],
 'title': 'ModelOutput',
 'type': 'object'}
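
Since only `answer` is required by this schema, a consumer should treat every other key as optional. The sketch below shows one way to read a response under that assumption; `summarize_output` is a hypothetical helper, and the sample values in the usage note are invented placeholders. Note that the key `obejct_labels` is spelled exactly as it appears in the schema.

```python
from typing import Any, Dict

# Output keys that the schema declares as URI strings.
URI_FIELDS = ("depth", "edge", "ocr", "segmentation", "surface_normal")


def summarize_output(output: Dict[str, Any]) -> Dict[str, Any]:
    """Hypothetical helper: check the required field and group the rest."""
    answer = output.get("answer")
    if not isinstance(answer, str):
        raise ValueError("'answer' is required and must be a string")

    # Keep only the URI fields that are actually present as strings.
    expert_maps = {
        name: output[name]
        for name in URI_FIELDS
        if isinstance(output.get(name), str)
    }

    return {
        "answer": answer,
        "expert_maps": expert_maps,
        # The schema gives no type for this key, so it is passed through as-is.
        "object_labels": output.get("obejct_labels"),
    }
```

For example, `summarize_output({"answer": "a cat", "depth": "https://example.com/depth.png"})` returns the answer, a one-entry `expert_maps` dictionary, and `None` for the labels.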