You're looking at a specific version of this model. Jump to the model overview.
Input schema
The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.
Field | Type | Default value | Description |
---|---|---|---|
image |
string
|
Input image
|
|
model_size |
string
(enum)
|
base
Options: base, large |
Choose a model
|
use_experts |
boolean
|
True
|
Load the experts if set to True
|
task |
string
(enum)
|
caption
Options: vqa, caption |
Choose between Visual Question Answering and Image Captioning
|
question |
string
|
Provide your question for Visual Question Answering task
|
Output schema
The shape of the response you’ll get when you run this model with an API.
Schema
{'properties': {'answer': {'title': 'Answer', 'type': 'string'},
'depth': {'format': 'uri', 'title': 'Depth', 'type': 'string'},
'edge': {'format': 'uri', 'title': 'Edge', 'type': 'string'},
'obejct_labels': {'title': 'Obejct Labels'},
'ocr': {'format': 'uri', 'title': 'Ocr', 'type': 'string'},
'segmentation': {'format': 'uri',
'title': 'Segmentation',
'type': 'string'},
'surface_normal': {'format': 'uri',
'title': 'Surface Normal',
'type': 'string'}},
'required': ['answer'],
'title': 'ModelOutput',
'type': 'object'}