yoadtew/zero-shot-image-to-text:7f2735ba

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field, its default value will be used.

| Field | Type | Default value | Description |
|---|---|---|---|
| image | string | (none) | Input image |
| cond_text | string | Image of a | Conditional text |
| beam_size | integer | 5 (min: 1, max: 10) | Number of beams to use |
| end_factor | number | 1.01 (min: 1, max: 1.1) | Higher value for shorter captions |
| max_seq_length | integer | 15 (min: 1, max: 20) | Maximum number of tokens to generate |
| ce_loss_scale | number | 0.2 (max: 0.6) | Scale of cross-entropy loss with un-shifted language model |
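
As a rough illustration of how the defaults and limits above combine, the sketch below builds an input payload in Python and checks it against the schema before any request is sent. The names `SCHEMA` and `build_input` are hypothetical helpers written for this example and are not part of any API. The sketch also assumes that `image` is required because the schema lists no default for it, and the image URL in the usage note is a placeholder.

```python
from typing import Any, Dict

# Field specs transcribed from the input schema: (type, default, min, max).
# None means the schema lists no value for that slot.
SCHEMA = {
    "image": (str, None, None, None),
    "cond_text": (str, "Image of a", None, None),
    "beam_size": (int, 5, 1, 10),
    "end_factor": (float, 1.01, 1, 1.1),
    "max_seq_length": (int, 15, 1, 20),
    "ce_loss_scale": (float, 0.2, None, 0.6),
}


def build_input(**overrides: Any) -> Dict[str, Any]:
    """Hypothetical helper: merge overrides with defaults and check ranges."""
    unknown = set(overrides) - set(SCHEMA)
    if unknown:
        raise ValueError(f"unknown fields: {sorted(unknown)}")

    payload: Dict[str, Any] = {}
    for name, (kind, default, lo, hi) in SCHEMA.items():
        value = overrides.get(name, default)
        if value is None:
            # Assumption: a field without a default must be supplied.
            raise ValueError(f"{name} is required")
        # Accept plain integers for "number" fields, but never booleans.
        if kind is float and isinstance(value, int) and not isinstance(value, bool):
            value = float(value)
        if isinstance(value, bool) or not isinstance(value, kind):
            raise TypeError(f"{name} must be of type {kind.__name__}")
        if lo is not None and value < lo:
            raise ValueError(f"{name} must be >= {lo}")
        if hi is not None and value > hi:
            raise ValueError(f"{name} must be <= {hi}")
        payload[name] = value
    return payload
```

For example, `build_input(image="https://example.com/cat.jpg", beam_size=3)` keeps the listed defaults for every other field, while `beam_size=11` raises a `ValueError` because it exceeds the maximum of 10. The resulting dictionary is the kind of value an API client would send as the model input. The client call itself is left out here because it needs the full version identifier and an API token, neither of which appears on this page.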

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{'title': 'Output', 'type': 'string'}