Readme
Get text & image embeddings using CLIP.
Details
- Model used: clip-vit-large-patch14
- Length of the embeddings: 768
Response
{
"embedding": [0.1, 0.2, ..., 0.5]
}
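As a rough sketch of how a client might consume this response, the snippet below parses the JSON body, checks that the vector has the 768 values listed under Details, and compares two embeddings. The helper names `parse_embedding` and `cosine_similarity` are hypothetical and not part of this model's API. Cosine similarity is a common way to compare CLIP embeddings, but this README does not prescribe it.

```python
import json
import math

EXPECTED_DIM = 768  # embedding length listed under Details


def parse_embedding(response_text, expected_dim=EXPECTED_DIM):
    """Parse a JSON response body and return the embedding as a list of floats.

    Hypothetical helper: assumes the body has the {"embedding": [...]} shape
    shown in the Response section.
    """
    payload = json.loads(response_text)
    embedding = [float(x) for x in payload["embedding"]]
    if len(embedding) != expected_dim:
        raise ValueError(f"expected {expected_dim} values, got {len(embedding)}")
    return embedding


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (assumed metric)."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)
```

A text embedding and an image embedding from this model can be passed to `cosine_similarity` directly, since both have the same length.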
Generate CLIP (clip-vit-large-patch14) text & image embeddings
This model runs on Nvidia A40 GPU hardware. Predictions typically complete within 1 second, though the predict time varies significantly with the inputs.