lucataco/paligemma-3b-pt-224
PaliGemma 3B, an open vision-language model (VLM) by Google, pre-trained with 224×224 input images and 128-token input/output text sequences
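
As a rough illustration of what these two numbers mean in practice, the sketch below counts how many image patches a 224×224 input yields and checks a text length against the 128-token budget. The 14-pixel patch size is an assumption that the description does not state, and the helper names are hypothetical, chosen only for this example.

```python
# Minimal sketch, not the model's actual preprocessing code.
IMAGE_SIZE = 224         # from the description: 224x224 input images
MAX_TEXT_TOKENS = 128    # from the description: 128-token text sequences
ASSUMED_PATCH_SIZE = 14  # ASSUMPTION: square ViT-style patches of 14 pixels


def image_token_count(image_size: int, patch_size: int) -> int:
    """Return the number of non-overlapping square patches tiling a square image."""
    if image_size % patch_size != 0:
        raise ValueError("image size must be a multiple of the patch size")
    per_side = image_size // patch_size
    return per_side * per_side


def fits_text_budget(num_tokens: int, budget: int = MAX_TEXT_TOKENS) -> bool:
    """Return True if a tokenized text sequence fits within the token budget."""
    return 0 <= num_tokens <= budget


if __name__ == "__main__":
    print(image_token_count(IMAGE_SIZE, ASSUMED_PATCH_SIZE))
    print(fits_text_budget(100), fits_text_budget(200))
```

Under the assumed patch size, each image maps to a fixed number of patch tokens regardless of content, whereas the text side is a cap: prompts or outputs longer than the budget would need to be truncated or split.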