lucataco / paligemma-3b-pt-224

PaliGemma 3B, an open VLM by Google, pre-trained with 224*224 input images and 128 token input/output text sequences

  • Public
  • 437 runs
  • GitHub
  • Paper
  • License

Want to make some of these yourself?

Run this model