adirik / bunny-phi-2-siglip

Lightweight multimodal model for visual question answering, reasoning and captioning

  • Public
  • 3.9K runs
  • GitHub
  • Paper
  • License

Want to make some of these yourself?

Run this model