Examples – adirik/bunny-phi-2-siglip

Lightweight multimodal model for visual question answering, reasoning and captioning

Public

7.9K runs