adirik
/
bunny-phi-2-siglip
Lightweight multimodal model for visual question answering, reasoning and captioning
Lightweight multimodal model for visual question answering, reasoning and captioning