adirik
/
bunny-phi-2-siglip
Lightweight multimodal model for visual question answering, reasoning and captioning
Want to make some of these yourself?
Run this model