nvlabs/ prismer

A Vision-Language Model with An Ensemble of Experts

Want to make some of these yourself?

Run this model