joehoover / mplug-owl
An instruction-tuned multimodal large language model that generates text based on user-provided prompts and images
Input
Output
Want to make some of these yourself?
Run this modelAn instruction-tuned multimodal large language model that generates text based on user-provided prompts and images
Want to make some of these yourself?
Run this model