joehoover / mplug-owl

An instruction-tuned multimodal large language model that generates text from user-provided prompts and images.

  • Public
  • 55.7K runs
