Kosmos-G: Generating Images in Context with Multimodal Large Language Models