wavespeedai/qwen-image

A 20B MMDiT model for next-gen text-to-image generation

Qwen-Image is a 20B-parameter MMDiT model for next-gen text-to-image generation. It is especially strong at creating graphic posters with natively rendered text, and it is now open source.

Key Highlights:

  • State-of-the-art text rendering: rivals GPT-4o in English, best-in-class for Chinese
  • In-pixel text generation: text is rendered as part of the image itself, with no overlays
  • Bilingual support, diverse fonts, complex layouts
  • Also excels at general image generation, from photorealistic to anime and from impressionist to minimalist