chenxwh / meissonic

Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

  • Public
  • 35 runs
  • L40S
  • GitHub
  • Weights
  • Paper
  • License

Input

string
Shift + Return to add a new line

Input prompt

Default: "a photo of an astronaut riding a horse on mars"

string
Shift + Return to add a new line

Specify things to not see in the output

Default: "worst quality, low quality, low res, blurry, distortion, watermark, logo, signature, text, jpeg artifacts, signature, sketch, duplicate, ugly, identifying mark"

integer
(minimum: 1, maximum: 100)

Number of denoising steps

Default: 64

number
(minimum: 0, maximum: 20)

Scale for classifier-free guidance

Default: 9

integer

Random seed. Leave blank to randomize the seed

Output

output
Generated in

Run time and cost

This model runs on Nvidia L40S GPU hardware. We don't yet have enough runs of this model to provide performance information.

Readme

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

Meissonic Banner

Meissonic Demos

🚀 Introduction

Meissonic is a non-autoregressive mask image modeling text-to-image synthesis model that can generate high-resolution images. It is designed to run on consumer graphics cards.

Key Features:
- 🖼️ High-resolution image generation (up to 1024x1024)
- 💻 Designed to run on consumer GPUs
- 🎨 Versatile applications: text-to-image, image-to-image

📚 Citation

If you find this work helpful, please consider citing:

@article{bai2024meissonic,
  title={Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis},
  author={Bai, Jinbin and Ye, Tian and Chow, Wei and Song, Enxin and Chen, Qing-Guo and Li, Xiangtai and Dong, Zhen and Zhu, Lei and Yan, Shuicheng},
  journal={arXiv preprint arXiv:2410.08261},
  year={2024}
}

🙏 Acknowledgements

We thank the community and contributors for their invaluable support in developing Meissonic. We thank apolinario@multimodal.art for making Meissonic Demo. We thank @NewGenAI and @飛鷹しずか@自称文系プログラマの勉強 for making YouTube toturials. We thank @pprp for making fp8 and int4 quantization. We thank @camenduru for making jupyter toturial