stability-ai/stable-diffusion

A latent text-to-image diffusion model capable of generating photo-realistic images given any text input
72.5M runs
  • Add negative prompt.
  • Support for more image sizes.
  • Support for arbitrary number of outputs up to 10.
  • Fix multiple outputs not working.

Stable Diffusion 1.5. The checkpoint was initialized with the weights of the Stable-Diffusion-v1-2 checkpoint and subsequently fine-tuned for 595k steps at resolution 512x512 on "laion-aesthetics v2 5+", with the text conditioning dropped 10% of the time to improve classifier-free guidance sampling.

Stable Diffusion 1.4. The checkpoint was initialized with the weights of the Stable-Diffusion-v1-2 checkpoint and subsequently fine-tuned for 225k steps at resolution 512x512 on "laion-aesthetics v2 5+", with the text conditioning dropped 10% of the time to improve classifier-free guidance sampling.
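
Both checkpoints mention dropping the text conditioning during fine-tuning to improve classifier-free guidance sampling. The sketch below illustrates the general idea in plain Python and is not the actual Stable Diffusion training or sampling code. The function names `drop_text_conditioning` and `classifier_free_guidance` are hypothetical, the empty string as the "unconditional" prompt is an assumption, and noise predictions are represented as flat lists of floats for simplicity.

```python
import random


def drop_text_conditioning(prompts, drop_prob=0.10, rng=None):
    """Training-side sketch: replace each prompt with an empty prompt
    with probability `drop_prob`, so the model also learns an
    unconditional prediction. (Hypothetical helper, assumed behavior.)"""
    if rng is None:
        rng = random.Random()
    return ["" if rng.random() < drop_prob else p for p in prompts]


def classifier_free_guidance(eps_uncond, eps_cond, guidance_scale):
    """Sampling-side sketch: move the unconditional noise prediction
    toward the text-conditioned one, scaled by `guidance_scale`.

    eps = eps_uncond + guidance_scale * (eps_cond - eps_uncond)
    """
    if len(eps_uncond) != len(eps_cond):
        raise ValueError("noise predictions must have the same length")
    return [u + guidance_scale * (c - u) for u, c in zip(eps_uncond, eps_cond)]


if __name__ == "__main__":
    # Roughly 10% of the prompts in a batch lose their text conditioning.
    batch = ["a photo of an astronaut riding a horse"] * 20
    print(drop_text_conditioning(batch, rng=random.Random(0)))

    # A scale of 1.0 reproduces the conditioned prediction; larger values
    # push the result further away from the unconditional one.
    print(classifier_free_guidance([0.0, 1.0], [1.0, 3.0], guidance_scale=2.0))
```

In many implementations a negative prompt is handled by using its prediction in place of the unconditional one in the same formula, though whether this model does exactly that is not stated here.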