jagilley / stable-diffusion-upscaler

Upscale images with Stable Diffusion

Demo API Examples Versions (4d0aeee7)


View more examples

Run time and cost

Predictions run on Nvidia T4 GPU hardware. Predictions typically complete within 6 minutes. The predict time for this model varies significantly based on the inputs.

Upscale images with Stable Diffusion, optionally including a prompt to subtly alter the input image.

Model description

A latent diffusion upscaler for the Stable Diffusion autoencoder.

          author    = {Rombach, Robin and Blattmann, Andreas and Lorenz, Dominik and Esser, Patrick and Ommer, Bj\"orn},
          title     = {High-Resolution Image Synthesis With Latent Diffusion Models},
          booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
          month     = {June},
          year      = {2022},
          pages     = {10684-10695}