jagilley / stable-diffusion-upscaler

Upscale images with Stable Diffusion

  • Public
  • 3.2K runs
  • GitHub

Input

Output

Run time and cost

This model runs on Nvidia T4 GPU hardware. Predictions typically complete within 6 minutes. The predict time for this model varies significantly based on the inputs.

Readme

Upscale images with Stable Diffusion, optionally including a prompt to subtly alter the input image.

Model description

A latent diffusion upscaler for the Stable Diffusion autoencoder.

      @InProceedings{Rombach_2022_CVPR,
          author    = {Rombach, Robin and Blattmann, Andreas and Lorenz, Dominik and Esser, Patrick and Ommer, Bj\"orn},
          title     = {High-Resolution Image Synthesis With Latent Diffusion Models},
          booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
          month     = {June},
          year      = {2022},
          pages     = {10684-10695}
      }