jagilley / stable-diffusion-upscaler

Upscale images with Stable Diffusion

  • Public
  • 3.5K runs
  • T4

Input

image
file (required)

Image to be upscaled

number
(minimum: 0, maximum: 10)

Factor to scale image by

Default: 1.5

string

Prompt. Optional, but it can subtly affect the upscaling result.

Default: ""

integer

Number of samples to generate

Default: 1

integer

Batch size

Default: 1

number
(minimum: 0, maximum: 10)

Scale factor for guidance

Default: 1

string

Decoder to use

Default: "finetuned_840k"

number
(minimum: 0, maximum: 0.6)

Noise augmentation level

Default: 0

string

Noise augmentation type

Default: "gaussian"

string

Sampler to use

Default: "k_dpm_adaptive"

integer

Number of steps to take in the diffusion process

Default: 10

number

Tolerance scale

Default: 0.25

number

ETA

Default: 1
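
The inputs listed above can be collected into a single payload before a prediction is requested. The following is a minimal sketch, not the model's actual schema: this page does not show the field names, so every key below (`scale`, `guidance_scale`, `noise_aug_level`, `tol_scale`, and so on) is a hypothetical placeholder. Only the defaults and the minimum/maximum ranges are taken from the list above.

```python
from dataclasses import dataclass, asdict

# Documented (minimum, maximum) ranges from the input list.
# Key names are hypothetical placeholders, not the model's real field names.
RANGES = {
    "scale": (0.0, 10.0),
    "guidance_scale": (0.0, 10.0),
    "noise_aug_level": (0.0, 0.6),
}


@dataclass
class UpscalerInput:
    """Input payload with the defaults documented on this page."""

    image: str  # path or URL of the image to upscale (required)
    scale: float = 1.5
    prompt: str = ""
    num_samples: int = 1
    batch_size: int = 1
    guidance_scale: float = 1.0
    decoder: str = "finetuned_840k"
    noise_aug_level: float = 0.0
    noise_aug_type: str = "gaussian"
    sampler: str = "k_dpm_adaptive"
    steps: int = 10
    tol_scale: float = 0.25
    eta: float = 1.0

    def validate(self) -> dict:
        """Check the required field and documented ranges, then return a dict."""
        if not self.image:
            raise ValueError("image is required")
        for name, (lo, hi) in RANGES.items():
            value = getattr(self, name)
            if not lo <= value <= hi:
                raise ValueError(f"{name}={value} is outside [{lo}, {hi}]")
        return asdict(self)


# Override only the scale factor; everything else keeps its documented default.
payload = UpscalerInput(image="input.png", scale=2.0).validate()
```

Validating locally catches out-of-range values, such as a noise augmentation level above 0.6, before a paid prediction is started. The resulting dict would still need its keys renamed to match the model's real schema.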

Run time and cost

This model costs approximately $0.077 per run on Replicate (about 12 runs per $1), though the cost varies with your inputs. It is also open source, so you can run it on your own computer with Docker.
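
The relationship between the per-run price and the runs-per-dollar figure can be checked with a short calculation. This is a rough sketch that treats the approximate $0.077 figure as a fixed price, which the text above says it is not. The helper names are illustrative.

```python
import math
from decimal import Decimal

# Approximate per-run cost quoted on this page; real cost varies with inputs.
COST_PER_RUN = Decimal("0.077")


def runs_per_dollar(cost_per_run: Decimal = COST_PER_RUN) -> int:
    """Whole number of runs that fit in one dollar at the given price."""
    return math.floor(Decimal(1) / cost_per_run)


def batch_cost(n_runs: int, cost_per_run: Decimal = COST_PER_RUN) -> Decimal:
    """Estimated total cost in dollars for n_runs predictions."""
    return n_runs * cost_per_run


print(runs_per_dollar())  # 12, since 1 / 0.077 is about 12.99
print(batch_cost(100))    # estimated dollars for 100 runs
```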

This model runs on NVIDIA T4 GPU hardware. Predictions typically complete within 6 minutes, but predict time varies significantly with the inputs.

Readme

Upscale images with Stable Diffusion, optionally including a prompt to subtly alter the input image.

Model description

A latent diffusion upscaler for the Stable Diffusion autoencoder.

      @InProceedings{Rombach_2022_CVPR,
          author    = {Rombach, Robin and Blattmann, Andreas and Lorenz, Dominik and Esser, Patrick and Ommer, Bj\"orn},
          title     = {High-Resolution Image Synthesis With Latent Diffusion Models},
          booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
          month     = {June},
          year      = {2022},
          pages     = {10684-10695}
      }