Table of contents
ControlNet tile is a ControlNet model (control_v11f1e_sd15_tile) introduced by Lvmin Zhang (lllyasviel) in May 2023.
You can run it on Replicate with the following models:
Use ControlNet tile to:
It is very good at:
However, it:
It upscales by hallucinating new details. You can use a text prompt (and negative prompt), to guide the generation of details towards your desired image.
As the name suggests, it is a tile-based approach. The original image is split into tiles, and each tile is upscaled separately before being recombined. Previous tiling approaches were limited by the way each tile interpreted a given prompt. Consider a ‘photo of a man outside’, split into 9, where the top left tile is just sky. Early techniques would try to diffuse an image of a man into that space, as well as every other tile, without a wider understanding of the whole image. ControlNet tile is different.
The clever part with ControlNet tile is that despite the tiles being upscaled separately, the prompt is always applied to the whole image.
ControlNet tile is best used with any Stable Diffusion 1.5 fine-tune. RealisticVision V5.1 and above are a good choice for photorealistic upscaling. There isn’t currently a ControlNet tile model for SDXL.
In this example of a woman in a bright outfit, the image is upscaled from 1024x1024 (a standard SDXL sized output) to 2560x2560 (2.5x) in 38 seconds.
You can clearly see the issues ControlNet tile can have with maintaining a likeness here. Not only has this woman’s likeness changed, but so has her ethnicity. You’ll also see that her face is no longer distorted, and that the pose and colors have remained consistent. Small details like the buttons and the shirt pattern are also fixed. Meanwhile new details have been introduced - the woman is now wearing a necklace, she is wearing less makeup and the shiny material of the outfit is more pared back.
In another example, we upscale a photo of a cat using the fewjative/ultimate-sd-upscale
model. The original image is 1024x1024, and we upscale it 2x to 2048x2048 in 49 seconds. We use a prompt to describe the image, and a negative prompt to list things we don’t want.
See how the detail in the cat’s fur is improved, and AI generation errors like the zip and eyes are fixed.
Using a ControlNet tile upscaler in the cloud is a great way to improve the quality of images in your website or app. Whether they are from a generative AI model or a real-world source.
We’ll use the batouresearch/high-resolution-controlnet-tile
for this example, but the API is similar whichever model you choose.
Start with our official JavaScript client to run the model with Node.js:
Run batouresearch/high-resolution-controlnet-tile with the following code:
You can also run this model using other Replicate client libraries such as for Python, Golang, Swift, Elixir, and others.