adalab-ai / kandinsky_v2_2

Generate images with Kandinsky 2.2 - Mix text and images

  • Public
  • 42.7K runs
  • GitHub
  • License
Iterate in playground

Input

string

Choose a task

Default: "text2img"

string

Choose a scheduler

Default: "unipc"

string
Shift + Return to add a new line

Provide input prompt

Default: "A alien cheeseburger creature eating itself, claymation, cinematic, moody lighting"

string
Shift + Return to add a new line

Specify things to not see in the output for text2img and text_guided_img2img tasks

Default: "ugly, tiling, oversaturated, undersaturated, poorly drawn hands, poorly drawn feet, poorly drawn face, out of frame, extra limbs, disfigured, deformed, body out of frame, blurry, bad anatomy, blurred, watermark, grainy, signature, cut off, draft"

file

Input image for text2img

integer
(minimum: 64, maximum: 1024)

Width of output image. Reduce the seeting if hits memory limits

Default: 512

integer
(minimum: 64, maximum: 1024)

Height of output image. Reduce the seeting if hits memory limits

Default: 512

integer
(minimum: 1, maximum: 4)

Number of images to output.

Default: 1

integer
(minimum: 1, maximum: 500)

Number of denoising steps in prior

Default: 2

integer
(minimum: 1, maximum: 500)

Number of denoising steps

Default: 18

number
(minimum: 1, maximum: 10)

Scale for classifier-free guidance

Default: 4

integer

Random seed. Leave blank to randomize the seed

number
(minimum: 0, maximum: 10)

Weight of image - larger than 1 means more weight to image, lower than 0 is more weight to text

Default: 1

Output

output
Generated in

Run time and cost

This model costs approximately $0.0019 to run on Replicate, or 526 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.

This model runs on Nvidia A100 (80GB) GPU hardware. Predictions typically complete within 2 seconds. The predict time for this model varies significantly based on the inputs.

Readme

This model doesn't have a readme.