vectorspacelab/omnigen:af66691a

Input

string

Input prompt. For multimodal-to-image generation with one or more input images, reference each image in the prompt with a placeholder of the form <img><|image_*|></img> (<|image_1|> for the first image, <|image_2|> for the second image). Refer to the examples for more details.

Default: "a photo of an astronaut riding a horse on mars"
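The placeholder convention described above can be checked before a prompt is submitted. The following is a minimal sketch, not part of the model's own code: the helper names (image_placeholder, referenced_images, check_prompt) are hypothetical, and only the <img><|image_N|></img> format comes from the description.

```python
import re

# Placeholder format taken from the prompt description: <img><|image_N|></img>
PLACEHOLDER_RE = re.compile(r"<img><\|image_(\d+)\|></img>")


def image_placeholder(index: int) -> str:
    """Return the prompt placeholder for the index-th input image (1-based)."""
    if index < 1:
        raise ValueError("image indices start at 1")
    return f"<img><|image_{index}|></img>"


def referenced_images(prompt: str) -> list[int]:
    """Return the sorted, de-duplicated image indices referenced in the prompt."""
    return sorted({int(n) for n in PLACEHOLDER_RE.findall(prompt)})


def check_prompt(prompt: str, num_images: int) -> None:
    """Raise ValueError if the prompt references an image that was not supplied."""
    missing = [i for i in referenced_images(prompt) if i > num_images]
    if missing:
        raise ValueError(f"prompt references images that were not supplied: {missing}")


# Hypothetical two-image prompt built with the helper.
prompt = (
    f"The woman in {image_placeholder(1)} waves at "
    f"the man in {image_placeholder(2)}."
)
check_prompt(prompt, num_images=2)  # passes silently
```

Building the placeholder with a helper rather than typing it by hand avoids malformed tags such as a missing </img>, which the regular expression here would simply not recognize.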

file

Input image 1. Optional

file

Input image 2. Optional

file

Input image 3. Optional

integer
(minimum: 128, maximum: 2048)

Width of the output image

Default: 1024

integer
(minimum: 128, maximum: 2048)

Height of the output image

Default: 1024

integer
(minimum: 1, maximum: 100)

Number of denoising steps

Default: 50

number
(minimum: 1, maximum: 5)

Classifier-free guidance scale for text prompt

Default: 2.5

number
(minimum: 1, maximum: 2)

Classifier-free guidance scale for images

Default: 1.6

integer

Random seed. Leave blank to randomize the seed

integer
(minimum: 128, maximum: 2048)

Maximum input image size

Default: 1024
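The numeric limits listed above can be enforced client-side before a prediction is submitted. This is a rough sketch under a stated assumption: the page does not show the field names, so every key below (width, height, inference_steps, text_guidance_scale, image_guidance_scale, max_input_image_size) is a hypothetical label. Only the ranges and defaults are taken from the list.

```python
# Ranges and defaults are copied from the parameter list above.
# The key names are hypothetical: the page does not show the field names.
NUMERIC_FIELDS = {
    # name: (allow_float, minimum, maximum, default)
    "width": (False, 128, 2048, 1024),
    "height": (False, 128, 2048, 1024),
    "inference_steps": (False, 1, 100, 50),
    "text_guidance_scale": (True, 1, 5, 2.5),
    "image_guidance_scale": (True, 1, 2, 1.6),
    "max_input_image_size": (False, 128, 2048, 1024),
}


def validate_numeric_inputs(overrides: dict) -> dict:
    """Merge overrides over the documented defaults and range-check each value."""
    unknown = set(overrides) - set(NUMERIC_FIELDS)
    if unknown:
        raise KeyError(f"unknown fields: {sorted(unknown)}")

    result = {}
    for name, (allow_float, lo, hi, default) in NUMERIC_FIELDS.items():
        value = overrides.get(name, default)
        # bool is a subclass of int in Python, so reject it explicitly.
        allowed = (int, float) if allow_float else (int,)
        if isinstance(value, bool) or not isinstance(value, allowed):
            kind = "a number" if allow_float else "an integer"
            raise TypeError(f"{name} must be {kind}")
        if not lo <= value <= hi:
            raise ValueError(f"{name}={value} is outside [{lo}, {hi}]")
        result[name] = value
    return result


settings = validate_numeric_inputs({"width": 512, "text_guidance_scale": 3})
```

The seed is left out on purpose: the list gives it no range and allows it to be blank.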

boolean

Whether to run a separate inference pass for each guidance type. This reduces memory usage.

Default: true

boolean

Offload the model to CPU, which significantly reduces memory usage but slows down generation. You can disable separate_cfg_infer and set offload_model=True. If both separate_cfg_infer and offload_model are True, memory usage is reduced further, but generation is slowest.

Default: false
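The combinations of the two memory flags described above can be captured as named presets. This is only an illustrative sketch: the flag names separate_cfg_infer and offload_model appear in the description, while the preset names and the memory_flags helper are hypothetical.

```python
# Flag names separate_cfg_infer and offload_model come from the text above;
# the preset names are hypothetical labels used only in this sketch.
MEMORY_PRESETS = {
    # Documented defaults: separate guidance passes, no CPU offload.
    "default": {"separate_cfg_infer": True, "offload_model": False},
    # CPU offload alone: large memory saving, slower generation.
    "offload_only": {"separate_cfg_infer": False, "offload_model": True},
    # Both enabled: lowest memory use, slowest generation.
    "lowest_memory": {"separate_cfg_infer": True, "offload_model": True},
}


def memory_flags(preset: str = "default") -> dict:
    """Return a copy of the two memory-related flags for the named preset."""
    if preset not in MEMORY_PRESETS:
        raise ValueError(
            f"unknown preset {preset!r}; choose from {sorted(MEMORY_PRESETS)}"
        )
    return dict(MEMORY_PRESETS[preset])


flags = memory_flags("lowest_memory")
```

Returning a copy keeps callers from mutating the shared preset table by accident.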

boolean

Automatically adjust the output image size to match the input image size. For editing and ControlNet tasks, this ensures the output image has the same size as the input image, which leads to better performance.

Default: false

Output
