arunarupa / instantid-image-size-increase

The same InstantID multi-ControlNet model that already existed, with the 1024px limit removed when using a reference image

  • Public
  • 738 runs
  • L40S

Input

*file

Input face image

file

(Optional) reference pose image

string

Input prompt

Default: "a person"

string

Input Negative Prompt

Default: ""

string

Pick which base weights you want to use

Default: "stable-diffusion-xl-base-1.0"

string

Scheduler

Default: "EulerDiscreteScheduler"

integer

Height of the output if reference picture is used

integer
(minimum: 1, maximum: 500)

Number of denoising steps

Default: 30

number
(minimum: 1, maximum: 50)

Scale for classifier-free guidance

Default: 7.5

number
(minimum: 0, maximum: 1.5)

Scale for image adapter strength (for detail)

Default: 0.8

number
(minimum: 0, maximum: 1.5)

Scale for IdentityNet strength (for fidelity)

Default: 0.8

boolean

Enable the OpenPose ControlNet. When set to false, the OpenPose strength setting is ignored

Default: true

number
(minimum: 0, maximum: 1)

Openpose ControlNet strength, effective only if `enable_pose_controlnet` is true

Default: 0.4

boolean

Enable the Canny ControlNet. When set to false, the Canny strength setting is ignored

Default: false

number
(minimum: 0, maximum: 1)

Canny ControlNet strength, effective only if `enable_canny_controlnet` is true

Default: 0.3

boolean

Enable the Depth ControlNet. When set to false, the Depth strength setting is ignored

Default: false

number
(minimum: 0, maximum: 1)

Depth ControlNet strength, effective only if `enable_depth_controlnet` is true

Default: 0.5
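
The enable flags and strengths above interact in a simple way: a disabled ControlNet contributes nothing, whatever its strength is set to. The rule can be sketched in Python as below. The strength field names (`pose_strength`, `canny_strength`, `depth_strength`) are hypothetical, since this page only confirms the `enable_*_controlnet` names, and representing a disabled ControlNet as strength 0.0 is a modeling choice for illustration, not necessarily how the model handles it internally.

```python
# Defaults as listed on this page. The *_strength key names are assumptions;
# only the enable_*_controlnet names appear in the source.
DEFAULTS = {
    "enable_pose_controlnet": True,
    "pose_strength": 0.4,
    "enable_canny_controlnet": False,
    "canny_strength": 0.3,
    "enable_depth_controlnet": False,
    "depth_strength": 0.5,
}


def effective_strengths(inputs=None):
    """Return the strength each ControlNet would apply, 0.0 if it is disabled."""
    settings = {**DEFAULTS, **(inputs or {})}
    result = {}
    for name in ("pose", "canny", "depth"):
        strength = settings[f"{name}_strength"]
        # Each strength is documented with minimum 0 and maximum 1.
        if not 0 <= strength <= 1:
            raise ValueError(f"{name}_strength must be between 0 and 1")
        enabled = settings[f"enable_{name}_controlnet"]
        # A strength only takes effect when its enable flag is true.
        result[name] = strength if enabled else 0.0
    return result
```

With the defaults, only the OpenPose ControlNet is active; setting `enable_canny_controlnet` to true brings the Canny strength of 0.3 into effect.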

boolean

Enable fast inference with LCM (Latent Consistency Models). This reduces the number of inference steps at the cost of some image quality. Performs better with close-up portrait face images

Default: false

integer
(minimum: 1, maximum: 10)

Number of denoising steps when using LCM. Only used when `enable_lcm` is set to true

Default: 5

number
(minimum: 1, maximum: 20)

Scale for classifier-free guidance when using LCM. Only used when `enable_lcm` is set to true

Default: 1.5
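
One way to read the LCM settings above is that, when `enable_lcm` is true, the LCM step count and guidance scale take the place of the regular ones. The following Python sketch encodes that reading together with the ranges and defaults listed on this page. Apart from `enable_lcm`, every parameter name here is an assumption made for illustration, and the function is not the model's own code.

```python
def sampling_settings(
    enable_lcm=False,
    num_inference_steps=30,      # assumed name; documented range 1 to 500
    guidance_scale=7.5,          # assumed name; documented range 1 to 50
    lcm_num_inference_steps=5,   # assumed name; documented range 1 to 10
    lcm_guidance_scale=1.5,      # assumed name; documented range 1 to 20
):
    """Pick the step count and guidance scale that would apply to a run."""
    if enable_lcm:
        steps, guidance = lcm_num_inference_steps, lcm_guidance_scale
        max_steps, max_guidance = 10, 20
    else:
        steps, guidance = num_inference_steps, guidance_scale
        max_steps, max_guidance = 500, 50

    # Validate against the limits documented for the active mode only.
    if not 1 <= steps <= max_steps:
        raise ValueError(f"steps must be between 1 and {max_steps}")
    if not 1 <= guidance <= max_guidance:
        raise ValueError(f"guidance must be between 1 and {max_guidance}")
    return {"steps": steps, "guidance": guidance}
```

Under this reading, a default run uses 30 steps at guidance 7.5, while enabling LCM drops to 5 steps at guidance 1.5.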

boolean

Enhance non-face region

Default: true

string

Format of the output images

Default: "webp"

integer
(minimum: 0, maximum: 100)

Quality of the output images, from 0 to 100. 100 is best quality, 0 is lowest quality.

Default: 80

integer

Random seed. Leave blank to randomize the seed

integer
(minimum: 1, maximum: 8)

Number of images to output

Default: 1

boolean

This model’s safety checker can’t be disabled when running on the website.

Disable safety checker for generated images

Default: false

Run time and cost

This model costs approximately $0.18 to run on Replicate, or 5 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.
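
The "5 runs per $1" figure follows from the per-run price by integer division. A small Python sketch of that arithmetic, working in cents to avoid floating-point rounding, is shown here. The $0.18 price is the approximate figure quoted above, and the helper names are illustrative only; actual cost varies with inputs.

```python
APPROX_COST_PER_RUN_CENTS = 18  # approximate price quoted on this page


def runs_per_dollar(cost_cents=APPROX_COST_PER_RUN_CENTS):
    """Whole runs that fit into one dollar at the given per-run price."""
    return 100 // cost_cents


def estimated_cost_cents(runs, cost_cents=APPROX_COST_PER_RUN_CENTS):
    """Rough total cost in cents for a number of runs at a flat price."""
    if runs < 0:
        raise ValueError("runs must be non-negative")
    return runs * cost_cents
```

For example, `estimated_cost_cents(10)` gives 180 cents, or about $1.80 for ten runs at the quoted price.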

This model runs on Nvidia L40S GPU hardware. Predictions typically complete within 4 minutes. The predict time for this model varies significantly based on the inputs.

Readme

This model doesn't have a readme.