prompthunt / cog-realvisxl2-lora-inference

  • Public
  • 274 runs

Run prompthunt/cog-realvisxl2-lora-inference with an API

Use one of our client libraries to get started quickly. Clicking on a library will take you to the Playground tab where you can tweak different inputs, see the results, and copy the corresponding code to use in your own project.

Input schema

The fields you can use to run this model with an API. If you don't give a value for a field its default value will be used.

Field Type Default value Description
lora_url
string
Load Lora model
prompt
string
A photo of TOK
Input prompt
negative_prompt
string
plastic, blurry, grainy, [deformed | disfigured], poorly drawn, [bad : wrong] anatomy, [extra | missing | floating | disconnected] limb, (mutated hands and fingers), blurry
Input Negative Prompt
image
string
Input image for img2img or inpaint mode
mask
string
Input mask for inpaint mode. Black areas will be preserved, white areas will be inpainted.
pose_image
string
Input pose image for controlnet mode
width
integer
768
Width of output image
height
integer
1024
Height of output image
num_outputs
integer
1

Min: 1

Max: 100

Number of images to output.
scheduler
string (enum)
DPM++SDEKarras

Options:

DDIM, DPMSolverMultistep, HeunDiscrete, KarrasDPM, K_EULER_ANCESTRAL, K_EULER, PNDM, DPM++SDEKarras

scheduler
num_inference_steps
integer
25

Min: 1

Max: 500

Number of denoising steps
guidance_scale
number
3

Min: 1

Max: 50

Scale for classifier-free guidance
prompt_strength
number
0.8

Max: 1

Prompt strength when using img2img / inpaint. 1.0 corresponds to full destruction of information in image
seed
integer
Random seed. Leave blank to randomize the seed
refine
string (enum)
no_refiner

Options:

no_refiner, expert_ensemble_refiner, base_image_refiner

Which refine style to use
high_noise_frac
number
0.8

Max: 1

For expert_ensemble_refiner, the fraction of noise to use
refine_steps
integer
For base_image_refiner, the number of steps to refine, defaults to num_inference_steps
lora_scale
number
0.8

Max: 1

LoRA additive scale. Only applicable on trained models.
mask_blur_amount
number
8
Amount of blur to apply to the mask.
face_padding
number
1.5
Amount of padding (as percentage) to add to the face bounding box.
face_resize_to
integer
1024
Resize the face bounding box to this size (in pixels).
inpaint_prompt
string
A photo of TOK
Input inpaint prompt
inpaint_negative_prompt
string
(worst quality, low quality, illustration, 3d, 2d, painting, cartoons, sketch), open mouth
Input inpaint negative prompt
inpaint_strength
number
0.35

Max: 1

Prompt strength when using inpaint. 1.0 corresponds to full destruction of information in image
inpaint_num_inference_steps
integer
25

Min: 1

Max: 500

Number of denoising steps for inpainting
inpaint_guidance_scale
number
3

Min: 1

Max: 50

Scale for classifier-free guidance for inpainting
inpaint_lora_scale
number
0.8

Max: 1

LoRA additive scale. Only applicable on trained models.
controlnet_conditioning_scale
number
1
Scale for guidance for controlnet
upscale_face_image
boolean
False
Upscales face image before inpainting
upscale_final_image
boolean
False
Upscales final image before returning
include_debug_output_images
boolean
True
Include debug output in the output

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{
  "type": "array",
  "items": {
    "type": "string",
    "format": "uri"
  },
  "title": "Output"
}