fofr / ays-text-to-image

Uses 'Align Your Steps' (AYS) for faster, higher-quality images

  • Public
  • 5K runs
  • L40S
  • GitHub
  • Paper
  • License

Input

string

Default: "a photo of an astronaut riding a unicorn"

string

The negative prompt to guide image generation.

Default: ""

string

The SDXL model to use for generation

Default: "albedobaseXL_v21.safetensors"

integer

Default: 1024

integer

Default: 1024

integer
(minimum: 1, maximum: 10)

Number of outputs

Default: 1

string

An enumeration.

Default: "euler"

number
(minimum: 0, maximum: 30)

Scale for classifier-free guidance

Default: 7.5

integer
(minimum: 10, maximum: 100)

Number of diffusion steps (a minimum of 10 with AYS).

Default: 10

integer
string

Format of the output images

Default: "webp"

integer
(minimum: 0, maximum: 100)

Quality of the output images, from 0 to 100. 100 is best quality, 0 is lowest quality.

Default: 80
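The ranges and defaults listed above can be checked before a prediction is submitted. The sketch below is a minimal illustration, not the model's actual schema: this page lists only types, ranges, and defaults, so every field name (`prompt`, `checkpoint`, `num_inference_steps`, and so on) is a hypothetical placeholder, as is the assignment of the two 1024 defaults to `width` and `height`. The input with no listed default is omitted.

```python
from dataclasses import dataclass, asdict


@dataclass
class AysInputs:
    # All field names are hypothetical; only the defaults come from the page.
    prompt: str = "a photo of an astronaut riding a unicorn"
    negative_prompt: str = ""
    checkpoint: str = "albedobaseXL_v21.safetensors"
    width: int = 1024
    height: int = 1024
    num_outputs: int = 1
    sampler: str = "euler"
    guidance_scale: float = 7.5
    num_inference_steps: int = 10
    output_format: str = "webp"
    output_quality: int = 80


# Inclusive (minimum, maximum) bounds as listed on the page.
RANGES = {
    "num_outputs": (1, 10),
    "guidance_scale": (0, 30),
    "num_inference_steps": (10, 100),
    "output_quality": (0, 100),
}


def validate(inputs: AysInputs) -> dict:
    """Return the inputs as a dict, raising ValueError if a bound is violated."""
    payload = asdict(inputs)
    for name, (low, high) in RANGES.items():
        value = payload[name]
        if not low <= value <= high:
            raise ValueError(f"{name}={value} is outside [{low}, {high}]")
    return payload


if __name__ == "__main__":
    print(validate(AysInputs(num_outputs=2, num_inference_steps=20)))
```

Checking locally like this surfaces an out-of-range value, such as fewer than 10 steps, before a paid run is started. The real input names should be taken from the model's API schema.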

Output


Run time and cost

This model costs approximately $0.041 to run on Replicate, or 24 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.
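The "24 runs per $1" figure follows from the approximate per-run price. Here is a small sketch of that arithmetic, assuming the $0.041 estimate holds for every run; the page notes that the actual cost varies with inputs. The function names are illustrative only.

```python
import math

# Approximate per-run price quoted on the page; real cost varies with inputs.
COST_PER_RUN_USD = 0.041


def runs_per_dollar(cost_per_run: float = COST_PER_RUN_USD) -> int:
    """Whole number of runs that fit in one dollar."""
    return math.floor(1 / cost_per_run)


def batch_cost(num_runs: int, cost_per_run: float = COST_PER_RUN_USD) -> float:
    """Estimated cost in USD for a batch of runs, rounded to cents."""
    return round(num_runs * cost_per_run, 2)


if __name__ == "__main__":
    print(runs_per_dollar())  # 24
    print(batch_cost(100))    # 4.1
```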

This model runs on NVIDIA L40S GPU hardware. Predictions typically complete within 43 seconds, but the prediction time varies significantly based on the inputs.