fofr / sd3-explorer

A model for experimenting with all the SD3 settings. Non-commercial use only, unless you have a Stability AI Self Hosted License.

  • Public
  • 32.2K runs
  • L40S
  • GitHub
  • Paper
  • License

Input

string
Shift + Return to add a new line

This prompt is ignored when using the triple prompt mode. See below.

Default: ""

string

Pick whether to use T5-XXL in fp16, fp8 or not at all. We recommend fp16 for this model as it has the best image quality. When running locally we recommend fp8 for lower memory usage. We've included all versions here for exploration.

Default: "sd3_medium_incl_clips_t5xxlfp16.safetensors"

integer

The width of the image (best output at ~1 megapixel. Resolution must be divisible by 64)

Default: 1024

integer

The height of the image (best output at ~1 megapixel. Resolution must be divisible by 64)

Default: 1024

integer

The number of steps to run the model for (more steps = better image but slower generation. Best results for this model are around 26 to 36 steps.)

Default: 28

string

The sampler to use (used to manage noise)

Default: "dpmpp_2m"

string

The scheduler to use (used to manage noise; do not use karras)

Default: "sgm_uniform"

number
(minimum: 0, maximum: 20)

The timestep scheduling shift; shift values higher than 1.0 are better at managing noise in higher resolutions. Try values 6.0 and 2.0 to experiment with effects.

Default: 3

number
(minimum: 0, maximum: 20)

The guidance scale tells the model how similar the output should be to the prompt. (Recommend between 3.5 and 4.5; if images look 'burnt,' lower the value.)

Default: 3.5

integer
(minimum: 1, maximum: 10)

The number of images to generate

Default: 1

boolean

Default: false

string
Shift + Return to add a new line

The prompt that will be passed to just the CLIP-G model.

Default: ""

string
Shift + Return to add a new line

The prompt that will be passed to just the CLIP-L model.

Default: ""

string
Shift + Return to add a new line

The prompt that will be passed to just the T5-XXL model.

Default: ""

boolean

Whether to add padding for empty prompts. Useful if you only want to pass a prompt to one or two of the three text encoders. Has no effect when all prompts are filled. Disable this for interesting effects.

Default: true

string
Shift + Return to add a new line

Negative prompts do not really work in SD3. This will simply cause your output image to vary in unpredictable ways.

Default: ""

number
(minimum: 0, maximum: 1)

When the negative conditioning should stop being applied. By default it is disabled. If you want to try a negative prompt, start with a value of 0.1

Default: 0

string

Format of the output images

Default: "webp"

integer
(minimum: 0, maximum: 100)

Quality of the output images, from 0 to 100. 100 is best quality, 0 is lowest quality.

Default: 80

integer

Set a seed for reproducibility. Random by default.

Output

output
Generated in

This output was created using a different version of the model, fofr/sd3-explorer:7c48d3a1.

Run time and cost

This model costs approximately $0.0044 to run on Replicate, or 227 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.

This model runs on Nvidia L40S GPU hardware. Predictions typically complete within 5 seconds.