nightmareai / majesty-diffusion

Generate images from text using CLIP guided latent diffusion

  • Public
  • 8.3K runs
  • A100 (80GB)
  • GitHub

Input

string

Prompts for CLIP guidance, multiple prompts allowed, one per line. Supports weights.

Default: "The portrait of a Majestic Princess, trending on artstation\n"
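The one-prompt-per-line format with optional weights can be illustrated with a small parser. This is a minimal sketch, not the model's own code: the page does not state the weight syntax, so a trailing `:<number>` suffix is assumed here, and the function name `parse_clip_prompts` is hypothetical.

```python
from typing import List, Tuple


def parse_clip_prompts(raw: str, default_weight: float = 1.0) -> List[Tuple[str, float]]:
    """Split a multi-line prompt string into (text, weight) pairs.

    ASSUMPTION: a weight is written as a trailing ":<number>" on a line.
    The page does not document the actual weight syntax.
    """
    prompts: List[Tuple[str, float]] = []
    for line in raw.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines, e.g. the trailing newline in the default
        text, sep, tail = line.rpartition(":")
        if sep:
            try:
                prompts.append((text.strip(), float(tail)))
                continue
            except ValueError:
                pass  # the colon belongs to the prompt text, not to a weight
        prompts.append((line, default_weight))
    return prompts


if __name__ == "__main__":
    print(parse_clip_prompts("a castle at dawn:2\na foggy moat:0.5\n"))
```

Lines without a parseable numeric suffix fall back to the default weight, so the page's default prompt parses as a single entry.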

string

Prompt for latent diffusion, single prompt, no weights.

Default: "The portrait of a Majestic Princess, trending on artstation"

integer

Output height in pixels. Keep either the height or the width below 1024 to avoid out-of-memory (OOM) errors.

Default: 512

integer

Output width in pixels

Default: 512
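The size constraint can be checked before submitting a run. The sketch below reads "height OR width must be less than 1024" as "at least one dimension is below 1024"; that reading is an assumption, and the helper name `check_output_size` is hypothetical.

```python
def check_output_size(height: int, width: int, limit: int = 1024) -> bool:
    """Return True if at least one dimension is below the limit.

    ASSUMPTION: the page's constraint means at least one of height and
    width must be under 1024; the actual memory limit depends on the GPU.
    """
    if height <= 0 or width <= 0:
        raise ValueError("height and width must be positive")
    return height < limit or width < limit


if __name__ == "__main__":
    for size in [(512, 512), (1024, 768), (1024, 1024)]:
        print(size, check_output_size(*size))
```

Under this reading the default 512 by 512 output passes, while 1024 by 1024 is rejected.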

integer

Determines how much the CLIP prompts affect the image. Note that this numeric scale is not comparable to the scale used for the latent prompt.

Default: 9500

integer

Determines how much the latent_prompt affects the image. Lower values help with text interpretation; higher values help with composition. Try values between 0 and 15. If too much text appears in the image, lower this value.

integer

Aesthetic loss scale

Default: 400

number

Starting timestep

Default: 0.9

string

Latent diffusion model. Sticking with finetuned is recommended; ongo and erlich may need to be downloaded first, which takes extra time.

Default: "finetuned"

string

Negative prompt for Latent Diffusion (things to steer away from)

Default: ""

integer

Number of steps between progressive outputs; set to 0 to disable progressive output. Minor speed impact.

Default: 10

file

Initial image (optional, used instead of noise if supplied)

file

A mask with the same width and height as the initial image, with white indicating where to inpaint

integer

Controls how much the init image should influence the final result. Experiment with values around 1000

Default: 1000

number

Init image brightness

Default: 0

string

Custom settings cfg content, for custom or API usage. See Majesty docs.

Default: "\n"

Output

This example was created by a different version, nightmareai/majesty-diffusion:73bfb38e.

Run time and cost

This model costs approximately $0.75 to run on Replicate, or about 1 run per $1, but this varies depending on your inputs. It is also open source, and you can run it on your own computer with Docker.

This model runs on Nvidia A100 (80GB) GPU hardware. Predictions typically complete within 9 minutes. The predict time for this model varies significantly based on the inputs.
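As a rough budgeting aid, the per-run figure above can be turned into a count of whole runs per budget. This is a sketch only: it treats the approximate $0.75 as a fixed price, which the page says it is not, and the helper name `runs_for_budget` is hypothetical.

```python
from fractions import Fraction

# Approximate cost per run quoted on the page; actual cost varies with inputs.
COST_PER_RUN = Fraction(75, 100)


def runs_for_budget(budget_dollars: int) -> int:
    """Return how many whole runs fit in a budget at the approximate price."""
    if budget_dollars < 0:
        raise ValueError("budget must be non-negative")
    # Floor division of Fractions yields an int and avoids float rounding.
    return Fraction(budget_dollars) // COST_PER_RUN


if __name__ == "__main__":
    for budget in (1, 3, 10):
        print(budget, runs_for_budget(budget))
```

At this price one dollar covers one whole run, which matches the figure quoted above.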

Readme

Majesty Diffusion 👑

Generate images from text with majesty

Formerly known as Princess Generator

Majesty Diffusion is a set of implementations of text-to-image diffusion models with a royal touch 👸

Created by @Dango233 and @apolinario from multimodal ai art

Find and contribute custom settings on the official Hugging Face Space!