Input

original_prompt

string

Shift + Return to add a new line

A painting of a squirrel eating a burgerA painting of a squirrel eating a burger

Input prompt used for the orinigal image

Default: "pink bear riding a bicycle"

prompt_edit_type

string

Choose the type of the prompt editing. See below for more information. If you are generating the original output, leave this empty.

edited_prompt

string

Shift + Return to add a new line

Prompted used for editing the original sd output image. If prompt_edit_type above is not set, then this field will be ignored. See below for more information for how to edit the prompt from the original prompt. For Re-weight, just provided words in proginal_prompt with new weights.

local_edit

string

Shift + Return to add a new line

Enable local editing. Provide the in the format of 'words_in_original_prompt | words_in_edited_prompt', and the rest content will be preserved.

cross_replace_steps

number

(minimum: 0, maximum: 1)

Cross attention replace steps

Default: 0.8

self_replace_steps

number

(minimum: 0, maximum: 1)

Self attention replace steps

Default: 0.4

seed

integer

Random seed. Leave blank to randomize the seed for original output. But make sure to use the same seed for original-updated prompt pair.

Default: 8888

Run this model in Node.js with one line of code:

npx create-replicate --model=cjwbw/prompt-to-prompt

or set up a project from scratch

Install Replicate’s Node.js client library:

npm install replicate

Set the REPLICATE_API_TOKEN environment variable:

export REPLICATE_API_TOKEN=<paste-your-token-here>

Find your API token in your account settings.

Import and set up the client:

import Replicate from "replicate";

const replicate = new Replicate({
  auth: process.env.REPLICATE_API_TOKEN,
});

Run cjwbw/prompt-to-prompt using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.

const output = await replicate.run(
  "cjwbw/prompt-to-prompt:77f9e56f3c0eb7e635d0197e192980173a48f414499ed07bbc80d5807bdb6191",
  {
    input: {
      seed: 8888,
      edited_prompt: "A painting of a lion eating a burger",
      original_prompt: "A painting of a squirrel eating a burger",
      prompt_edit_type: "Replacement",
      self_replace_steps: 0.4,
      cross_replace_steps: 0.8
    }
  }
);

console.log(output);

To learn more, take a look at the guide on getting started with Node.js.

Install Replicate’s Python client library:

pip install replicate

Set the REPLICATE_API_TOKEN environment variable:

export REPLICATE_API_TOKEN=<paste-your-token-here>

Find your API token in your account settings.

Import the client:

import replicate

Run cjwbw/prompt-to-prompt using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.

output = replicate.run(
    "cjwbw/prompt-to-prompt:77f9e56f3c0eb7e635d0197e192980173a48f414499ed07bbc80d5807bdb6191",
    input={
        "seed": 8888,
        "edited_prompt": "A painting of a lion eating a burger",
        "original_prompt": "A painting of a squirrel eating a burger",
        "prompt_edit_type": "Replacement",
        "self_replace_steps": 0.4,
        "cross_replace_steps": 0.8
    }
)

print(output)

To learn more, take a look at the guide on getting started with Python.

Set the REPLICATE_API_TOKEN environment variable:

export REPLICATE_API_TOKEN=<paste-your-token-here>

Find your API token in your account settings.

Run cjwbw/prompt-to-prompt using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.

curl -s -X POST \
  -H "Authorization: Bearer $REPLICATE_API_TOKEN" \
  -H "Content-Type: application/json" \
  -H "Prefer: wait" \
  -d $'{
    "version": "cjwbw/prompt-to-prompt:77f9e56f3c0eb7e635d0197e192980173a48f414499ed07bbc80d5807bdb6191",
    "input": {
      "seed": 8888,
      "edited_prompt": "A painting of a lion eating a burger",
      "original_prompt": "A painting of a squirrel eating a burger",
      "prompt_edit_type": "Replacement",
      "self_replace_steps": 0.4,
      "cross_replace_steps": 0.8
    }
  }' \
  https://api.replicate.com/v1/predictions

To learn more, take a look at Replicate’s HTTP API reference docs.

You can run this model locally using Cog. First, install Cog:

brew install cog

If you don’t have Homebrew, there are other installation options available.

Run this to download the model and run it in your local environment:

cog predict r8.im/chenxwh/prompt-to-prompt@sha256:77f9e56f3c0eb7e635d0197e192980173a48f414499ed07bbc80d5807bdb6191 \
  -i 'seed=8888' \
  -i 'edited_prompt="A painting of a lion eating a burger"' \
  -i 'original_prompt="A painting of a squirrel eating a burger"' \
  -i 'prompt_edit_type="Replacement"' \
  -i 'self_replace_steps=0.4' \
  -i 'cross_replace_steps=0.8'

To learn more, take a look at the Cog documentation.

Run this to download the model and run it in your local environment:

docker run -d -p 5000:5000 --gpus=all r8.im/chenxwh/prompt-to-prompt@sha256:77f9e56f3c0eb7e635d0197e192980173a48f414499ed07bbc80d5807bdb6191
curl -s -X POST \
  -H "Content-Type: application/json" \
  -d $'{
    "input": {
      "seed": 8888,
      "edited_prompt": "A painting of a lion eating a burger",
      "original_prompt": "A painting of a squirrel eating a burger",
      "prompt_edit_type": "Replacement",
      "self_replace_steps": 0.4,
      "cross_replace_steps": 0.8
    }
  }' \
  http://localhost:5000/predictions

To learn more, take a look at the Cog documentation.

Output

original_sd

with_prompt_to_prompt

Generated in

1 minute 35 seconds

Tweak itReport View full prediction

Examples

View more examples

Run time and cost

This model costs approximately $0.030 to run on Replicate, or 33 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.

This model runs on Nvidia T4 GPU hardware. Predictions typically complete within 135 seconds. The predict time for this model varies significantly based on the inputs.

Readme

Prompt-to-Prompt

Stable Diffusion Implementation

Code for the demo is here https://github.com/chenxwh/prompt-to-prompt

Tips for the demo input above:

Prompt-to-prompt enables editing a stable-diffusion generated image original_image, generated with original_prompt, by editing the original_prompt only edited_prompt.

If you do not already have an original_image - original_prompt pair to play around with the editing, you can generate one by only giving value for original_prompt, set to None. It is best to set a seed (or remember the random seed assigned), which will be used for generating images with edited_prompt .

Now with original_prompt (and original_image in mind), there are three options for editing the prompt. Refer to the instructions below for each type of the editing. If you choose Re-weight, in the edited_prompt field only provide the weights assigned for words from the original_prompt, in the format of [list of words] | [list of weights]. The example gallery may come helpful!

Additionally, there is the local_edit option, in the format of words in original_prompt | words in edited_prompt, which allows you to specify the only words (semantics) that will be edited.

teaser

Prompt Edits

In our notebooks, we perform our main logic by implementing the abstract class AttentionControl object, of the following form:

class AttentionControl(abc.ABC):
    @abc.abstractmethod
    def forward (self, attn, is_cross: bool, place_in_unet: str):
        raise NotImplementedError

The forward method is called in each attention layer of the diffusion model during the image generation, and we use it to modify the weights of the attention. Our method (See Section 3 of our paper) edits images with the procedure above, and each different prompt edit type modifies the weights of the attention in a different manner.

Replacement

In this case, the user swaps tokens of the original prompt with others, e.g., the editing the prompt "A painting of a squirrel eating a burger" to "A painting of a squirrel eating a lasagna" or "A painting of a lion eating a burger". For this we define the class AttentionReplace.

In this case, the user adds new tokens to the prompt, e.g., editing the prompt "A painting of a squirrel eating a burger" to "A watercolor painting of a squirrel eating a burger". For this we define the class AttentionEditRefine.

Re-weight

In this case, the user changes the weight of certain tokens in the prompt, e.g., for the prompt "A photo of a poppy field at night", strengthen or weaken the extent to which the word night affects the resulting image. For this we define the class AttentionReweight.

Attention Control Options

cross_replace_steps: specifies the fraction of steps to edit the cross attention maps. Can also be set to a dictionary [str:float] which specifies fractions for different words in the prompt.
self_replace_steps: specifies the fraction of steps to replace the self attention maps.
local_blend (optional): LocalBlend object which is used to make local edits. LocalBlend is initialized with the words from each prompt that correspond with the region in the image we want to edit.
equalizer: used for attention Re-weighting only. A vector of coefficients to multiply each cross-attention weight

Citation

@article{hertz2022prompt,
  title = {Prompt-to-Prompt Image Editing with Cross Attention Control},
  author = {Hertz, Amir and Mokady, Ron and Tenenbaum, Jay and Aberman, Kfir and Pritch, Yael and Cohen-Or, Daniel},
  journal = {arXiv preprint arXiv:2208.01626},
  year = {2022},
}