vivalapanda / stable-diffusion-blip

  • Public
  • 796 runs
Iterate in playground

Input

string
Shift + Return to add a new line

Input prompt

Default: ""

file

Inital image to provide structural or conceptual guidance

string

Captioning model to use. One of 'blip' or 'clip-interrogator-v1'

Default: "blip"

number

Structural (standard) image strength. 0.0 corresponds to full destruction of information, and does not use the initial image for structure.

Default: 0.15

number

Conceptual image strength. 0.0 doesn't use the image conceptually at all, 1.0 only uses the image concept and ignores the prompt.

Default: 0.4

integer

Random seed. Leave blank to randomize the seed

Output

No output yet! Press "Submit" to start a prediction.

Run time and cost

This model costs approximately $0.046 to run on Replicate, or 21 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.

This model runs on Nvidia A100 (80GB) GPU hardware. Predictions typically complete within 33 seconds. The predict time for this model varies significantly based on the inputs.

Readme

Model description

Intended use

Ethical considerations

Caveats and recommendations