lambdal / image-mixer

Image Mixer Stable Diffusion

  • Public
  • 12.4K runs

Run time and cost

This model costs approximately $0.083 to run on Replicate, or 12 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.

This model runs on Nvidia A40 (Large) GPU hardware. Predictions typically complete within 115 seconds. The predict time for this model varies significantly based on the inputs.
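The cost and timing figures above can be cross-checked with a little arithmetic. The sketch below uses only the numbers quoted on this page. The per-second rate it derives is an assumption: it treats the quoted cost as corresponding to a typical 115-second run, which the page does not state. The `estimate_cost` helper is hypothetical.

```python
# Figures quoted on this page; actual cost varies with the inputs.
COST_PER_RUN_USD = 0.083
TYPICAL_RUN_SECONDS = 115

# Whole runs that fit into one dollar.
runs_per_dollar = int(1 / COST_PER_RUN_USD)

# Implied hardware rate, ASSUMING the quoted cost reflects a typical run.
implied_rate_per_second = COST_PER_RUN_USD / TYPICAL_RUN_SECONDS


def estimate_cost(num_runs):
    """Rough budget for a batch of runs at the quoted per-run cost."""
    return round(num_runs * COST_PER_RUN_USD, 2)


print(runs_per_dollar)                    # 12
print(f"{implied_rate_per_second:.6f}")   # 0.000722
print(estimate_cost(100))                 # 8.3
```

Because predict time varies significantly with the inputs, treat these as ballpark numbers rather than a quote.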

Readme

Image Mixer

Image Mixer is a fine-tuned version of Stable Diffusion Image Variations that has been trained to accept multiple CLIP embeddings concatenated along the sequence dimension (the original model accepts only one). At inference time, Image Mixer combines the embeddings of multiple images (or text) to mix their concepts. The model was trained on a subset of LAION Improved Aesthetics at a resolution of 640x640.
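To illustrate what "concatenated along the sequence dimension" means, here is a minimal sketch in plain Python. It is not the model's code: `fake_clip_embedding` is a hypothetical stand-in for a real CLIP encoder, and the embedding width of 768 is an assumption rather than something stated in this readme. The point is only the shapes: one input gives a conditioning sequence of length 1, while several inputs give a sequence with one entry per input.

```python
import random

EMBED_DIM = 768  # ASSUMED embedding width; not stated in the readme


def fake_clip_embedding(seed):
    """Hypothetical stand-in for a CLIP encoder: one token, shape (1, EMBED_DIM)."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(EMBED_DIM)]]


def concat_along_sequence(embeddings):
    """Join (1, dim) embeddings into a single (n, dim) conditioning sequence."""
    sequence = []
    for emb in embeddings:
        sequence.extend(emb)  # each input contributes one row (one "token")
    return sequence


single = fake_clip_embedding(0)
mixed = concat_along_sequence([fake_clip_embedding(i) for i in range(3)])

print(len(single), len(single[0]))  # sequence length and width for one input
print(len(mixed), len(mixed[0]))    # sequence length and width for three inputs
```

Concatenation keeps each input's embedding intact as its own sequence entry, in contrast to averaging the embeddings into a single vector.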

How to Use Image Mixer

Image Mixer lets you fuse the concepts of two or more images. To get started, upload the images you want to mix and select a mixing strength for each one.
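This page does not say how a mixing strength is applied internally. One plausible reading, shown here purely as an assumption, is that each input's embedding is scaled by its strength before the embeddings are combined. The `apply_strengths` helper and the three-element toy vectors are hypothetical and exist only to make the idea concrete.

```python
def apply_strengths(embeddings, strengths):
    """Scale each embedding vector by its mixing strength (ASSUMED scheme)."""
    if len(embeddings) != len(strengths):
        raise ValueError("need exactly one strength per input")
    return [[s * x for x in vec] for vec, s in zip(embeddings, strengths)]


# Toy embeddings standing in for two uploaded images.
image_a = [0.2, -0.4, 1.0]
image_b = [1.0, 0.5, -0.5]

# Full strength for the first image, half strength for the second.
conditioning = apply_strengths([image_a, image_b], [1.0, 0.5])
print(conditioning)  # [[0.2, -0.4, 1.0], [0.5, 0.25, -0.25]]
```

Under this reading, a strength of 0 would zero out an input's contribution and a strength of 1 would leave it unchanged.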

Model Details

  • Developed by: Justin Pinkney at Lambda Labs
  • Model type: Diffusion-based image-to-image generative model
  • License: MIT License
  • Resources for more information: Check out the original GitHub Repository and the Hugging Face Demo.