Minimal and Universal Control for Diffusion Transformer - demo for Subject-driven generation

Public

2.1K runs

License

GitHub

Weights

Paper

Playground API Examples README Versions

Examples

View more examples

Run time and cost

This model costs approximately $0.079 to run on Replicate, or 12 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.

This model runs on Nvidia L40S GPU hardware. Predictions typically complete within 82 seconds. The predict time for this model varies significantly based on the inputs.

Readme

OminiControl: Minimal and Universal Control for Diffusion Transformer

This is the demo for Subject-driven generation. See https://replicate.com/chenxwh/ominicontrol-spatial for Spatially aligned control.

Features

OminiControl is a minimal yet powerful universal control framework for Diffusion Transformer models like FLUX.

Universal Control 🌐: A unified control framework that supports both subject-driven control and spatial control (such as edge-guided and in-painting generation).
Minimal Design 🚀: Injects control signals while preserving original model structure. Only introduces 0.1% additional parameters to the base model.

Citation

@article{
  tan2024omini,
  title={OminiControl: Minimal and Universal Control for Diffusion Transformer},
  author={Zhenxiong Tan, Songhua Liu, Xingyi Yang, Qiaochu Xue, and Xinchao Wang},
  journal={arXiv preprint arXiv:2411.15098},
  year={2024}
}

Model created over 1 year ago