Readme

About

This repository provides a 8-step distilled lora for FLUX.1-dev model released by AlimamaCreative Team.

Description

This checkpoint is a 8-step distilled Lora, trained based on FLUX.1-dev model. We use a multi-head discriminator to improve the distill quality. Our model can be used for T2I, inpainting controlnet and other FLUX related models. The recommended guidance_scale=3.5 and lora_scale=1. Our Lower steps version will release later.

Text-to-Image.

With alimama-creative/FLUX.1-dev-Controlnet-Inpainting-Beta. Our distilled lora can be well adapted to the Inpainting controlnet, and the accelerated generated effect can follow the original output well.

How to use

diffusers

This model can be used directly with diffusers

comfyui

T2I turbo workflow: click here
Inpainting controlnet turbo workflow: click here

Training Details

The model is trained on 1M open source and internal sources images, with the aesthetic 6.3+ and resolution greater than 800. We use adversarial training to improve the quality. Our method fix the original FLUX.1-dev transformer as the discriminator backbone, and add multi heads to every transformer layer. We fix the guidance scale as 3.5 during training, and use the time shift as 3.

Mixed precision: bf16

Learning rate: 2e-5

Batch size: 64

Image size: 1024x1024

Licensing and commercial use

If you generate images on Replicate with FLUX.1 models and their fine-tunes, then you can use the images commercially.

If you download the weights off Replicate and generate images on your own computer, you can’t use the images commercially.

Model created over 1 year ago

Examples

Run time and cost