This trainer uses a single training script that is compatible with both SDXL and SD15.
The trainer has the following capabilities: - automatic image captioning using BLIP - automatic segmentation using CLIPseg - textual_inversion training of a new token to represent the concept - 3 training modes: “style” / “face” / “object” - Full finetuning or LoRa or Dora training modes are supported in the code - LoRa modules are possible for both unet and txt-encoders
The generated checkpoint files are compatible with ComfyUI and AUTO111