learners-superpumped / dreambooth-tar

  • Public
  • 2.5K runs
  • L40S

Input

*file

A ZIP file containing your training images (JPG, PNG, etc. size not restricted). These images contain your 'subject' that you want the trained model to embed in the output domain for later generating customized scenes beyond the training images. For best results, use images without noise or unrelated objects in the background.

file

An optional ZIP file containing the training data of class images. This corresponds to `class_prompt` above, also with the purpose of keeping the model generalizable. By default, the pretrained stable-diffusion model will generate N images (determined by the `num_class_images` you set) based on the `class_prompt` provided. But to save time or to have your preferred specific set of `class_data`, you can also provide them in a ZIP file.

integer

The resolution for input images. All the images in the train/validation dataset will be resized to this resolution.

Default: 512

string
Shift + Return to add a new line

The base model for tuning

Default: "https://huggingface.co/BanKaiPls/AsianModel/resolve/main/Brav6.safetensors"

integer

Batch size (per device) for the training dataloader.

Default: 2

string
Shift + Return to add a new line

User name

Default: "testuser"

integer

Total number of training steps to perform. If provided, overrides num_train_epochs.

Default: 1760

number

Initial learning rate (after the potential warmup period) to use.

Default: 0.00002

string

The scheduler type to use

Default: "constant_with_warmup"

string
Shift + Return to add a new line

model save path

Default: "test_model"

integer

Number of steps for the warmup in the lr scheduler.

Default: 176

number

Loss weight for arcface loss

Default: 0.075

Output

No output yet! Press "Submit" to start a prediction.

Run time and cost

This model costs approximately $1.29 to run on Replicate, or 0 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.

This model runs on Nvidia L40S GPU hardware. Predictions typically complete within 22 minutes. The predict time for this model varies significantly based on the inputs.

Readme

This model doesn't have a readme.