cjwbw / gta5_artwork_diffusion

GTA5 Artwork Diffusion via Dreambooth

  • Public
  • 4.9K runs
  • A100 (80GB)

Input

string
Shift + Return to add a new line

Input prompt

Default: ""

string
Shift + Return to add a new line

Specify things to not see in the output

integer

Width of output image. Maximum size is 1024x768 or 768x1024 because of memory limits

Default: 768

integer

Height of output image. Maximum size is 1024x768 or 768x1024 because of memory limits

Default: 768

number

Prompt strength when using init image. 1.0 corresponds to full destruction of information in init image

Default: 0.8

integer
(minimum: 1, maximum: 4)

Number of images to output.

Default: 1

integer
(minimum: 1, maximum: 500)

Number of denoising steps

Default: 50

number
(minimum: 1, maximum: 20)

Scale for classifier-free guidance

Default: 7.5

string

Choose a scheduler.

Default: "DPMSolverMultistep"

integer

Random seed. Leave blank to randomize the seed

Output

output
Generated in

Run time and cost

This model costs approximately $0.072 to run on Replicate, or 13 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.

This model runs on Nvidia A100 (80GB) GPU hardware. Predictions typically complete within 52 seconds. The predict time for this model varies significantly based on the inputs.

Readme

weights from: https://huggingface.co/ItsJayQz/GTA5_Artwork_Diffusion

GTA5 Artwork Diffusion

This model was trained on the loading screens, gta storymode, and gta online DLCs artworks. Which includes characters, background, chop, and some objects. The model can do people and portrait pretty easily, as well as cars, and houses. For some reasons, the model stills automatically include in some game footage, so landscapes tend to look a bit more game-like. Please check out important informations on the usage of the model down bellow.

To reference the art style, use the token: name* in gtav style

There is already an existing model that uses textual inversion. This is trained using Dreambooth instead, whether or not this method is better, I will let you judge.

License - This model is under Creative OpenRAIL-M. - This means the model can be used royalty-free, and flexible with the model usage, such as redistribution of the model, or of any derivatives of the model. - However, there are restrictions on the openess of the license. More info into the restrictions can be found here.

Responsibilities - By using/downloading the model, you are responsible for: - All outputs/usage of the model. - Understanding the Disclaimers. - Upholding the terms of the license.

Thanks for checking out the model!

Portraits kreeves.png gta1.png gta3.png