lambdal / sd-naruto-diffusers

Stable Diffusion fine-tuned on Naruto

  • Public
  • 2.9K runs

Run time and cost

This model runs on Nvidia A100 (40GB) GPU hardware. Predictions typically complete within 5 seconds.

Readme

Model weights: https://huggingface.co/lambdalabs/sd-naruto-diffusers

Naruto diffusion

Stable Diffusion fine-tuned on Naruto by Lambda Labs.

Game of Thrones to Naruto

pk0.jpg

Marvel to Naruto

pk1.jpg

Prompt engineering matters

We find that prompt engineering helps produce compelling and consistent Naruto-style portraits. For example, prompts such as ‘person_name ninja portrait’ or ‘person_name in the style of Naruto’ tend to produce results closer to the style of a Naruto character, with the characteristic headband and other costume elements.

Here are a few examples of prompts with and without prompt engineering that illustrate this point.

Bill Gates:

pk2.jpg

Without prompt engineering

pk3.jpg

With prompt engineering

A cute bunny:

pk4.jpg

Without prompt engineering

pk4.jpg

With prompt engineering
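The two prompt patterns quoted above can be sketched as a small helper. This is an illustrative sketch, not the authors' code: the function names naruto_prompts and generate are hypothetical, and the generation step assumes the Hugging Face diffusers library, PyTorch, and a CUDA-capable GPU are available.

```python
def naruto_prompts(subject: str) -> dict[str, str]:
    """Return the plain prompt plus the two engineered variants quoted in the text."""
    return {
        "plain": subject,
        "ninja_portrait": f"{subject} ninja portrait",
        "style_of_naruto": f"{subject} in the style of Naruto",
    }


def generate(prompt: str, model_id: str = "lambdalabs/sd-naruto-diffusers"):
    """Generate one image for the prompt (assumed setup: diffusers + torch + CUDA)."""
    # Imported lazily so the prompt helper works without torch/diffusers installed.
    import torch
    from diffusers import StableDiffusionPipeline

    pipe = StableDiffusionPipeline.from_pretrained(model_id, torch_dtype=torch.float16)
    pipe = pipe.to("cuda")  # assumption: a CUDA-capable GPU is present
    return pipe(prompt).images[0]  # a PIL image
```

For example, naruto_prompts("Bill Gates")["ninja_portrait"] gives the engineered prompt, which can then be passed to generate and compared against the image produced from the plain prompt.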

Model description

Trained on BLIP-captioned Naruto images using 2x A6000 GPUs on Lambda GPU Cloud for around 30,000 steps (about 12 hours, at a cost of about $20).
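As a rough sanity check on those figures, the arithmetic below derives GPU-hours, throughput, and an implied hourly rate. The inputs are the approximate values quoted above, so the results are estimates only. The function name training_summary is hypothetical.

```python
def training_summary(gpus: int, hours: float, steps: int, cost_usd: float) -> dict[str, float]:
    """Derive rough throughput and cost figures from approximate training totals."""
    gpu_hours = gpus * hours
    return {
        "gpu_hours": gpu_hours,
        "steps_per_second": steps / (hours * 3600),
        "seconds_per_step": (hours * 3600) / steps,
        "usd_per_gpu_hour": cost_usd / gpu_hours,
    }


# Approximate values from the model description: 2 GPUs, ~12 hours, ~30,000 steps, ~$20.
summary = training_summary(gpus=2, hours=12, steps=30_000, cost_usd=20)
for name, value in summary.items():
    print(f"{name}: {value:.2f}")
```

With these inputs the run comes to 24 GPU-hours, roughly 1.44 seconds per step, and roughly $0.83 per GPU-hour.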

Trained by Eole Cervenka after the work of Justin Pinkney (@Buntworthy) at Lambda Labs.