mchong6/jojogan

Public
JoJoGAN: One Shot Face Stylization
13.9K runs

Run time and cost

Predictions run on Nvidia T4 GPU hardware. Predictions typically complete within 120 seconds. The predict time for this model varies significantly based on the inputs.

JoJoGAN: One Shot Face Stylization

Abstract

While there have been recent advances in few-shot image stylization, these methods fail to capture stylistic details that are obvious to humans. Details such as the shape of the eyes, the boldness of the lines, are especially difficult for a model to learn, especially so under a limited data setting. In this work, we aim to perform one-shot image stylization that gets the details right. Given a reference style image, we approximate paired real data using GAN inversion and finetune a pretrained StyleGAN using that approximate paired data. We then encourage the StyleGAN to generalize so that the learned style can be applied to all other images.

Authors

Min Jin Chong and David Forsyth

Acknowledgments

This code borrows from StyleGAN2 by rosalinity, e4e. Some snippets of colab code from StyleGAN-NADA

Citation

@article{chong2021jojogan,
  title={JoJoGAN: One Shot Face Stylization},
  author={Chong, Min Jin and Forsyth, David},
  journal={arXiv preprint arXiv:2112.11641},
  year={2021}
}

Replicate