cjwbw / diffae

Image Manipulatinon with Diffusion Autoencoders

Demo API Examples Versions (5d917b91)


View more examples

Run time and cost

Predictions run on Nvidia T4 GPU hardware. Predictions typically complete within 45 seconds. The predict time for this model varies significantly based on the inputs.

This is a cog implementation of face manipulation from https://github.com/phizaz/diffae

Official implementation of Diffusion Autoencoders

A CVPR 2022 (ORAL) paper (paper, site, 5-min video):

      title={Diffusion Autoencoders: Toward a Meaningful and Decodable Representation}, 
      author={Preechakul, Konpat and Chatthee, Nattanat and Wizadwongsa, Suttisak and Suwajanakorn, Supasorn},
      booktitle={IEEE Conference on Computer Vision and Pattern Recognition (CVPR)}, 
Original in imgs directory
Aligned with align.py
Using manipulate.ipynb