cjwbw / micromotion-stylegan

Decoding Micromotion in Low-dimensional Latent Spaces from StyleGAN

Public
8.2K runs
T4
GitHub
Paper
License

Run with an API

Playground API Examples README Versions

Input

Run this model in Node.js with one line of code:

npx create-replicate --model=cjwbw/micromotion-stylegan

or set up a project from scratch

Install Replicate’s Node.js client library:

npm install replicate

Set the REPLICATE_API_TOKEN environment variable:

export REPLICATE_API_TOKEN=<paste-your-token-here>

Find your API token in your account settings.

Import and set up the client:

import Replicate from "replicate";

const replicate = new Replicate({
  auth: process.env.REPLICATE_API_TOKEN,
});

Run cjwbw/micromotion-stylegan using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.

const output = await replicate.run(
  "cjwbw/micromotion-stylegan:e662d9732001aa4f30c86927ac39c0da6b8a371fc5391931171a6428bd34c27f",
  {
    input: {
      image: "https://replicate.delivery/mgxm/9e36eba2-1e6c-4f1e-88d2-e9afa6b24728/van_gouh.jpeg",
      scale: 5,
      micromotion: "eyesClose"
    }
  }
);

// To access the file URL:
console.log(output.url()); //=> "http://example.com"

// To write the file to disk:
fs.writeFile("my-image.png", output);

To learn more, take a look at the guide on getting started with Node.js.

Install Replicate’s Python client library:

pip install replicate

Set the REPLICATE_API_TOKEN environment variable:

export REPLICATE_API_TOKEN=<paste-your-token-here>

Find your API token in your account settings.

Import the client:

import replicate

Run cjwbw/micromotion-stylegan using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.

output = replicate.run(
    "cjwbw/micromotion-stylegan:e662d9732001aa4f30c86927ac39c0da6b8a371fc5391931171a6428bd34c27f",
    input={
        "image": "https://replicate.delivery/mgxm/9e36eba2-1e6c-4f1e-88d2-e9afa6b24728/van_gouh.jpeg",
        "scale": 5,
        "micromotion": "eyesClose"
    }
)
print(output)

To learn more, take a look at the guide on getting started with Python.

Set the REPLICATE_API_TOKEN environment variable:

export REPLICATE_API_TOKEN=<paste-your-token-here>

Find your API token in your account settings.

Run cjwbw/micromotion-stylegan using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.

curl -s -X POST \
  -H "Authorization: Bearer $REPLICATE_API_TOKEN" \
  -H "Content-Type: application/json" \
  -H "Prefer: wait" \
  -d $'{
    "version": "cjwbw/micromotion-stylegan:e662d9732001aa4f30c86927ac39c0da6b8a371fc5391931171a6428bd34c27f",
    "input": {
      "image": "https://replicate.delivery/mgxm/9e36eba2-1e6c-4f1e-88d2-e9afa6b24728/van_gouh.jpeg",
      "scale": 5,
      "micromotion": "eyesClose"
    }
  }' \
  https://api.replicate.com/v1/predictions

To learn more, take a look at Replicate’s HTTP API reference docs.

Output

Generated in

26.3 seconds

Examples

View more examples

Run time and cost

This model costs approximately $0.048 to run on Replicate, or 20 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.

This model runs on Nvidia T4 GPU hardware. Predictions typically complete within 4 minutes. The predict time for this model varies significantly based on the inputs.

Readme

This is a cog implementation of https://github.com/wuqiuche/micromotion-styleGAN

Grasping the Arrow of Time from the Singularity: Decoding Micromotion in Low-dimensional Latent Spaces from StyleGAN

Project Page | Paper

Qiucheng Wu Yifan Jiang Junru Wu Kai Wang Gong Zhang Humphrey Shi Zhangyang Wang Shiyu Chang

This is the official implementation of the paper “Grasping the Arrow of Time from the Singularity: Decoding Micromotion in Low-dimensional Latent Spaces from StyleGAN”.

Overview

In this work, we hypothesize and demonstrate that a series of meaningful, natural, and versatile small, local movements (referred to as “micromotion”, such as expression, head movement, and aging effect) can be represented in low-rank spaces extracted from the latent space of a conventionally pre-trained StyleGAN-v2 model for face generation, with the guidance of proper “anchors” in the form of either short text or video clips. Starting from one target face image, with the editing direction decoded from the low-rank space, its micromotion features can be represented as simple as an affine transformation over its latent feature. Perhaps more surprisingly, such micromotion subspace, even learned from just single target face, can be painlessly transferred to other unseen face images, even those from vastly different domains (such as oil painting, cartoon, and sculpture faces).

The workflow

Our complete workflow can be distilled down to three simple steps: (a) collecting anchor latent codes from a single identity; (b) enforcing robustness linear decomposition to obtain a noise-free low-dimensional space; (c) applying the extracted edit direction from low-dimensional space to arbitrary input identities.