open-mmlab / pia

Personalized Image Animator

  • Public
  • 103.1K runs
  • L40S
  • GitHub
  • Paper
  • License

Input

image
*string
Shift + Return to add a new line

Input prompt.

*file

Input image

string
Shift + Return to add a new line

Things do not show in the output.

Default: "wrong white balance, dark, sketches,worst quality,low quality, deformed, distorted, disfigured, bad eyes, wrong lips, weird mouth, bad teeth, mutated hands and fingers, bad anatomy,wrong anatomy, amputation, extra limb, missing limb, floating,limbs, disconnected limbs, mutation, ugly, disgusting, bad_pictures, negative_hand-neg"

string

Choose a style

Default: "3d_cartoon"

integer

Max size (The long edge of the input image will be resized to this value, larger value means slower inference speed)

Default: 512

integer
(minimum: 1, maximum: 3)

Larger value means larger motion but less identity consistency.

Default: 1

integer
(minimum: 10, maximum: 100)

Number of denoising steps

Default: 25

integer
(minimum: 8, maximum: 24)

Length of the output

Default: 16

number
(minimum: 1, maximum: 20)

Scale for classifier-free guidance

Default: 7.5

number
(minimum: 0, maximum: 1)

Scale for classifier-free guidance

Default: 0

integer

Random seed. Leave blank to randomize the seed

Output

Generated in

Run time and cost

This model costs approximately $0.070 to run on Replicate, or 14 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.

This model runs on Nvidia L40S GPU hardware. Predictions typically complete within 72 seconds. The predict time for this model varies significantly based on the inputs.

Readme

PIA:Personalized Image Animator

PIA is a personalized image animation method which can generate videos with high motion controllability and strong text and image alignment.

AnimateBench

We have open-sourced AnimateBench on HuggingFace which includes images, prompts and configs to evaluate PIA and other image animation methods.

Contact Us

Yiming Zhang: zhangyiming@pjlab.org.cn

Zhening Xing: xingzhening@pjlab.org.cn

Yanhong Zeng: zengyanhong@pjlab.org.cn

Acknowledgements

The code is built upon AnimateDiff, Tune-a-Video and PySceneDetect