chenxwh / diffsynth-exvideo

Extended video synthesis model that generates 128 frames


DiffSynth Studio

Introduction

DiffSynth Studio is a diffusion engine. We have restructured the model architectures, including the Text Encoder, UNet, and VAE, maintaining compatibility with models from the open-source community while improving computational performance. We provide many interesting features. Enjoy the magic of diffusion models!

This demo supports ExVideo.

Long Video Synthesis

We trained an extended video synthesis model that can generate 128 frames.

https://github.com/modelscope/DiffSynth-Studio/assets/35051019/d97f6aa9-8064-4b5b-9d49-ed6001bb9acc