wcarle / stable-diffusion-videos-mo-di

Generate videos by interpolating the latent space of Stable Diffusion using the Mo-Di Diffusion Model

  • Public
  • 2.8K runs
  • GitHub
  • License



Run time and cost

This model runs on Nvidia A100 (40GB) GPU hardware. Predictions typically complete within 8 minutes. The predict time for this model varies significantly based on the inputs.



Based on nateraw’s project: https://replicate.com/nateraw/stable-diffusion-videos https://github.com/nateraw/stable-diffusion-videos

Swapped out the standard stable diffusion model with Mo Di Diffusion: https://huggingface.co/nitrosocke/mo-di-diffusion

Stable-diffusion-videos allows you to generate videos by interpolating the latent space of Stable Diffusion.

You can either dream up different versions of the same prompt, or morph between different text prompts (with seeds set for each for reproducibility).