Generate videos by interpolating the latent space of Stable Diffusion using the Mo-Di Diffusion Model

Public

2.8K runs

Run time and cost

This model costs approximately $0.62 to run on Replicate, or about 1 run per $1, but this varies depending on your inputs. It is also open source, and you can run it on your own computer with Docker.

This model runs on Nvidia A100 (80GB) GPU hardware. Predictions typically complete within 8 minutes. The predict time for this model varies significantly based on the inputs.

Readme

stable-diffusion-videos-mo-di

Based on nateraw’s stable-diffusion-videos project:
https://replicate.com/nateraw/stable-diffusion-videos
https://github.com/nateraw/stable-diffusion-videos

The standard Stable Diffusion model has been swapped out for Mo-Di Diffusion: https://huggingface.co/nitrosocke/mo-di-diffusion

Stable-diffusion-videos allows you to generate videos by interpolating the latent space of Stable Diffusion.

You can either dream up different versions of the same prompt, or morph between different text prompts (with seeds set for each for reproducibility).
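
To make the idea of interpolating the latent space more concrete, here is a minimal NumPy sketch of morphing between two seeded noise latents with spherical linear interpolation (slerp). It is an illustration under stated assumptions, not the project's actual code: the helper names (slerp, seeded_latent, interpolate_latents), the latent shape (4, 64, 64), and the choice of slerp over plain linear interpolation are all assumptions made for this example. In a real pipeline each interpolated latent would be denoised by the diffusion model into one video frame, and prompt embeddings would be interpolated in a similar way.

```python
import numpy as np


def slerp(t, v0, v1, dot_threshold=0.9995):
    """Spherical linear interpolation between two arrays of the same shape."""
    a = v0.ravel()
    b = v1.ravel()
    cos_theta = np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b))
    if abs(cos_theta) > dot_threshold:
        # Nearly parallel vectors: fall back to plain linear interpolation.
        return (1.0 - t) * v0 + t * v1
    theta = np.arccos(cos_theta)
    sin_theta = np.sin(theta)
    w0 = np.sin((1.0 - t) * theta) / sin_theta
    w1 = np.sin(t * theta) / sin_theta
    return w0 * v0 + w1 * v1


def seeded_latent(seed, shape=(4, 64, 64)):
    """Draw a reproducible Gaussian latent; the shape is an assumption."""
    rng = np.random.default_rng(seed)
    return rng.standard_normal(shape)


def interpolate_latents(seed_a, seed_b, num_steps):
    """Return num_steps latents walking from seed_a's noise to seed_b's."""
    start = seeded_latent(seed_a)
    end = seeded_latent(seed_b)
    return [slerp(t, start, end) for t in np.linspace(0.0, 1.0, num_steps)]


# Hypothetical seeds: one latent per video frame, endpoints fixed by the seeds.
frames = interpolate_latents(seed_a=42, seed_b=1337, num_steps=5)
```

Because each endpoint is generated from a fixed seed, rerunning the walk with the same seeds yields the same sequence of latents, which is what makes a morph between two prompts reproducible.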

Model created over 1 year ago