pschaldenbrand/text2video – Run with an API on Replicate

This is a method for generating videos from language descriptions. The video is generated by looping through the given text prompts. Frames are generated at a rate of roughly one frame per second.

More info here: https://pschaldenbrand.github.io/text2video/

Fast Text2Video

By optimizing the pixels of the video’s frames directly, rather than using a pre-trained generator model, this method achieves near real-time video generation. An image-to-image translation model is then used to denoise the directly optimized frames.
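
The idea of looping through prompts and optimizing pixels directly can be illustrated with a toy sketch. This is not the project's code: the real text-driven objective and the denoising model are not described on this page, so the sketch substitutes a hypothetical stand-in where each "prompt" is just a target gray level and the loss is mean squared error. The names `loss_grad` and `generate_video` and all parameters are assumptions made for illustration only.

```python
# Toy sketch of direct pixel optimization while looping through prompts.
# ASSUMPTION: each prompt is represented by a hypothetical target gray level,
# and the loss is plain mean squared error. The real method's objective and
# its image-to-image denoising step are NOT reproduced here.

def loss_grad(frame, target):
    """Gradient of the mean squared error between the pixels and the target."""
    n = len(frame)
    return [2.0 * (p - target) / n for p in frame]


def generate_video(prompt_targets, frames_per_prompt=3, steps=2, lr=1.0, num_pixels=4):
    """Return a list of frames; each frame is a list of pixel values."""
    frame = [0.5] * num_pixels  # start from a mid-gray frame
    video = []
    for target in prompt_targets:  # loop through the given prompts in order
        for _ in range(frames_per_prompt):
            # A small number of gradient steps applied to the pixels themselves;
            # no generator network sits between the loss and the pixels.
            for _ in range(steps):
                grad = loss_grad(frame, target)
                frame = [p - lr * g for p, g in zip(frame, grad)]
            # Each new frame continues from the previous one, so the video
            # drifts gradually from one prompt's target to the next.
            video.append(list(frame))
    return video


if __name__ == "__main__":
    for i, f in enumerate(generate_video([0.0, 1.0])):
        print(i, round(f[0], 4))
```

Because every frame starts from the previous frame's pixels, only a few optimization steps are needed per frame in this toy setting, which is the intuition behind optimizing pixels rather than searching a generator's latent space.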

From the Bot Intelligence Group at Carnegie Mellon University

This method is to be featured at the 2022 NeurIPS Workshop on Machine Learning for Creativity and Design.