genmoai / mochi-1-lora-trainer

a-r-r-o-w/cogvideox-factory for Mochi-1 LoRA Training


Input

file

A zip file containing the video snippets that will be used for training. We recommend a minimum of 12 videos of only a few seconds each. If you include captions, provide one .txt file per video; e.g., video-1.mp4 should have a caption file named video-1.txt.
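That layout can be assembled with a few lines of Python (a minimal sketch; the my_clips directory and archive name are placeholders):

```python
import zipfile
from pathlib import Path

# Bundle each video and its matching caption (if any) into one archive.
# "my_clips" is a placeholder directory holding video-1.mp4, video-1.txt, etc.
clips = Path("my_clips")
with zipfile.ZipFile("training_data.zip", "w") as zf:
    for video in sorted(clips.glob("*.mp4")):
        zf.write(video, arcname=video.name)
        caption = video.with_suffix(".txt")
        if caption.exists():  # captions are optional but recommended
            zf.write(caption, arcname=caption.name)
```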

boolean

Automatically trim and crop video inputs

Default: true

integer
(minimum: 10, maximum: 6000)

Number of training steps. Recommended range: 500-4000.

Default: 100

number

Learning rate. If you're new to training, you probably don't need to change this.

Default: 0.0004

number
(minimum: 0.01, maximum: 1)

Caption dropout. If you're new to training, you probably don't need to change this.

Default: 0.1

integer

Batch size. You can leave this as 1.

Default: 1

string

Optimizer to use for training. Supports: adam, adamw.

Default: "adamw"

boolean

Compile the transformer

Default: false

integer
(minimum: 0, maximum: 100000)

Seed for reproducibility. You can leave this as 42.

Default: 42

string

Hugging Face repository ID, if you'd like to upload the trained LoRA to Hugging Face. For example, lucataco/mochi-lora-vhs. If the given repo does not exist, a new public repo will be created.

secret

A secret has its value redacted after being sent to the model.

Hugging Face token, if you'd like to upload the trained LoRA to Hugging Face.
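Taken together, a training run can be started through the Replicate Python client along these lines. This is a sketch: the listing above omits the actual input names, so every key below is an assumption inferred from the field descriptions.

```python
import replicate

# All input keys are assumed names inferred from the schema descriptions;
# check the model's API tab for the real ones before running.
output = replicate.run(
    "genmoai/mochi-1-lora-trainer",  # pin a specific version hash in practice
    input={
        "input_videos": open("training_data.zip", "rb"),  # assumed key
        "trim_and_crop": True,         # assumed key
        "steps": 1000,                 # recommended range 500-4000
        "learning_rate": 0.0004,
        "caption_dropout": 0.1,
        "batch_size": 1,
        "optimizer": "adamw",          # supports: adam, adamw
        "compile_transformer": False,  # assumed key
        "seed": 42,
        # Optional Hugging Face upload:
        # "hf_repo_id": "your-username/mochi-lora-example",  # assumed key
        # "hf_token": "hf_...",                              # assumed key
    },
)
print(output)
```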


Run time and cost

This model runs on NVIDIA H100 GPU hardware. We don't yet have enough runs of this model to provide performance information.

Readme

About

A Cog implementation of a-r-r-o-w/cogvideox-factory for Mochi-1 LoRA training

How to use

You must include a zip file of .mov/.mp4 video snippets. It is recommended, but not required, to include a caption for each video in a separate txt file. Note that a lack of captions may hurt fine-tuning quality.
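Before zipping, a quick check like this can flag clips that are missing captions (a sketch; the my_clips directory name is a placeholder):

```python
from pathlib import Path

# Warn about any video without a matching caption file; missing
# captions are allowed but may hurt fine-tuning quality.
clips = Path("my_clips")
for video in sorted(clips.iterdir()):
    if video.suffix.lower() in {".mp4", ".mov"}:
        if not video.with_suffix(".txt").exists():
            print(f"warning: no caption for {video.name}")
```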

Feel free to use any video captioning model to caption your videos; for a list of models, see our Caption Video collection. Captions should be fairly detailed, ideally more than 50 words per video.
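For instance, a caption can be generated and length-checked like this (a sketch; the lucataco/apollo-7b reference and its video input key are assumptions based on the captioner named later in this README):

```python
import replicate
from pathlib import Path

# The model reference and input key are assumptions; swap in whichever
# video captioning model you prefer from the Caption Video collection.
caption = str(replicate.run(
    "lucataco/apollo-7b",                        # assumed model reference
    input={"video": open("video-1.mp4", "rb")},  # assumed input key
))
if len(caption.split()) < 50:
    print("caption is under 50 words; consider asking for more detail")
Path("video-1.txt").write_text(caption)
```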

Make sure each input video is no longer than 2 seconds. Use this tool to help you split up video files: video-split
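If you'd rather split videos locally, ffmpeg's segment muxer does the same job (a sketch assuming ffmpeg is installed and source.mp4 is your input):

```python
import subprocess

# Cut source.mp4 into 2-second pieces: clip-000.mp4, clip-001.mp4, ...
# "-c copy" avoids re-encoding, so each cut lands on the nearest keyframe.
subprocess.run([
    "ffmpeg", "-i", "source.mp4",
    "-f", "segment", "-segment_time", "2",
    "-c", "copy", "-reset_timestamps", "1",
    "clip-%03d.mp4",
], check=True)
```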

Example use case: VHS effect

Let's say we want to train a VHS video-effect LoRA that looks like this: Luis Quintero Noise-Video

  • First, caption the video with apollo-7b or any other video captioning model

  • Now that you have a caption txt file, split the large video into smaller 2.5-second snippets with this tool: mochi1-video-split

  • Now train your VHS LoRA using the same settings as this training run: mochi-lora-vhs

  • Finally, test your LoRA with the LoRA Explorer to see the effect! Be sure to use words similar to those in your caption txt files. Here is an example run of the VHS LoRA

Example Trained LoRAs:

Under the Examples tab you will see Mochi-1 LoRAs trained with this model.

How to Run your LoRA:

Once you have uploaded your LoRA file to a Hugging Face repository, you can try it out with the Mochi-1 LoRA Explorer model.
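For example (a sketch; the explorer's model reference and input keys are assumptions, so check its input schema for the actual names):

```python
import replicate

# Run a test generation with the trained LoRA. The model reference and
# input keys below are assumptions, not the explorer's confirmed schema.
output = replicate.run(
    "lucataco/mochi-1-lora-explorer",  # assumed model reference
    input={
        "prompt": "a city street at night, VHS tape artifacts, analog noise",
        "lora_url": "your-username/mochi-lora-example",  # assumed key
        "seed": 42,
    },
)
print(output)
```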