fofr / wan2.1-with-lora

Run Wan2.1 14b or 1.3b with a lora

  • Public
  • 15.2K runs
  • H100
  • GitHub
  • License

Input

*string
Shift + Return to add a new line

Text prompt for video generation

string
Shift + Return to add a new line

Things you do not want to see in your video

Default: ""

file

Image to use as a starting frame for image to video generation.

string

The aspect ratio of the video. 16:9, 9:16, 1:1, etc.

Default: "16:9"

integer

The number of frames to generate (1 to 5 seconds)

Default: 81

string

The model to use. 1.3b is faster, but 14b is better quality. A LORA either works with 1.3b or 14b, depending on the version it was trained on.

Default: "14b"

string

The resolution of the video. 720p is not supported for 1.3b.

Default: "480p"

string
Shift + Return to add a new line

Optional: The URL of a LORA to use

number

Strength of the LORA applied to the model. 0.0 is no LORA.

Default: 1

number

Strength of the LORA applied to the CLIP model. 0.0 is no LORA.

Default: 1

string

Speed up generation with different levels of acceleration. Faster modes may degrade quality somewhat. The speedup is dependent on the content, so different videos may see different speedups.

Default: "Balanced"

integer
(minimum: 1, maximum: 60)

Number of generation steps. Fewer steps means faster generation, at the expensive of output quality. 30 steps is sufficient for most prompts

Default: 30

number
(minimum: 0, maximum: 10)

Higher guide scale makes prompt adherence better, but can reduce variation

Default: 5

number
(minimum: 0, maximum: 10)

Sample shift factor

Default: 8

integer

Set a seed for reproducibility. Random by default.

Output

Generated in

This output was created using a different version of the model, fofr/wan2.1-with-lora:0615656d.

Run time and cost

This model costs approximately $5.14 to run on Replicate, or 0 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.

This model runs on Nvidia H100 GPU hardware. Predictions typically complete within 57 minutes. The predict time for this model varies significantly based on the inputs.

Readme

This model doesn't have a readme.