zsxkib / allegro

Powerful text-to-video model that generates high-quality videos of up to 6 seconds at 15 FPS and 720p resolution from a simple text prompt

  • Public
  • 130 runs
  • A100 (80GB)
  • GitHub
  • Weights
  • Paper
  • License

Input

string (required)

Text prompt for video generation

number
(minimum: 0, maximum: 20)

Guidance scale

Default: 7.5

integer
(minimum: 10, maximum: 100)

Number of sampling steps

Default: 20

boolean

Enable CPU offload (reduces GPU memory usage but increases inference time)

Default: false

integer
(minimum: 16, maximum: 128)

Number of frames to generate

Default: 88

integer
(minimum: 256, maximum: 1024)

Height of the generated video

Default: 720

integer
(minimum: 256, maximum: 1920)

Width of the generated video

Default: 1280

integer
(minimum: 1, maximum: 60)

Frames per second for the output video

Default: 15

integer

Random seed (leave unset, or None, for a random seed)
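
The ranges and defaults listed above can be checked on the client side before a run is submitted. The following is a minimal sketch, not the model's own code: the page does not show the input field names, so every name here (`prompt`, `guidance_scale`, `num_sampling_steps`, `enable_cpu_offload`, `num_frames`, `height`, `width`, `fps`, `seed`) is a hypothetical label chosen for illustration. Only the bounds and defaults come from the form.

```python
from dataclasses import dataclass
from typing import Optional

# (minimum, maximum) bounds copied from the input form.
# The keys are hypothetical field names, not confirmed by the page.
RANGES = {
    "guidance_scale": (0, 20),
    "num_sampling_steps": (10, 100),
    "num_frames": (16, 128),
    "height": (256, 1024),
    "width": (256, 1920),
    "fps": (1, 60),
}


@dataclass
class AllegroInput:
    """Input values with the defaults listed on the page."""

    prompt: str
    guidance_scale: float = 7.5
    num_sampling_steps: int = 20
    enable_cpu_offload: bool = False
    num_frames: int = 88
    height: int = 720
    width: int = 1280
    fps: int = 15
    seed: Optional[int] = None  # None means "pick a random seed"

    def to_payload(self) -> dict:
        """Validate against the documented ranges and return a plain dict."""
        if not self.prompt.strip():
            raise ValueError("prompt is required")
        for name, (lo, hi) in RANGES.items():
            value = getattr(self, name)
            if not lo <= value <= hi:
                raise ValueError(f"{name}={value} is outside [{lo}, {hi}]")
        # Omit unset optional values (only the seed can be None here).
        return {k: v for k, v in vars(self).items() if v is not None}


payload = AllegroInput(prompt="A sailboat crossing a calm bay at sunset").to_payload()
print(payload)
```

The resulting dict could be handed to whichever client is used to call the model, after renaming the keys to match the actual input schema.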

Run time and cost

This model runs on Nvidia A100 (80GB) GPU hardware. We don't yet have enough runs of this model to provide performance information.

Readme

Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.
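
The "up to 6 seconds at 15 FPS" figure lines up with the default input values: clip length is the frame count divided by the frame rate. A small sketch of that arithmetic, using a helper name (`clip_duration_seconds`) invented for this example:

```python
def clip_duration_seconds(num_frames: int, fps: int) -> float:
    """Playback length of a clip: frames divided by frames per second."""
    if fps <= 0:
        raise ValueError("fps must be positive")
    return num_frames / fps


# Defaults from the input form: 88 frames at 15 FPS.
print(f"{clip_duration_seconds(88, 15):.2f} s")  # 5.87 s
```

With the frame count held fixed, raising the output frame rate shortens playback rather than adding frames, assuming the frames-per-second input only sets the rate of the encoded video.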