zsxkib/allegro

Powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from a simple text prompt.

Public
144 runs

Input

Text prompt for video generation
Type: string (required)

Guidance scale
Type: number (minimum: 0, maximum: 20)
Default: 7.5

Number of sampling steps
Type: integer (minimum: 10, maximum: 100)
Default: 20

Enable CPU offload (reduces GPU memory usage but increases inference time)
Type: boolean
Default: false

Number of frames to generate
Type: integer (minimum: 16, maximum: 128)
Default: 88

Height of the generated video
Type: integer (minimum: 256, maximum: 1024)
Default: 720

Width of the generated video
Type: integer (minimum: 256, maximum: 1920)
Default: 1280

Frames per second for the output video
Type: integer (minimum: 1, maximum: 60)
Default: 15

Random seed (set to None for random)
Type: integer
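The ranges and defaults listed above can be checked before submitting a request. The following is a minimal sketch, not the model's own code: this page does not show the input field names, so `prompt`, `guidance_scale`, `num_sampling_steps`, `enable_cpu_offload`, `num_frames`, `height`, `width`, `fps`, and `seed` are hypothetical names chosen for illustration, and `build_input` is a hypothetical helper.

```python
from typing import Any, Dict, Optional

# (minimum, maximum) pairs copied from the Input section.
# The key names are assumptions; the page does not show the real field names.
RANGES = {
    "guidance_scale": (0, 20),
    "num_sampling_steps": (10, 100),
    "num_frames": (16, 128),
    "height": (256, 1024),
    "width": (256, 1920),
    "fps": (1, 60),
}


def build_input(
    prompt: str,
    guidance_scale: float = 7.5,
    num_sampling_steps: int = 20,
    enable_cpu_offload: bool = False,
    num_frames: int = 88,
    height: int = 720,
    width: int = 1280,
    fps: int = 15,
    seed: Optional[int] = None,
) -> Dict[str, Any]:
    """Return an input dict using the documented defaults, after range checks."""
    if not prompt:
        raise ValueError("prompt is required")

    numeric = {
        "guidance_scale": guidance_scale,
        "num_sampling_steps": num_sampling_steps,
        "num_frames": num_frames,
        "height": height,
        "width": width,
        "fps": fps,
    }
    for name, value in numeric.items():
        low, high = RANGES[name]
        if not low <= value <= high:
            raise ValueError(f"{name}={value} is outside [{low}, {high}]")

    payload: Dict[str, Any] = {
        "prompt": prompt,
        "enable_cpu_offload": enable_cpu_offload,
        **numeric,
    }
    # Leaving the seed out corresponds to "set to None for random".
    if seed is not None:
        payload["seed"] = seed
    return payload
```

If the field names happen to match the model's actual schema, a dict like this could be passed as the `input` argument of the Replicate Python client's `replicate.run`; check the model's API schema before relying on these names.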

Run time and cost

This model runs on Nvidia A100 (80GB) GPU hardware. We don't yet have enough runs of this model to provide performance information.

Readme

Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.
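The "up to 6 seconds" figure is consistent with the default inputs listed on this page: 88 frames played back at 15 FPS is just under 6 seconds. A small sketch of that arithmetic (`clip_duration_seconds` is a hypothetical helper, not part of the model):

```python
def clip_duration_seconds(num_frames: int, fps: int) -> float:
    """Playback length of a clip: frame count divided by frame rate."""
    if fps <= 0:
        raise ValueError("fps must be positive")
    return num_frames / fps


# Defaults from the Input section: 88 frames at 15 FPS.
print(round(clip_duration_seconds(88, 15), 2))  # 5.87
```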

kwaivgi/kling-v1.6-standard

Generate short videos from text prompts. Optionally condition on a start image to create image-to-video clips and include up to 4 reference images as scene elements. Choose 5s or 10s duration with 720p output at 30fps, and set aspect ratio to 16:9, 9:16, or 1:1. Supports negative prompts to steer content and returns a video.

1.4m runs
Official

kwaivgi/kling-v2.1

Generate short videos from a start image and a text prompt. Produce 5 or 10 second clips at 24 fps in 720p (standard) or 1080p (pro). Optionally supply an end image in pro mode to guide the final frame or interpolate between start and end.

2.5m runs
Official

bytedance/seedance-1-lite

Generate short videos from text prompts or a starting image. Produce 2–12 second clips at 24 fps in up to 1080p resolution across aspect ratios including 16:9, 4:3, 1:1, 3:4, 9:16, 21:9, and 9:21. Guide subjects, style, and multi-character interactions with 1–4 reference images for character, clothing, and environment consistency. Optionally lock the camera, set a random seed for reproducibility, and anchor start/end frames with first- and last-frame images. Outputs a video.

1.6m runs
Official

kwaivgi/kling-v2.5-turbo-pro

Generate 5–10 second videos from text prompts or a single starting image. Accept a required prompt and optional first-frame image, and output short clips with fluid motion, stable frames, and coherent pacing. Preserve color, lighting, and mood across frames with refined conditioning, and follow multi-step, causal instructions for complex camera moves. Suited for marketing assets, creator shorts, film/animation previz, and educational explainers.

740.3k runs
Official

google/veo-3-fast

Generate videos with sound from a text prompt or from a reference image plus prompt. Create 4–8 second clips at 720p or 1080p in 16:9 or 9:16, with optional native audio generation. Run fast and cost-efficiently for text-to-video storytelling, product spins, concept shots, and image-to-video animations.

128.3k runs
Official

kwaivgi/kling-v1.6-pro

Generate 5–10 second 1080p videos from a text prompt. Provide a start or end image to anchor the first or last frame (one is required) and optionally add up to four reference images as scene elements. Select 16:9, 9:16, or 1:1 aspect ratios and refine results with negative prompts.

759.1k runs
Official

kwaivgi/kling-v2.0

Generate 5–10 second 720p videos from a text prompt. Optionally animate from a starting image by using it as the first frame (image-to-video). Select aspect ratios 16:9, 9:16, or 1:1. Outputs silent video.

79.2k runs
Official

wavespeedai/wan-2.1-t2v-480p

Generate 480p videos from a text prompt with accelerated inference. Accept a prompt plus optional LoRA weights and scaling to apply styles or characters, with controls for aspect ratio (16:9, 9:16), seed for reproducibility, negative prompt, guidance scale, sampling steps, and flow shift. Include a safety checker with an option to disable it. Output is a silent video.

179.4k runs
Official