These models generate videos from text prompts, images, and reference materials. The field is advancing fast — most models now generate native audio alongside video.
Runway Gen-4.5 is the top-rated video generation model, ranked #1 on the Artificial Analysis text-to-video benchmark. It produces videos with realistic physics — objects have weight, liquids flow naturally, and fine details like hair and fabric stay coherent across frames. Great for polished, cinematic clips where visual fidelity matters most.
Google Veo 3.1 and Veo 3.1 Fast are strong alternatives with native audio generation. Veo 3.1 Fast is a good pick when you want high quality with quicker turnaround. Veo 3.1 Lite is a more affordable option for high-volume use.
Kling Video 3.0 generates cinematic videos up to 15 seconds with native audio — including lip-synced dialogue, sound effects, and ambient sound. Its multi-shot mode lets you define up to 6 connected scenes in a single generation, making it ideal for short narratives, product demos, and ads.
Kling Video 3.0 Omni adds reference-based generation and video editing on top. Upload reference images to keep character appearance consistent across scenes, or feed in a reference video for style and camera movement transfer.
Seedance 2.0 from ByteDance accepts up to 9 reference images, 3 video clips, and 3 audio files — all combinable in your prompt. Supports T2V, I2V, video continuation, character consistency, motion transfer, and lip-synced dialogue with intelligent duration control. Seedance 2.0 Fast trades some quality for speed.
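The reference limits above can be checked before a request is submitted. This is a minimal sketch: only the limits (9 images, 3 video clips, 3 audio files) come from the description above. The helper name `check_seedance_references` and the idea of validating on the client side are illustrative assumptions, not part of the model's API.

```python
# Limits taken from the Seedance 2.0 description; everything else is illustrative.
SEEDANCE_LIMITS = {"images": 9, "videos": 3, "audio": 3}


def check_seedance_references(images=(), videos=(), audio=()):
    """Return a list of problems; an empty list means the counts fit the limits."""
    counts = {"images": len(images), "videos": len(videos), "audio": len(audio)}
    problems = []
    for kind, limit in SEEDANCE_LIMITS.items():
        if counts[kind] > limit:
            problems.append(f"too many {kind}: {counts[kind]} > {limit}")
    return problems


if __name__ == "__main__":
    # Ten reference images is one over the stated limit of nine.
    print(check_seedance_references(images=["ref.png"] * 10, videos=["clip.mp4"]))
```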
Seedance 1.5 Pro offers cinema-quality output with multi-language lip-sync and cinematic camera movements.
Grok Imagine Video from xAI generates short video clips with synchronized audio in around 30 seconds. Multiple aspect ratios (16:9, 9:16, 1:1) make it a natural fit for TikTok, Reels, and Shorts.
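When a target frame size is known, the closest of the three listed aspect ratios can be picked programmatically. The sketch below assumes only the ratios named above (16:9, 9:16, 1:1); the helper `closest_aspect_ratio` is a hypothetical name, and how the model actually accepts the ratio is not specified here.

```python
# Aspect ratios listed for Grok Imagine Video, expressed as width / height.
SUPPORTED_RATIOS = {"16:9": 16 / 9, "9:16": 9 / 16, "1:1": 1.0}


def closest_aspect_ratio(width, height):
    """Return the supported ratio label nearest to the given frame size."""
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    target = width / height
    return min(SUPPORTED_RATIOS, key=lambda label: abs(SUPPORTED_RATIOS[label] - target))


if __name__ == "__main__":
    # A 1080x1920 vertical frame, the usual size for short-form vertical video.
    print(closest_aspect_ratio(1080, 1920))
```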
Vidu Q3 Pro supports a start-end-to-video mode — provide first and last frames and it generates smooth transitions between them. Up to 16 seconds at 1080p with audio. Vidu Q3 Turbo is a faster, cheaper variant.
Hailuo 2.3 from Minimax supports both text-to-video and image-to-video with standard and pro quality tiers. Hailuo 2.3 Fast trades some quality for speed.
PixVerse v5.6 is another cost-effective choice with unit-based pricing.
PrunaAI p-video offers T2V, I2V, and audio-to-video in a single endpoint. Its draft mode generates previews 4× faster for quick iteration before final rendering. Up to 1080p at 48 FPS.
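The draft-then-final workflow could look roughly like the sketch below. The model slug is the one listed on this page and `replicate.run` is the Replicate Python client's entry point, but the input keys `draft` and `resolution` are hypothetical placeholders; the model page is the place to confirm the real input schema.

```python
MODEL = "prunaai/p-video"  # slug as listed on this page


def build_inputs(prompt, final=False):
    """Build an input dict for a draft preview or a final render.

    The "draft" and "resolution" keys are hypothetical, not a confirmed schema.
    """
    inputs = {"prompt": prompt, "draft": not final}
    if final:
        inputs["resolution"] = "1080p"
    return inputs


def generate(prompt, final=False):
    """Run the model via the Replicate client (requires REPLICATE_API_TOKEN)."""
    import replicate  # imported lazily so build_inputs works without the client

    return replicate.run(MODEL, input=build_inputs(prompt, final=final))


if __name__ == "__main__":
    # Inspect the payloads without making a network call.
    print(build_inputs("a fox running through snow"))
    print(build_inputs("a fox running through snow", final=True))
```

The intent is to iterate on the prompt with draft previews and call `generate(prompt, final=True)` only once the preview looks right.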
The Wan video models are excellent open-source options, competitive with many proprietary models. Wan 2.7 T2V is the newest generation with a 27 billion parameter MoE architecture. Wan 2.5 T2V and the fast variants (Wan 2.5 T2V Fast, Wan 2.5 I2V Fast) are among the quickest options on Replicate.
Generative video is a rapidly advancing field. Check out the arena and leaderboard at Artificial Analysis to see what's popular today.
Featured models
bytedance/seedance-2.0
ByteDance's multimodal video generation model with native audio, multimodal reference inputs, and intelligent duration control.
Updated 2 days, 19 hours ago
64.2K runs
prunaai/p-video
Fast video generation with built-in draft mode for rapid creative iteration. Text-to-video, image-to-video, and audio-to-video in a single endpoint.
Updated 4 days, 16 hours ago
614.3K runs
VEED Fabric 1.0 is an image-to-video API that turns any image into a talking video
Updated 2 months, 1 week ago
23.5K runs
Recommended Models
The Wan fast variants are among the quickest text-to-video options. Grok Imagine Video generates clips with audio in about 30 seconds. PrunaAI p-video has a draft mode that generates previews 4x faster for quick iteration. Seedance 2.0 Fast and Seedance 1 Pro Fast are speed-optimized variants of their respective models.
Hailuo 2.3 supports both text-to-video and image-to-video with standard and pro quality tiers. PixVerse v5.6 uses unit-based pricing that keeps shorter, lower-resolution videos affordable. The Wan open-source models are the cheapest option overall.
Runway Gen-4.5 is ranked #1 on the Artificial Analysis benchmark for realistic physics and visual fidelity. Google Veo 3.1 is another top choice, especially with its native audio generation.
Most current-generation models generate audio alongside video: Kling Video 3.0, Seedance 2.0, Veo 3.1, Grok Imagine Video, Vidu Q3 Pro, Wan 2.5 T2V, and PrunaAI p-video all generate synchronized audio.
Kling Video 3.0 supports multi-shot mode with up to 6 connected scenes in a single generation. Seedance 2.0 supports video continuation for building longer sequences.
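A multi-shot plan can be sanity-checked against the stated limits before generation. In this sketch the 6-shot limit comes from the description above. Treating the 15-second cap as applying to all shots combined is an assumption, and `validate_shot_plan` is a hypothetical helper rather than part of any model API.

```python
MAX_SHOTS = 6            # from the Kling Video 3.0 description
MAX_TOTAL_SECONDS = 15   # assumption: the 15-second cap covers all shots combined


def validate_shot_plan(shots):
    """Check a list of (description, seconds) pairs and return the total duration."""
    if not shots:
        raise ValueError("plan needs at least one shot")
    if len(shots) > MAX_SHOTS:
        raise ValueError(f"{len(shots)} shots exceeds the limit of {MAX_SHOTS}")
    total = sum(seconds for _, seconds in shots)
    if total > MAX_TOTAL_SECONDS:
        raise ValueError(f"{total}s exceeds the limit of {MAX_TOTAL_SECONDS}s")
    return total


if __name__ == "__main__":
    plan = [
        ("wide shot of the product", 4),
        ("close-up on the label", 3),
        ("logo reveal", 5),
    ]
    print(validate_shot_plan(plan))
```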
The Wan video models are the strongest open-source option. Wan 2.7 T2V is the newest with a 27B parameter MoE architecture. Wan 2.5 T2V Fast is great for speed.
Most models produce 5-15 second clips. Kling Video 3.0 and Seedance 2.0 go up to 15 seconds. Vidu Q3 Pro goes up to 16 seconds. For longer content, use video extension models like Grok Imagine Video Extension to chain clips together.
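The number of clips to chain for a longer video is a simple ceiling division. The sketch below assumes every clip runs at the model's maximum length and that chained clips do not overlap; extension models may behave differently, and `clips_needed` is a hypothetical helper.

```python
import math


def clips_needed(target_seconds, max_clip_seconds=15):
    """Number of clips to chain so their combined length reaches target_seconds.

    Assumes each clip runs at the maximum length with no overlap between clips.
    """
    if target_seconds <= 0 or max_clip_seconds <= 0:
        raise ValueError("durations must be positive")
    return math.ceil(target_seconds / max_clip_seconds)


if __name__ == "__main__":
    for limit in (15, 16):
        print(f"60s from {limit}s clips: {clips_needed(60, limit)} clips")
```

Under these assumptions a 60-second sequence needs four clips at either the 15-second or the 16-second cap.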
Most models support commercial use, but always check the license on the model page, especially for open-source models.
Recommended Models
bytedance/seedance-2.0-fast
A faster variant of Seedance 2.0 for quicker video generation with multimodal inputs and native audio.
Updated 2 days, 19 hours ago
14K runs
Kling 2.5 Turbo Pro: Unlock pro-level text-to-video and image-to-video creation with smooth motion, cinematic depth, and remarkable prompt adherence.
Updated 1 week, 2 days ago
2.5M runs
New and improved version of Veo 3 Fast, with higher-fidelity video, context-aware audio and last frame support
Updated 3 weeks, 4 days ago
573.7K runs
New and improved version of Veo 3, with higher-fidelity video, context-aware audio, reference image and last frame support
Updated 3 weeks, 4 days ago
447.9K runs
Generate videos using xAI's Grok Imagine Video model
Updated 1 month, 3 weeks ago
520.6K runs
Kling Video 3.0 Omni: Unified multimodal video generation with reference images, video editing, native audio, and multi-shot control
Updated 1 month, 4 weeks ago
399.5K runs
runwayml/gen-4.5
State-of-the-art video motion quality, prompt adherence and visual fidelity
Updated 2 months ago
116.4K runs
Kling Video 3.0: Generate cinematic videos up to 15 seconds with multi-shot control, native audio, and improved consistency
Updated 2 months ago
149.2K runs
Modify an existing video through natural-language commands, changing subjects, environments, and visual style while preserving the original motion and timing.
Updated 2 months, 1 week ago
9K runs
bytedance/dreamactor-m2.0
Animate any character, humans, cartoons, animals, even non-humans, from a single image + driving video
Updated 2 months, 1 week ago
10K runs
Latest video model from Pixverse with astonishing physics
Updated 2 months, 3 weeks ago
18.4K runs

openai/sora-2-pro
OpenAI's most advanced synced-audio video generation
Updated 2 months, 4 weeks ago
107.5K runs

openai/sora-2
OpenAI's flagship video generation with synced audio
Updated 2 months, 4 weeks ago
297.9K runs
A very fast and cheap PrunaAI optimized version of Wan 2.2 A14B text-to-video
Updated 3 months ago
262.8K runs
A very fast and cheap PrunaAI optimized version of Wan 2.2 A14B image-to-video
Updated 3 months ago
10.3M runs
Kling 2.6 Pro: Top-tier image-to-video with cinematic visuals, fluid motion, and native audio generation
Updated 3 months, 2 weeks ago
574.5K runs
Alibaba Wan 2.5 text to video generation model
Updated 4 months, 2 weeks ago
34K runs
Alibaba Wan 2.5 Image to video generation with background audio
Updated 4 months, 2 weeks ago
208.4K runs

Sound on: Google’s flagship Veo 3 text to video model, with audio
Updated 4 months, 3 weeks ago
228.3K runs

A faster and cheaper version of Google’s Veo 3 video model, with audio
Updated 4 months, 3 weeks ago
186.3K runs

State of the art video generation model. Veo 2 can faithfully follow simple and complex instructions, and convincingly simulates real-world physics as well as a wide range of visual styles.
Updated 4 months, 3 weeks ago
107.7K runs

Quickly generate smooth 5s or 8s videos at 540p, 720p or 1080p
Updated 4 months, 3 weeks ago
44.1K runs

Quickly make 5s or 8s videos at 540p, 720p or 1080p. It has enhanced motion, prompt coherence and handles complex actions well.
Updated 4 months, 3 weeks ago
257.7K runs

Create 5s-8s videos with enhanced character movement, visual effects, and exclusive 1080p-8s support. Optimized for anime characters and complex actions
Updated 4 months, 3 weeks ago
777.3K runs
Wan 2.5 text-to-video, optimized for speed
Updated 4 months, 3 weeks ago
48.2K runs

Accelerated inference for Wan 2.1 14B text to video with high resolution, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.
Updated 4 months, 3 weeks ago
36.6K runs

Accelerated inference for Wan 2.1 14B image to video with high resolution, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.
Updated 4 months, 3 weeks ago
88.2K runs

Accelerated inference for Wan 2.1 14B image to video, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.
Updated 4 months, 3 weeks ago
446.8K runs

bytedance/seedance-1-pro
A pro version of Seedance that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 1080p resolution
Updated 5 months, 1 week ago
1.9M runs

bytedance/seedance-1-lite
A video generation model that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 720p resolution
Updated 5 months, 1 week ago
3.1M runs
bytedance/seedance-1-pro-fast
A faster and cheaper version of Seedance 1 Pro
Updated 5 months, 1 week ago
1.3M runs

Create 5s 480p videos from a text prompt
Updated 5 months, 1 week ago
11.1K runs

Generate 5s and 10s videos in 720p resolution
Updated 5 months, 1 week ago
95.4K runs

Generate 5s and 10s videos in 1080p resolution
Updated 5 months, 1 week ago
822.3K runs

A premium version of Kling v2.1 with superb dynamics and prompt adherence. Generate 1080p 5s and 10s videos from text or an image
Updated 5 months, 1 week ago
98.3K runs

Generate 5s and 10s videos in 720p resolution at 30fps
Updated 5 months, 1 week ago
1.6M runs

Use Kling v2.1 to generate 5s and 10s videos in 720p and 1080p resolution from a starting image (image-to-video)
Updated 5 months, 1 week ago
3.9M runs

luma/ray-2-540p
Generate 5s and 9s 540p videos
Updated 5 months, 1 week ago
11.6K runs

luma/ray-2-720p
Generate 5s and 9s 720p videos
Updated 5 months, 1 week ago
40K runs
Wan 2.5 image-to-video, optimized for speed
Updated 5 months, 1 week ago
63.8K runs

Accelerated inference for Wan 2.1 14B text to video, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.
Updated 5 months, 1 week ago
190.5K runs

luma/ray-flash-2-720p
Generate 5s and 9s 720p videos, faster and cheaper than Ray 2
Updated 5 months, 1 week ago
48.3K runs

Generate 6s videos with prompts or images. (Also known as Hailuo). Use a subject reference to make a video with a character and the S2V-01 model.
Updated 5 months, 1 week ago
695.3K runs

Generate videos with specific camera movements
Updated 5 months, 1 week ago
76K runs
A high-fidelity video generation model optimized for realistic human motion, cinematic VFX, expressive characters, and strong prompt and style adherence across both text-to-video and image-to-video workflows
Updated 5 months, 1 week ago
75K runs
A lower-latency image-to-video version of Hailuo 2.3 that preserves core motion quality, visual consistency, and stylization performance while enabling faster iteration cycles.
Updated 5 months, 1 week ago
112.2K runs

An image-to-video (I2V) model specifically trained for Live2D and general animation use cases
Updated 5 months, 1 week ago
184K runs
luma/ray-flash-2-540p
Generate 5s and 9s 540p videos, faster and cheaper than Ray 2
Updated 5 months, 1 week ago
67.5K runs
Hailuo 2 is a text-to-video and image-to-video model that can make 6s or 10s videos at 768p (standard) or 1080p (pro). It excels at real world physics.
Updated 5 months, 1 week ago
367.6K runs

Image-to-video at 720p and 480p with Wan 2.2 A14B
Updated 8 months, 1 week ago
53.2K runs
fofr/not-real
Make a very realistic looking real-world AI video
Updated 9 months ago
2.3K runs

Generate 5s 480p videos. Wan is an advanced and powerful visual generation model developed by Tongyi Lab of Alibaba Group
Updated 1 year, 1 month ago
48.8K runs

tencent/hunyuan-video
A state-of-the-art text-to-video generation model capable of creating high-quality videos with realistic motion from text descriptions
Updated 1 year, 2 months ago
117.6K runs

lightricks/ltx-video
LTX-Video is the first DiT-based video generation model capable of generating high-quality videos in real-time. It produces 24 FPS videos at a 768x512 resolution faster than they can be watched.
Updated 1 year, 3 months ago
168.9K runs

zsxkib/hunyuan-video2video
A state-of-the-art text-to-video generation model capable of creating high-quality videos with realistic motion from text descriptions
Updated 1 year, 4 months ago
3K runs

genmoai/mochi-1
Mochi 1 preview is an open video generation model with high-fidelity motion and strong prompt adherence in preliminary evaluation
Updated 1 year, 4 months ago
3.3K runs

zsxkib/pyramid-flow
Text-to-Video + Image-to-Video: Pyramid Flow Autoregressive Video Generation method based on Flow Matching
Updated 1 year, 6 months ago
9.3K runs

cuuupid/cogvideox-5b
Generate high quality videos from a prompt
Updated 1 year, 7 months ago
2.6K runs

meta/sam-2-video
SAM 2: Segment Anything v2 (for videos)
Updated 1 year, 8 months ago
65.5K runs

fofr/tooncrafter
Create videos from illustrated input images
Updated 1 year, 9 months ago
68.1K runs

fofr/video-morpher
Generate a video that morphs between subjects, with an optional style
Updated 1 year, 11 months ago
15.2K runs

cjwbw/videocrafter
VideoCrafter2: Text-to-Video and Image-to-Video Generation and Editing
Updated 2 years, 2 months ago
167.4K runs

ali-vilab/i2vgen-xl
RESEARCH/NON-COMMERCIAL USE ONLY: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models
Updated 2 years, 3 months ago
128.4K runs

open-mmlab/pia
Personalized Image Animator
Updated 2 years, 3 months ago
103.5K runs

zsxkib/animatediff-illusions
Monster Labs' Controlnet QR Code Monster v2 For SD-1.5 on top of AnimateDiff Prompt Travel (Motion Module SD 1.5 v2)
Updated 2 years, 5 months ago
10.6K runs

lucataco/hotshot-xl
😊 Hotshot-XL is an AI text-to-GIF model trained to work alongside Stable Diffusion XL
Updated 2 years, 5 months ago
928.2K runs

zsxkib/animatediff-prompt-travel
🎨 AnimateDiff Prompt Travel 🧭 Seamlessly Navigate and Animate Between Text-to-Image Prompts for Dynamic Visual Narratives
Updated 2 years, 6 months ago
5.7K runs

zsxkib/animate-diff
🎨 AnimateDiff (w/ MotionLoRAs for Panning, Zooming, etc): Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Updated 2 years, 6 months ago
59.4K runs
lucataco/animate-diff
Animate Your Personalized Text-to-Image Diffusion Models
Updated 2 years, 6 months ago
334.6K runs

anotherjesse/zeroscope-v2-xl
Zeroscope V2 XL & 576w
Updated 2 years, 9 months ago
303.7K runs
cjwbw/controlvideo
Training-free Controllable Text-to-Video Generation
Updated 2 years, 10 months ago
2.4K runs
cjwbw/text2video-zero
Text-to-Image Diffusion Models are Zero-Shot Video Generators
Updated 3 years ago
42.1K runs
cjwbw/damo-text-to-video
Multi-stage text-to-video generation
Updated 3 years ago
158.3K runs
andreasjansson/tile-morph
Create tileable animations with seamless transitions
Updated 3 years, 2 months ago
529.4K runs

arielreplicate/deoldify_video
Add colours to old video footage.
Updated 3 years, 2 months ago
15.1K runs

pollinations/real-basicvsr-video-superresolution
RealBasicVSR: Investigating Tradeoffs in Real-World Video Super-Resolution
Updated 3 years, 2 months ago
9.3K runs

arielreplicate/robust_video_matting
Extract the foreground of a video
Updated 3 years, 4 months ago
113.4K runs

arielreplicate/stable_diffusion_infinite_zoom
Use Runway's Stable-diffusion inpainting model to create an infinite loop video
Updated 3 years, 5 months ago
38.5K runs
andreasjansson/stable-diffusion-animation
Animate Stable Diffusion by interpolating between two prompts
Updated 3 years, 5 months ago
119.6K runs
deforum/deforum_stable_diffusion
Animating prompts with stable diffusion
Updated 3 years, 7 months ago
267.4K runs