These models can generate and edit videos from text prompts and images. They use advanced AI techniques like diffusion models and latent space interpolation to create high-quality, controllable video content.
Key capabilities:
For most people looking to generate custom videos from text prompts, we recommend google/veo-3-fast. Videos can also be generated with a 9:16 aspect ratio for short-form content.
(Btw, we also do recommend its successors google/veo-3.1 and google/veo-3.1-fast, but we do tend to see some overbaked results)
Kling v2.6 is a top-tier image-to-video with cinematic visuals, fluid motion, and native audio generation. It is definitely our choice for rendering accurate and complex physics.
With its fuzzy diffusion quality, Sora 2 is great at developing outputs with that home video/realistic quality. You can really push this model to its limits.
The Wan video models model by Wan-AI is an excellent open-source option, competitive with the best proprietary video models. Try adjusting the number of steps used for each frame to trade off between generation speed and detail.
Generative video is a rapidly advancing field. Check out the arena and leaderboard at Artificial Analysis to see what's popular today.
Featured models

openai/sora-2OpenAI's Flagship video generation with synced audio
Updated 1 week ago
190.4K runs
Kling 2.5 Turbo Pro: Unlock pro-level text-to-video and image-to-video creation with smooth motion, cinematic depth, and remarkable prompt adherence.
Updated 2 weeks, 1 day ago
1.7M runs
Alibaba Wan 2.5 text to video generation model
Updated 1 month, 3 weeks ago
31.2K runs
Alibaba Wan 2.5 Image to video generation with background audio
Updated 1 month, 3 weeks ago
173.3K runs

New and improved version of Veo 3 Fast, with higher-fidelity video, context-aware audio and last frame support
Updated 2 months ago
328.3K runs
New and improved version of Veo 3, with higher-fidelity video, context-aware audio, reference image and last frame support
Updated 2 months ago
312.9K runs

Create 5s-8s videos with enhanced character movement, visual effects, and exclusive 1080p-8s support. Optimized for anime characters and complex actions
Updated 2 months ago
758.7K runs
Wan 2.5 text-to-video, optimized for speed
Updated 2 months ago
39.9K runs
bytedance/seedance-1-pro-fastA faster and cheaper version of Seedance 1 Pro
Updated 2 months, 2 weeks ago
572.9K runs
Wan 2.5 image-to-video, optimized for speed
Updated 2 months, 3 weeks ago
48K runs
A high-fidelity video generation model optimized for realistic human motion, cinematic VFX, expressive characters, and strong prompt and style adherence across both text-to-video and image-to-video workflows
Updated 2 months, 3 weeks ago
45.8K runs
A lower-latency image-to-video version of Hailuo 2.3 that preserves core motion quality, visual consistency, and stylization performance while enabling faster iteration cycles.
Updated 2 months, 3 weeks ago
36.5K runs
Recommended Models
The open-source Wan suite (like wan-video/wan-2.1-t2v-480p) is among the faster text-to-video options on Replicate, especially at lower resolutions and shorter durations. Many models also have “fast” variants, like google/veo-3-fast, designed for quicker turnaround.
Note: Faster runs usually mean lower resolution or simpler motion.
PixVerse v4 offers a strong balance for many use cases. It uses a unit-based system at $0.01 per unit — for example, a 5-second, 360p video costs about $0.30. Hailuo 02 is another good middle-ground option, with both standard and pro modes for different quality levels. Your ideal choice depends on how much resolution and runtime you need and how much you want to spend.
For short, stylized clips (5–10 seconds at lower resolution), PixVerse v4 and Wan models are great picks. They’re fast and relatively inexpensive, making them ideal for concept work, storyboarding, or rapid iteration.
If you want high-fidelity motion, longer clips, or more realistic physics, Veo 3 or Hailuo 02 Pro are better options. Hailuo 02 supports 768p in standard mode and 1080p in Pro mode, which makes it a solid choice for more polished results.
Most text-to-video models generate short video clips (5–10 seconds) at 24 or 30 fps. Supported resolutions range from 360p to 1080p, depending on the model. Some, like Veo 3, can include audio as part of the output.
Costs vary by model and resolution:
You can push your own model by packaging it with Cog and deploying it. If you’re working with open-source video models, you can also fine-tune them and publish your version for others to use.
Yes, but always check the model’s license. Most text-to-video models on Replicate are available for commercial use, but some authors include additional restrictions.
You can use the Replicate playground or run them programmatically.
Recommended Models

openai/sora-2-proOpenAI's Most advanced synced-audio video generation
Updated 1 week ago
76.7K runs
A very fast and cheap PrunaAI optimized version of Wan 2.2 A14B text-to-video
Updated 1 week, 5 days ago
168.8K runs
A very fast and cheap PrunaAI optimized version of Wan 2.2 A14B image-to-video
Updated 1 week, 5 days ago
6.5M runs
VEED Fabric 1.0 is an image-to-video API that turns any image into a talking video
Updated 1 month, 2 weeks ago
2.9K runs

Sound on: Google’s flagship Veo 3 text to video model, with audio
Updated 2 months ago
215K runs

A faster and cheaper version of Google’s Veo 3 video model, with audio
Updated 2 months ago
158.9K runs

State of the art video generation model. Veo 2 can faithfully follow simple and complex instructions, and convincingly simulates real-world physics as well as a wide range of visual styles.
Updated 2 months ago
105.4K runs

Quickly generate smooth 5s or 8s videos at 540p, 720p or 1080p
Updated 2 months ago
39.1K runs

Quickly make 5s or 8s videos at 540p, 720p or 1080p. It has enhanced motion, prompt coherence and handles complex actions well.
Updated 2 months ago
238K runs

Accelerated inference for Wan 2.1 14B text to video with high resolution, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.
Updated 2 months ago
35.3K runs

Accelerated inference for Wan 2.1 14B image to video with high resolution, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.
Updated 2 months ago
86.2K runs

Accelerated inference for Wan 2.1 14B image to video, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.
Updated 2 months ago
436.3K runs

bytedance/seedance-1-proA pro version of Seedance that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 1080p resolution
Updated 2 months, 2 weeks ago
1.5M runs

bytedance/seedance-1-liteA video generation model that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 720p resolution
Updated 2 months, 2 weeks ago
2.6M runs

Create 5s 480p videos from a text prompt
Updated 2 months, 3 weeks ago
10.2K runs

Generate 5s and 10s videos in 720p resolution
Updated 2 months, 3 weeks ago
87.8K runs

Generate 5s and 10s videos in 1080p resolution
Updated 2 months, 3 weeks ago
805.8K runs

A premium version of Kling v2.1 with superb dynamics and prompt adherence. Generate 1080p 5s and 10s videos from text or an image
Updated 2 months, 3 weeks ago
84.8K runs

Generate 5s and 10s videos in 720p resolution at 30fps
Updated 2 months, 3 weeks ago
1.5M runs

Use Kling v2.1 to generate 5s and 10s videos in 720p and 1080p resolution from a starting image (image-to-video)
Updated 2 months, 3 weeks ago
3.3M runs

luma/ray-2-540pGenerate 5s and 9s 540p videos
Updated 2 months, 3 weeks ago
10.8K runs

luma/ray-2-720pGenerate 5s and 9s 720p videos
Updated 2 months, 3 weeks ago
35K runs

Accelerated inference for Wan 2.1 14B text to video, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.
Updated 2 months, 3 weeks ago
183.4K runs

luma/rayFast, high quality text-to-video and image-to-video (Also known as Dream Machine)
Updated 2 months, 3 weeks ago
69.7K runs

luma/ray-flash-2-720pGenerate 5s and 9s 720p videos, faster and cheaper than Ray 2
Updated 2 months, 3 weeks ago
42.6K runs

Generate 6s videos with prompts or images. (Also known as Hailuo). Use a subject reference to make a video with a character and the S2V-01 model.
Updated 2 months, 3 weeks ago
651.1K runs

Generate videos with specific camera movements
Updated 2 months, 3 weeks ago
73.7K runs

An image-to-video (I2V) model specifically trained for Live2D and general animation use cases
Updated 2 months, 3 weeks ago
173.5K runs
luma/ray-flash-2-540pGenerate 5s and 9s 540p videos, faster and cheaper than Ray 2
Updated 2 months, 3 weeks ago
57.2K runs
Hailuo 2 is a text-to-video and image-to-video model that can make 6s or 10s videos at 768p (standard) or 1080p (pro). It excels at real world physics.
Updated 2 months, 3 weeks ago
313.1K runs

Image-to-video at 720p and 480p with Wan 2.2 A14B
Updated 5 months, 2 weeks ago
46.2K runs
fofr/not-realMake a very realistic looking real-world AI video
Updated 6 months, 1 week ago
2.3K runs

Generate 5s 480p videos. Wan is an advanced and powerful visual generation model developed by Tongyi Lab of Alibaba Group
Updated 11 months ago
46.7K runs

tencent/hunyuan-videoA state-of-the-art text-to-video generation model capable of creating high-quality videos with realistic motion from text descriptions
Updated 1 year ago
116.4K runs

lightricks/ltx-videoLTX-Video is the first DiT-based video generation model capable of generating high-quality videos in real-time. It produces 24 FPS videos at a 768x512 resolution faster than they can be watched.
Updated 1 year ago
164.5K runs

zsxkib/hunyuan-video2videoA state-of-the-art text-to-video generation model capable of creating high-quality videos with realistic motion from text descriptions
Updated 1 year, 1 month ago
3K runs

genmoai/mochi-1Mochi 1 preview is an open video generation model with high-fidelity motion and strong prompt adherence in preliminary evaluation
Updated 1 year, 1 month ago
3.1K runs

zsxkib/pyramid-flowText-to-Video + Image-to-Video: Pyramid Flow Autoregressive Video Generation method based on Flow Matching
Updated 1 year, 3 months ago
9.2K runs

cuuupid/cogvideox-5bGenerate high quality videos from a prompt
Updated 1 year, 5 months ago
2.6K runs

meta/sam-2-videoSAM 2: Segment Anything v2 (for videos)
Updated 1 year, 5 months ago
54.7K runs

fofr/tooncrafterCreate videos from illustrated input images
Updated 1 year, 6 months ago
65.6K runs

fofr/video-morpherGenerate a video that morphs between subjects, with an optional style
Updated 1 year, 9 months ago
15.1K runs

cjwbw/videocrafterVideoCrafter2: Text-to-Video and Image-to-Video Generation and Editing
Updated 1 year, 11 months ago
147.3K runs

ali-vilab/i2vgen-xlRESEARCH/NON-COMMERCIAL USE ONLY: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models
Updated 2 years ago
128.3K runs

open-mmlab/piaPersonalized Image Animator
Updated 2 years ago
103.5K runs

zsxkib/animatediff-illusionsMonster Labs' Controlnet QR Code Monster v2 For SD-1.5 on top of AnimateDiff Prompt Travel (Motion Module SD 1.5 v2)
Updated 2 years, 2 months ago
10.5K runs

lucataco/hotshot-xl😊 Hotshot-XL is an AI text-to-GIF model trained to work alongside Stable Diffusion XL
Updated 2 years, 3 months ago
897.1K runs

zsxkib/animatediff-prompt-travel🎨AnimateDiff Prompt Travel🧭 Seamlessly Navigate and Animate Between Text-to-Image Prompts for Dynamic Visual Narratives
Updated 2 years, 3 months ago
5.7K runs

zsxkib/animate-diff🎨 AnimateDiff (w/ MotionLoRAs for Panning, Zooming, etc): Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Updated 2 years, 3 months ago
59.2K runs
lucataco/animate-diffAnimate Your Personalized Text-to-Image Diffusion Models
Updated 2 years, 4 months ago
326.8K runs

anotherjesse/zeroscope-v2-xlZeroscope V2 XL & 576w
Updated 2 years, 6 months ago
302K runs
cjwbw/controlvideoTraining-free Controllable Text-to-Video Generation
Updated 2 years, 8 months ago
2.4K runs
cjwbw/text2video-zeroText-to-Image Diffusion Models are Zero-Shot Video Generators
Updated 2 years, 9 months ago
42K runs
cjwbw/damo-text-to-videoMulti-stage text-to-video generation
Updated 2 years, 10 months ago
156.2K runs
andreasjansson/tile-morphCreate tileable animations with seamless transitions
Updated 2 years, 11 months ago
529.4K runs

arielreplicate/deoldify_videoAdd colours to old video footage.
Updated 2 years, 11 months ago
10.1K runs

pollinations/real-basicvsr-video-superresolutionRealBasicVSR: Investigating Tradeoffs in Real-World Video Super-Resolution
Updated 2 years, 11 months ago
9.3K runs

arielreplicate/robust_video_mattingextract foreground of a video
Updated 3 years, 1 month ago
74.7K runs

arielreplicate/stable_diffusion_infinite_zoomUse Runway's Stable-diffusion inpainting model to create an infinite loop video
Updated 3 years, 2 months ago
38.5K runs
andreasjansson/stable-diffusion-animationAnimate Stable Diffusion by interpolating between two prompts
Updated 3 years, 2 months ago
119.6K runs
nateraw/stable-diffusion-videosGenerate videos by interpolating the latent space of Stable Diffusion
Updated 3 years, 4 months ago
58.5K runs
deforum/deforum_stable_diffusionAnimating prompts with stable diffusion
Updated 3 years, 4 months ago
266.5K runs