Explore

Fine-tune FLUX fast
Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate
Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. It's fast (under 2 minutes), cheap (under $2), and gives you a warm, runnable model plus LoRA weights to download.
Featured models

google / veo-3-fast
A faster and cheaper version of Google’s Veo 3 video model, with audio
zsxkib / thinksound
Generate contextual audio from video using step-by-step reasoning🎶
minimax / hailuo-02
Hailuo 2 is a text-to-video and image-to-video model that can make 6s or 10s videos at 720p (standard) or 1080p (pro). It excels at real world physics.

bytedance / seedance-1-pro
A pro version of Seedance that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 1080p resolution

black-forest-labs / flux-kontext-pro
A state-of-the-art text-based image editing model that delivers high-quality outputs with excellent prompt following and consistent results for transforming images through natural language

runwayml / gen4-image
Runway's Gen-4 Image model with references. Use up to 3 reference images to create the exact image you need. Capture every angle.

fofr / any-comfyui-workflow
Run any ComfyUI workflow. Guide: https://github.com/replicate/cog-comfyui

ideogram-ai / ideogram-v3-turbo
Turbo is the fastest and cheapest Ideogram v3. v3 creates images with stunning realism, creative designs, and consistent styles

prunaai / hidream-l1-fast
This is an optimised version of the hidream-l1 model using the pruna ai optimisation toolkit!
Official models
Official models are always on, maintained, and have predictable pricing.
I want to…
Generate images
Models that generate images from text prompts
Caption videos
Models that generate text from videos
Generate speech
Convert text to speech
Use a face to make images
Make realistic images of people instantly
Generate videos
Models that create and edit videos
Upscale images
Upscaling models that create high-quality images from low-quality images
Generate music
Models to generate and modify music
Edit images
Tools for editing images.
Transcribe speech
Models that convert speech to text
Extract text from images
Optical character recognition (OCR) and text extraction
Remove backgrounds
Models that remove backgrounds from images and videos
Use the FLUX family of models
The FLUX family of text-to-image models from Black Forest Labs
Restore images
Models that improve or restore images by deblurring, colorization, and removing noise
Enhance videos
Upscaling models that create high-quality video from low-quality videos
Use LLMs
Models that can understand and generate text
Edit Videos
Tools for editing videos.
Make 3D stuff
Models that generate 3D objects, scenes, radiance fields, textures and multi-views.
Make videos with Wan2.1
Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.
Caption images
Models that generate text from images
Chat with images
Ask language models about images
Use handy tools
Toolbelt-type models for videos and images.
Control image generation
Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.
Sing with voices
Voice-to-voice cloning and musical prosody
Get embeddings
Models that generate embeddings from inputs
Try for free
Get started with these models without adding a credit card. Whether you're making videos, generating images, or upscaling photos, these are great starting points.
Use official models
Official models are always on, maintained, and have predictable pricing.
Detect objects
Models that detect or segment objects in images and videos.
Use FLUX fine-tunes
Browse the diverse range of fine-tunes the community has custom-trained on Replicate
Popular models
This is the fastest Flux Dev endpoint in the world, contact us for more at pruna.ai
Practical face restoration algorithm for *old photos* or *AI-generated faces*
SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps
Return CLIP features for the clip-vit-large-patch14 model
Real-ESRGAN with optional face correction and adjustable upscale
Latest models
A faster and cheaper version of Google’s Veo 3 video model, with audio
Generate contextual audio from video using step-by-step reasoning🎶
Hailuo 2 is a text-to-video and image-to-video model that can make 6s or 10s videos at 720p (standard) or 1080p (pro). It excels at real world physics.
A video generation model that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 720p resolution
Bria Expand expands images beyond their borders in high quality. Resizing the image by generating new pixels to expand to the desired aspect ratio. Trained exclusively on licensed data for safe and risk-free commercial use
Bria Background Generation allows for efficient swapping of backgrounds in images via text prompts or reference image, delivering realistic and polished results. Trained exclusively on licensed data for safe and risk-free commercial use
Bria GenFill enables high-quality object addition or visual transformation. Trained exclusively on licensed data for safe and risk-free commercial use.
Commercial-ready, trained entirely on licensed data, text-to-image model. With only 4B parameters provides exceptional aesthetics and text rendering. Evaluated to be on par to other leading models in the market
Quickly make 5s or 8s videos at 540p, 720p or 1080p. It has enhanced motion, prompt coherence and handles complex actions well.
Quickly generate smooth 5s or 8s videos at 540p, 720p or 1080p
Create videos in as little as 10 seconds. 5s or 8s videos at 360p, 540p, 720p or 1080p.
A pro version of Seedance that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 1080p resolution
GPU accelerated replay renderer / video data clipper for comma.ai connect's openpilot route data. SEE README.
[Quality Mode] Scaling Diffusion Models for High Resolution Textured 3D Assets Generation
SOTA Object removal, enables precise removal of unwanted objects from images while maintaining high-quality outputs. Trained exclusively on licensed data for safe and risk-free commercial use
Bria Increase resolution upscales the resolution of any image. It increases resolution using a dedicated upscaling method that preserves the original image content without regeneration.
Flux Content Filter - Check for public figures and copyright concerns
Open-weight version of FLUX.1 Kontext via Hugging Face Diffusers
A premium text-based image editing model that delivers maximum performance and improved typography generation for transforming images through natural language prompts
A state-of-the-art text-based image editing model that delivers high-quality outputs with excellent prompt following and consistent results for transforming images through natural language
Fast endpoint for Flux Kontext, optimized with pruna framework
Runway's Gen-4 Image model with references. Use up to 3 reference images to create the exact image you need. Capture every angle.
Use flux-kontext-pro to change the first or last frame of a video. Useful to use as inputs for restyling an entire video in a certain way
Audio-driven multi-person conversational video generation - Upload audio files and a reference image to create realistic conversations between multiple people
Generates unrestricted images from text prompts using a fine-tuned Stable Diffusion model
Recognise, describe and retrieve data within an image with great accuracy.
Run any ComfyUI workflow. Guide: https://github.com/replicate/cog-comfyui
A text-to-image model with support for native high-resolution (2K) image generation
A version of flux-dev, a text to image model, that supports fast fine-tuned lora inference
A 12 billion parameter rectified flow transformer capable of generating images from text descriptions
The fastest image generation model tailored for local development and personal use
The fastest image generation model tailored for fine-tuned use