Explore

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. It's fast (under 2 minutes), cheap (under $2), and gives you a warm, runnable model plus LoRA weights to download.

Official models

Official models are always on, maintained, and have predictable pricing.

View all official models

I want to…

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Upscale images

Upscaling models that create high-quality images from low-quality images

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

Updated 98 runs

Updated 39.7K runs

Updated 524 runs

Updated 1.4K runs

Updated 1.4K runs

Updated 2.8K runs

Updated 1.4K runs

Extract the first or last frame from any video file as a high-quality image

Updated 240 runs

CSM (Conversational Speech Model) is a speech generation model from Sesame that generates RVQ audio codes from text and audio inputs

Updated 386 runs

flux-1.dev

Updated 20 runs

Updated 498 runs

Fish Speech V1.5-SOTA Open Source TTS

Updated 368 runs

Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Updated 2.3K runs

Updated 101.4K runs

Orpheus 3B - high quality, emotive Text to Speech

Updated 15.4K runs

Updated 90.3K runs

CosyVoice2-0.5B-Scalable Streaming Speech Synthesis with Large Language Models

Updated 1K runs

A model Flux.1-dev-Controlnet-Upscaler by www.androcoders.in

Updated 440 runs

Realistic Inpainting with ControlNET (M-LSD + SEG)

Updated 475.6K runs

Hunyuan3D-2mv is finetuned from Hunyuan3D-2 to support multiview controlled shape generation.

Updated 1.1K runs

LatentSync: generate high-quality lip sync animations

Updated 37.9K runs

detect correct orientation of images

Updated 18 runs

Hyper FLUX 8-step by ByteDance

Updated 12.7M runs

ShieldGemma 2 is a model trained on Gemma 3's 4B IT checkpoint for image safety classification across key categories that takes in images and outputs safety labels per policy.

Updated 62 runs

Fast, efficient image variation model for rapid iteration and experimentation.

Updated 36.3K runs

Open-weight image variation model. Create new versions while preserving key elements of your original.

Updated 229.3K runs

The fastest image generation model tailored for local development and personal use

Updated 347.9M runs

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 964.8M runs

Run Wan2.1 14b or 1.3b with a lora

Updated 21.7K runs

Wan2.1 14B 480p LoRA inference via Diffusers (Work in progress)

Updated 501 runs

Open-weight depth-aware image generation. Edit images while preserving spatial relationships.

Updated 378.7K runs

Open-weight edge-guided image generation. Control structure and composition using Canny edge detection.

Updated 89.7K runs

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Updated 10.3K runs

a test run for hello world

Updated 6 runs

A 12 billion parameter rectified flow transformer capable of generating images from text descriptions

Updated 19M runs

FalconAIs NSFW detection model, extended for videos

Updated 34.7K runs

Updated 488 runs

Updated 205 runs

Updated 259 runs

Updated 349 runs

Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models.

Updated 2.7K runs

Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models.

Updated 7.9K runs

Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models.

Updated 4.7K runs

Updated 745 runs

Updated 307 runs

PuLID-FLUX-v0.9.0

Updated 156 runs

Updated 434 runs

An experimental model for testing out different failure modes

Updated 47 runs