Explore

I want to…

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

Updated 140 runs

flux dev

Updated 92.9K runs

Fish Speech V1.5-SOTA Open Source TTS

Updated 174 runs

A model Flux.1-dev-Controlnet-Upscaler by www.androcoders.in

Updated 13 runs

Controllable generative AI art

Updated 312 runs

Scaling Diffusion Models for High Resolution Textured 3D Assets Generation

Updated 2.3K runs

Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Updated 568 runs

Updated 2.3K runs

Orpheus 3B - high quality, emotive Text to Speech

Updated 513 runs

Updated 2.6K runs

This model generates pose variation of a cartoon character. It preserves the cartoon identity. Use this model to augment training dataset for any cartoon character created through AI. The augmented dataset can be used to train a LoRA model.

Updated 3.5K runs

Run any ComfyUI workflow. Guide: https://github.com/replicate/cog-comfyui

Updated 2.9M runs

kwaivgi/kling-v1.6-pro

Generate 5s and 10s videos in 1080p resolution

Updated 77K runs

CosyVoice2-0.5B-Scalable Streaming Speech Synthesis with Large Language Models

Updated 579 runs

A model Flux.1-dev-Controlnet-Upscaler by www.androcoders.in

Updated 369 runs

Realistic Inpainting with ControlNET (M-LSD + SEG)

Updated 398.2K runs

Hunyuan3D-2mv is finetuned from Hunyuan3D-2 to support multiview controlled shape generation.

Updated 321 runs

LatentSync: generate high-quality lip sync animations

Updated 18.9K runs

detect correct orientation of images

Updated 17 runs

Hyper FLUX 8-step by ByteDance

Updated 7.9M runs

ShieldGemma 2 is a model trained on Gemma 3's 4B IT checkpoint for image safety classification across key categories that takes in images and outputs safety labels per policy.

Updated 5 runs

black-forest-labs/flux-redux-schnell

Fast, efficient image variation model for rapid iteration and experimentation.

Updated 23.5K runs

black-forest-labs/flux-redux-dev

Open-weight image variation model. Create new versions while preserving key elements of your original.

Updated 158.3K runs

CSM (Conversational Speech Model) is a speech generation model from Sesame that generates RVQ audio codes from text and audio inputs

Updated 64 runs

black-forest-labs/flux-schnell-lora

The fastest image generation model tailored for fine-tuned use

Updated 797.3K runs

black-forest-labs/flux-schnell

The fastest image generation model tailored for local development and personal use

Updated 256.6M runs

Updated 311 runs

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 837.2M runs

Run Wan2.1 14b or 1.3b with a lora

Updated 3K runs

Social Media Comment Analysis

Updated 37 runs

Wan2.1 14B 480p LoRA inference via Diffusers (Work in progress)

Updated 195 runs

black-forest-labs/flux-depth-dev

Open-weight depth-aware image generation. Edit images while preserving spatial relationships.

Updated 161.9K runs

black-forest-labs/flux-canny-dev

Open-weight edge-guided image generation. Control structure and composition using Canny edge detection.

Updated 52.9K runs

black-forest-labs/flux-dev-lora

A version of flux-dev, a text to image model, that supports fast fine-tuned lora inference

Updated 501K runs

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Updated 9.6K runs

a test run for hello world

Updated 4 runs

black-forest-labs/flux-dev

A 12 billion parameter rectified flow transformer capable of generating images from text descriptions

Updated 14M runs

FalconAIs NSFW detection model, extended for videos

Updated 22K runs

luma/ray-flash-2-720p

Generate 5s and 9s 720p videos, faster and cheaper than Ray 2

Updated 1.8K runs

luma/ray-flash-2-540p

Generate 5s and 9s 540p videos, faster and cheaper than Ray 2

Updated 416 runs

Updated 19.9K runs

Updated 141 runs

Updated 14 runs

Updated 45 runs

Updated 56 runs

Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models.

Updated 253 runs

Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models.

Updated 279 runs

Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models.

Updated 2.2K runs

Updated 464 runs