Explore

I want to…

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

black-forest-labs/flux-dev

A 12 billion parameter rectified flow transformer capable of generating images from text descriptions

Updated 13.4M runs

FalconAIs NSFW detection model, extended for videos

Updated 20.9K runs

Updated 19.8K runs

Updated 108 runs

Updated 8 runs

Updated 15 runs

Updated 50 runs

Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models.

Updated 46 runs

Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models.

Updated 78 runs

Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models.

Updated 825 runs

Updated 62 runs

Updated 52 runs

Updated 41 runs

Updated 29 runs

An experimental model for testing out different failure modes

Updated 16 runs

Run Wan2.1 14b or 1.3b with a lora

Updated 1K runs

Photomaker V1 optimized with Lightning 8steps

Updated 26 runs

Inpainting and video2video experiments with Wan 2.1

Updated 103 runs

Updated 62 runs

This model generates pose variation of a cartoon character. It preserves the cartoon identity. Use this model to augment training dataset for any cartoon character created through AI. The augmented dataset can be used to train a LoRA model.

Updated 3.3K runs

PNG Generation Model https://hipng.com/

Updated 36 runs

Updated 21.1K runs

Updated 41 runs

Updated 10.2K runs

Updated 275 runs

SOTA Open Source TTS

Updated 158 runs

"DiffRhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion"

Updated 66 runs

Updated 20 runs

Microsoft Magma: A Foundation Model for Multimodal AI Agents

Updated 15 runs

Updated 49 runs

Updated 14 runs

Updated 59 runs

CogView-4 model, which has 6B parameters, supports native Chinese input, and Chinese text-to-image generation.

Updated 62 runs

QwQ is the reasoning model of the Qwen series. Compared with conventional instruction-tuned models, QwQ, which is capable of thinking and reasoning

Updated 80 runs

Run any ComfyUI workflow. Guide: https://github.com/replicate/cog-comfyui

Updated 2.6M runs

Updated 1K runs

Updated 68 runs

Updated 26 runs

Updated 27 runs

Updated 67 runs

ibm-granite/granite-vision-3.2-2b

Granite-Vision-3.2-2B is a compact and efficient vision-language model, specifically designed for visual document understanding.

Updated 6.2K runs

Updated 79 runs