Explore

I want to…

Use official models

Official models are always on, maintained, and have predictable pricing.

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

Updated 673 runs

Updated 20K runs

Updated 19 runs

Microsoft Magma: A Foundation Model for Multimodal AI Agents

Updated 13 runs

Updated 44 runs

Updated 12 runs

Updated 7 runs

Inpainting and video2video experiments with Wan 2.1

Updated 69 runs

Updated 15 runs

Updated 269 runs

Updated 451 runs

Enhance your video quality with AI

Updated 11 runs

Updated 6 runs

Updated 17 runs

Updated 48 runs

CogView-4 model, which has 6B parameters, supports native Chinese input, and Chinese text-to-image generation.

Updated 39 runs

QwQ is the reasoning model of the Qwen series. Compared with conventional instruction-tuned models, QwQ, which is capable of thinking and reasoning

Updated 24 runs

Run any ComfyUI workflow. Guide: https://github.com/replicate/cog-comfyui

Updated 2.3M runs

Updated 394 runs

Updated 55 runs

Updated 21 runs

Updated 13 runs

Updated 60 runs

ibm-granite/granite-vision-3.2-2b

Granite-Vision-3.2-2B is a compact and efficient vision-language model, specifically designed for visual document understanding.

Updated 4.6K runs

Updated 71 runs

Updated 123 runs

flux dev

Updated 87.5K runs

Wan 2.1 1.3b Video to Video. Wan is a powerful visual generation model developed by Tongyi Lab of Alibaba Group

Updated 33 runs

Updated 9K runs

Updated 28 runs

Removes furniture

Updated 562 runs

An upscaler based on tile and inpaint controlnets, aimed to preserve the original image while injecting more details.

Updated 90 runs

LatentSync: generate high-quality lip sync animations

Updated 15.2K runs

Updated 27.4K runs

SOTA Open Source TTS

Updated 138 runs

Updated 59 runs

minimax/image-01

Minimax's first image model

Updated 9K runs

Updated 255 runs

Updated 5 runs

Updated 15 runs

Scaling Diffusion Models for High Resolution Textured 3D Assets Generation

Updated 1.6K runs

Updated 185 runs

Updated 81 runs

No pages por DeepResearch. Este nuevo agente de investigación viene a resolver todo por ti, solo dale tiempo :D

Updated 77 runs

Phi-4-multimodal-instruct is a lightweight open multimodal foundation model that leverages the language, vision, and speech research and datasets used for Phi-3.5 and 4.0 models.

Updated 116 runs

wavespeedai/wan-2.1-i2v-480p

Accelerated inference for Wan 2.1 14B image to video, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.

Updated 15.2K runs

wavespeedai/wan-2.1-t2v-480p

Accelerated inference for Wan 2.1 14B text to video, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.

Updated 8.2K runs