Explore

I want to…

Upscale images

Upscaling models that create high-quality images from low-quality images

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Flux fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Latest models

Detect hate speech or toxic comments in tweets/texts

Updated 67.3K runs

Kolors with style transfer, composition transfer and other IPAdapter techniques

Updated 16.4K runs

The largest fully open-source flow-based generation model capable of text-to-image generation

Updated 5.7K runs

A large-scale text-to-image generation model based on latent diffusion, developed by the Kuaishou Kolors team

Updated 20.1K runs

SDXL Finetuned on Zine-Style Portraits

Updated 4K runs

Visual instruction tuning towards large language and vision models with GPT-4 level capabilities

Updated 19.1M runs

Generate seamless 360 photos using SDXL

Updated 355 runs

Face Restoration

Updated 2.5K runs

A text-to-image model with greatly improved performance in image quality, typography, complex prompt understanding, and resource-efficiency

Updated 1.4M runs

MimicMotion: High-quality human motion video generation with pose-guided control

Updated 1.8K runs

Remove the background from retailer product images

Updated 49 runs

Make realistic images of real people instantly (w/ ip-adapter-plus-face_sdxl_vit-h)

Updated 2.7K runs

PixArt Sigma 900M is a text-to-image generation model based on the PixArt Sigma architecture

Updated 168 runs

araby.ai one-shot video face swap

Updated 6.4K runs

NuminaMath is a series of language models that are trained to solve math problems using tool-integrated reasoning (TIR)

Updated 17 runs

MARS5, a fully open-source (commercially usable) voice-cloning/TTS model with breakthrough prosody and realism.

Updated 538 runs

for backsound

Updated 86 runs

Cog wrapper for Ollama deepseek-coder-v2:236b

Updated 374 runs

Convert audio to SRT subtitles

Updated 27 runs

My Cat Xiaobai

Updated 460 runs

Cog wrapper for Ollama llama3:70b

Updated 100 runs

Cog wrapper for Ollama llama3:8b

Updated 12 runs

Input a video. Ask anything about it

Updated 3.4K runs

YOLOv10: Real-Time End-to-End Object Detection

Updated 72 runs

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Updated 254 runs

Take audio from one video and add it to a second video. Good for adding back audio to liveportrait.

Updated 195 runs

Change the fps of a video without changing its length or speed

Updated 82 runs

Portrait animation using a driving video source

Updated 60.1K runs

Efficient Portrait Animation with Stitching and Retargeting Control

Updated 1.2K runs

Kolors is a SOTA base image model for high quality image generation

Updated 1.2K runs

The API automatically detects objects in an input image and returns their positional and mask information.

Updated 4K runs

Convert images to anime style

Updated 224.2K runs

Create music for your content

Updated 198.4K runs

Mama ママ 2.0 Shinsei Galverse Anime-themed text-to-image model

Updated 1.9K runs

InternLM2.5 is an open-source release of a 7-billion-parameter base model and a chat model tailored for practical scenarios.

Updated 49 runs

Create videos from illustrated input images

Updated 36.7K runs

Phi-3-Mini-4K-Instruct is a 3.8B-parameter, lightweight, state-of-the-art open model trained with the Phi-3 datasets

Updated 95.3K runs

Qwen2 57-billion-parameter language model from Alibaba Cloud, fine-tuned for chat completions

Updated 1.1K runs

SDXL fine-tune based on images of birds primarily from the British Library free archive

Updated 17 runs