Explore

I want to…

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Upscale images

Upscaling models that create high-quality images from low-quality images

Train a language model

Language models that you can fine-tune using Replicate's training API.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Latest models

InstantID. ControlNets. More base SDXL models. And the latest ByteDance's ⚡️SDXL-Lightning !⚡️

Updated 83.1K runs

The img2img pipeline that makes an anime-style image of a person. It uses one of sd1.5 models as a base, depth-estimation as a ControleNet and IPadapter model for face consistency.

Updated 23 runs

GPU accelerated replay renderer / video data clipper for comma.ai connect's openpilot route data. SEE README.

Updated 2K runs

Consistent Self-Attention for Long-Range Image and Video Generation

Updated 304 runs

Updated 449 runs

StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation

Updated 768 runs

Robust face restoration algorithm for old photos / AI-generated faces (adapted to work with video inputs)

Updated 40 runs

Run any ComfyUI workflow. Guide: https://github.com/fofr/cog-comfyui

Updated 183.8K runs

Updated 19 runs

📖 PuLID: Pure and Lightning ID Customization via Contrastive Alignment

Updated 3.6K runs

Semantic Segmentation

Updated 112.2K runs

A SDXL Model trained from another SDXL-hiroshinagai model images

Updated 69 runs

Train SDXL 1.0 with LoRA | mixed precision bf16 and save precision fp16

Updated 227 runs

Just some good ole beautifulsoup scrapping URL magic. (some sites don't work as they block scrapping, but still useful)

Updated 869 runs

High resolution image Upscaler and Enhancer. Use at ClarityAI.cc. A free Magnific alternative. Twitter/X: @philz1337x

Updated 1M runs

Realistic Inpainting with ControlNET (M-LSD + SEG)

Updated 3.1K runs

Tango2: LLM-guided Diffusion-based Text-to-Audio Generation and DPO-based Alignment

Updated 18.4K runs

a fine-tuned model to detect dragon in images.

Updated 17 runs

🗣️ TalkNet-ASD: Detect who is speaking in a video

Updated 45 runs

Segments skies for outdoors

Updated 2K runs

Transfer a material from an image to a subject

Updated 1.7K runs

Updated 30 runs

Demucs is an audio source separator created by Facebook Research.

Updated 340.7K runs

Updated 53 runs

Creates voxels like game asset

Updated 172 runs

Projection module trained to add vision capabilties to Llama 3 using SigLIP

Updated 1.1K runs

Updated 58 runs

Updated 83 runs

Updated 4K runs

Uses 'Align your steps' for faster higher quality images

Updated 839 runs

Dark Sushi Mix 2.25D Model with vae-ft-mse-840000-ema (Text2Img, Img2Img and Inpainting)

Updated 38.9K runs

Convert story to StableDiffusion prompts format

Updated 20 runs

llava-phi-3-mini is a LLaVA model fine-tuned from microsoft/Phi-3-mini-4k-instruct

Updated 59 runs

PyTorch implementation of AnimeGAN for fast photo animation

Updated 30.8K runs

Updated 9 runs

Function calling with llama-3 with prompting only.

Updated 184 runs

Updated to OpenVoice v2: Versatile Instant Voice Cloning

Updated 4K runs

Updated 29 runs

Updated 78 runs

Qwen1.5 is the beta version of Qwen2, a transformer-based decoder-only language model pretrained on a large amount of data

Updated 2.4K runs

Qwen1.5 is the beta version of Qwen2, a transformer-based decoder-only language model pretrained on a large amount of data

Updated 20 runs

Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis

Updated 528 runs

llm model ,for CN

Updated 209 runs

Reliberate v3 Model (Text2Img, Img2Img and Inpainting)

Updated 266.9K runs

Deliberate V6 Model (Text2Img, Img2Img and Inpainting)

Updated 3.8K runs

AbsoluteReality V1.8.1 Model (Text2Img, Img2Img and Inpainting)

Updated 14K runs

Updated 11 runs

Make realistic images of real people instantly

Updated 386.1K runs

Phi-3-Mini-128K-Instruct is a 3.8 billion-parameter, lightweight, state-of-the-art open model trained using the Phi-3 datasets

Updated 1.1K runs

Phi-3-Mini-4K-Instruct is a 3.8B parameters, lightweight, state-of-the-art open model trained with the Phi-3 datasets

Updated 72.1K runs