Explore

I want to…

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Upscale images

Upscaling models that create high-quality images from low-quality images

Train a language model

Language models that you can fine-tune using Replicate's training API.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Latest models

Accelerated transcription, word-level timestamps and diarization with whisperX large-v3 for large audio files. Fork of victor-upmeet/whisperx-a40-large with diarization fixed and group_segments implemented

Updated 13 runs

A 14B parameter, lightweight, state-of-the-art open model trained with the Phi-3 datasets that includes both synthetic data and the filtered publicly available websites data with a focus on high-quality and reasoning dense pro

Updated 7 runs

Another Tyler; custom to your liking

Updated 43 runs

Ultra high resolution images (up to 4096x4096) based on Stable Cascade

Updated 221 runs

https://civitai.com/models/317902

Updated 1.4K runs

SDXL Canny controlnet with LoRA support.

Updated 17.6K runs

Seamlessly create stunning product shots by blending with inspirational references for a fresh, modern look

Updated 25 runs

Detect hate speech or toxic comments in tweets/texts

Updated 1.9K runs

Kolors with style transfer, composition transfer and other IPAdapter techniques

Updated 2.3K runs

Largest completely open sourced flow-based generation model that is capable of text-to-image generation

Updated 3K runs

A large-scale text-to-image generation model based on latent diffusion, developed by the Kuaishou Kolors team

Updated 6.1K runs

From Sketch to Reality: Transforming Outlines into Lifelike Images

Updated 18.9K runs

Visual instruction tuning towards large language and vision models with GPT-4 level capabilities

Updated 14.3M runs

Run any ComfyUI workflow. Guide: https://github.com/fofr/cog-comfyui

Updated 444.5K runs

Generate seamless 360 photos using SDXL

Updated 91 runs

Updated 618 runs

Real-ESRGAN for image upscaling on an A100

Updated 10.4M runs

Face Restoration

Updated 2.4K runs

A text-to-image model with greatly improved performance in image quality, typography, complex prompt understanding, and resource-efficiency

Updated 582.8K runs

MimicMotion: High-quality human motion video generation with pose-guided control

Updated 1.1K runs

Pinga marvada. Fine-tuned on modão tracks with the text token "modao"

Updated 24 runs

remove background for retailer product images

Updated 21 runs

Make realistic images of real people instantly (w/ ip-adapter-plus-face_sdxl_vit-h)

Updated 196 runs

Qwen 2: A 72 billion parameter language model fine tuned for chat completions

Updated 99 runs

PixArt Sigma 900M is a text-to-image generation model based on the PixArt Sigma architecture

Updated 116 runs

Updated 8.5K runs

Updated 71 runs

araby.ai oneshot faceswap

Updated 1.5K runs

Detects if a picture has anime face.

Updated 10.1K runs

NuminaMath is a series of language models that are trained to solve math problems using tool-integrated reasoning (TIR)

Updated 11 runs

MARS5, a fully open-source (commercially usable) voice-cloning/TTS with break-through prosody and realism.

Updated 319 runs

Updated 68 runs

GPU accelerated replay renderer / video data clipper for comma.ai connect's openpilot route data. SEE README.

Updated 2.4K runs

The Mistral-7B-Instruct-v0.3 Large Language Model is an instruct fine-tuned version of the Mistral-7B-v0.3

Updated 10 runs

for backsound

Updated 52 runs

Convert speech in audio to text

Updated 19.6M runs

Generate high resolution image

Updated 1K runs

Cog wrapper for Ollama deepseek-coder-v2:236b

Updated 153 runs

audio to srt

Updated 5 runs

My Cat Xiaobai

Updated 388 runs

Cog wrapper for Ollama llama3:70b

Updated 6 runs

Cog wrapper for Ollama llama3:8b

Updated 6 runs

Input a video. Ask anything about it

Updated 75 runs

YOLOv10: Real-Time End-to-End Object Detection

Updated 19 runs

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Updated 187 runs

Take audio from one video and add it to a second video. Good for adding back audio to liveportrait.

Updated 76 runs

Change the fps of a video without changing its length or speed

Updated 64 runs

Portrait animation using a driving video source

Updated 24.2K runs

⚡️ Fast audio transcription | whisper large-v3 | speaker diarization | word & sentence level timestamps | prompt | hotwords

Updated 447.2K runs

Efficient Portrait Animation with Stitching and Retargeting Control

Updated 728 runs