I want to…

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Upscale images

Upscaling models that create high-quality images from low-quality images

Train a language model

Language models that you can fine-tune using Replicate's training API.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Latest models

Accelerated transcription, word-level timestamps and diarization with whisperX large-v3 for large audio files. Fork of victor-upmeet/whisperx-a40-large with diarization fixed and group_segments implemented

Updated 13 runs

A 14B parameter, lightweight, state-of-the-art open model trained with the Phi-3 datasets that includes both synthetic data and the filtered publicly available websites data with a focus on high-quality and reasoning dense pro

Updated 7 runs

Another Tyler; custom to your liking

Updated 43 runs

Ultra high resolution images (up to 4096x4096) based on Stable Cascade

Updated 221 runs


Updated 1.4K runs

SDXL Canny controlnet with LoRA support.

Updated 17.6K runs

Seamlessly create stunning product shots by blending with inspirational references for a fresh, modern look

Updated 25 runs

Detect hate speech or toxic comments in tweets/texts

Updated 1.9K runs

Kolors with style transfer, composition transfer and other IPAdapter techniques

Updated 2.3K runs

Largest completely open sourced flow-based generation model that is capable of text-to-image generation

Updated 3K runs

A large-scale text-to-image generation model based on latent diffusion, developed by the Kuaishou Kolors team

Updated 6.1K runs

From Sketch to Reality: Transforming Outlines into Lifelike Images

Updated 18.9K runs

Visual instruction tuning towards large language and vision models with GPT-4 level capabilities

Updated 14.3M runs

Run any ComfyUI workflow. Guide: https://github.com/fofr/cog-comfyui

Updated 444.5K runs

Generate seamless 360 photos using SDXL

Updated 91 runs

Updated 618 runs

Real-ESRGAN for image upscaling on an A100

Updated 10.4M runs

Face Restoration

Updated 2.4K runs

A text-to-image model with greatly improved performance in image quality, typography, complex prompt understanding, and resource-efficiency

Updated 582.8K runs

MimicMotion: High-quality human motion video generation with pose-guided control

Updated 1.1K runs

Pinga marvada. Fine-tuned on modão tracks with the text token "modao"

Updated 24 runs

remove background for retailer product images

Updated 21 runs

Make realistic images of real people instantly (w/ ip-adapter-plus-face_sdxl_vit-h)

Updated 196 runs

Qwen 2: A 72 billion parameter language model fine tuned for chat completions

Updated 99 runs

PixArt Sigma 900M is a text-to-image generation model based on the PixArt Sigma architecture

Updated 116 runs

Updated 8.5K runs

Updated 71 runs

araby.ai oneshot faceswap

Updated 1.5K runs

Detects if a picture has anime face.

Updated 10.1K runs

NuminaMath is a series of language models that are trained to solve math problems using tool-integrated reasoning (TIR)

Updated 11 runs

MARS5, a fully open-source (commercially usable) voice-cloning/TTS with break-through prosody and realism.

Updated 319 runs

Updated 68 runs

GPU accelerated replay renderer / video data clipper for comma.ai connect's openpilot route data. SEE README.

Updated 2.4K runs

The Mistral-7B-Instruct-v0.3 Large Language Model is an instruct fine-tuned version of the Mistral-7B-v0.3

Updated 10 runs

for backsound

Updated 52 runs

Convert speech in audio to text

Updated 19.6M runs

Generate high resolution image

Updated 1K runs

Cog wrapper for Ollama deepseek-coder-v2:236b

Updated 153 runs

audio to srt

Updated 5 runs

My Cat Xiaobai

Updated 388 runs

Cog wrapper for Ollama llama3:70b

Updated 6 runs

Cog wrapper for Ollama llama3:8b

Updated 6 runs

Input a video. Ask anything about it

Updated 75 runs

YOLOv10: Real-Time End-to-End Object Detection

Updated 19 runs

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Updated 187 runs

Take audio from one video and add it to a second video. Good for adding back audio to liveportrait.

Updated 76 runs

Change the fps of a video without changing its length or speed

Updated 64 runs

Portrait animation using a driving video source

Updated 24.2K runs

⚡️ Fast audio transcription | whisper large-v3 | speaker diarization | word & sentence level timestamps | prompt | hotwords

Updated 447.2K runs

Efficient Portrait Animation with Stitching and Retargeting Control

Updated 728 runs