Explore

I want to…

Use official models

Official models are always on, maintained, and have predictable pricing.

Restore images

Models that improve or restore images by deblurring, colorizing, and removing noise.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate.

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

blue_pencil-XL meets ANIMAGINE XL 3.0 / ANIMAGINE XL 3.1, the top-ranked model on Civitai

Updated 3.2K runs

Updated 112 runs

This is an implementation of ChatTTS as a Cog model.

Updated 3K runs

Stylized Audio-Driven Single Image Talking Face Animation

Updated 123.3K runs

Recreate images with Emojis

Updated 199 runs

Fast and High-Quality Text-to-video Generation

Updated 4.4K runs

A PhotoBooth style transfer workflow that utilizes IPadapter Style, Canny, OpenPose, RemoveBackground, HumanSegmentation, Cloth Segmentation for initial input, and concludes with the application of DeepFake techniques.

Updated 149 runs

AI Photorealistic Image Ultra-Resolution, Restoration and Upscale!

Updated 73.8K runs

SDXL LoRA finetuned on spectrograms of Beethoven songs

Updated 16 runs

Transform an empty room into a fabulous interior design

Updated 2.4K runs

Guided Text to Speech Generator

Updated 371 runs

viⓍTTS: vixTTS is a speech generation model that lets you clone a voice into different languages using just a quick 6-second audio clip

Updated 399 runs

Given an image of a face, it generates full images from a given prompt

Updated 167 runs

Jina Turbo Reranker that is small but performant

Updated 4 runs

SDXL-based text-to-image model applying Distribution Matching Distillation, supporting zero-shot identity generation in 2-5s. https://ai-visionboard.com

Updated 4.5M runs

Fast SDXL with higher quality

Updated 659.4K runs

for test

Updated 246 runs

A text-to-image generative AI model that creates beautiful images

Updated 75.8M runs

A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Updated 331 runs

Use a face to make images. Uses SDXL fine-tuned checkpoints.

Updated 155.2K runs

This is the phi-3-vision model, billed by run time. Have fun~

Updated 13K runs

Convert a story to Stable Diffusion prompt format

Updated 40 runs

OpenAI Whisper model on A100 hardware

Updated 48 runs

Image tagger fine-tuned on WaifuDiffusion w/ (SwinV2, SwinV2, ConvNext, and ViT)

Updated 988 runs

✍️✨Prompts to auto-magically relight your images

Updated 180.1K runs

Replicate version from the work of Shanglin Li et al. called "ZONE: Zero-Shot Instruction-Guided Local Editing"

Updated 143 runs

🖼️✨Background images + prompts to auto-magically relight your images (+normal maps🗺️)

Updated 9.1K runs

Updated 19 runs

This is an ML model to segment hair in pictures.

Updated 98 runs

Segment foreground objects with high resolution and matting, using InSPyReNet

Updated 681.6K runs

Updated to OpenVoice v2: Versatile Instant Voice Cloning

Updated 48.5K runs

Updated 5.9K runs

Three models in one Cog: Absolute Reality v1.8.1, DreamShaper v8 and Meina V4

Updated 22K runs

Updated 404 runs

Source: gradientai/Llama-3-8B-Instruct-Gradient-4194k ✦ Quant: solidrust/Llama-3-8B-Instruct-Gradient-4194k-AWQ ✦ Extending Llama-3 8B's context length from 8k to 4194K

Updated 141 runs

CLIP Interrogator for SDXL optimizes text prompts to match a given image

Updated 845.9K runs

📖 PuLID: Pure and Lightning ID Customization via Contrastive Alignment

Updated 1.6M runs

An example model created from the CLI

Updated 16 runs

PaliGemma 3B, an open VLM by Google, pre-trained with 224*224 input images and 128 token input/output text sequences

Updated 561 runs

A model which generates text in response to an input image and prompt.

Updated 1.5M runs

Generate an image with a transparent background

Updated 628 runs

Yi-1.5 is continuously pre-trained on Yi with a high-quality corpus of 500B tokens and fine-tuned on 3M diverse fine-tuning samples

Updated 62 runs

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view LRMs

Updated 230.7K runs

Blip 3 / XGen-MM, Answers questions about images ({blip3,xgen-mm}-phi3-mini-base-r-v1)

Updated 1.1M runs

Dolphin is uncensored. I have filtered the dataset to remove alignment and bias. This makes the model more compliant.

Updated 1.2K runs

Returns CLIP features for dfn5b-clip-vit-h-14-384, currently the highest average performance on the OpenCLIP models leaderboard.

Updated 383 runs

Dolphin is uncensored. I have filtered the dataset to remove alignment and bias. This makes the model more compliant.

Updated 59.6K runs

Dolphin is uncensored. I have filtered the dataset to remove alignment and bias. This makes the model more compliant.

Updated 5.9K runs

Updated 6.3K runs

Implementation of the RemBG library

Updated 340 runs