I want to…

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Upscale images

Upscaling models that create high-quality images from low-quality images

Train a language model

Language models that you can fine-tune using Replicate's training API.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Latest models

OpenBMB MiniCPM-V 2.8B is a strong multimodal large language model for efficient end-side deployment

Updated 13 runs

A tiny model for testing out Cog

Updated 42 runs

a powerful and competitive model like Midjourney v6 and DALL-E 3 but Open and Decentralized

Updated 47 runs

HairFastGAN: Realistic and Robust Hair Transfer with a Fast Encoder-Based Approach

Updated 30 runs

PyTorch implementation of AnimeGAN for fast photo animation

Updated 30.7K runs

AbsoluteReality V1.8.1 Model (Text2Img, Img2Img and Inpainting)

Updated 5.1K runs

Updated 1.4K runs

An example of a rudimentary Q&A assistant for ACME SL

Updated 7 runs

ZeST: Zero-Shot Material Transfer from a Single Image

Updated 72 runs

Reliberate v3 Model (Text2Img, Img2Img and Inpainting)

Updated 162.6K runs

Deliberate V6 Model (Text2Img, Img2Img and Inpainting)

Updated 1.7K runs

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

Updated 515 runs

WizardLM 2 8x22B

Updated 80 runs

Updated 77 runs

lightweight text-to-speech (TTS) model, trained on 10.5K hours of audio data

Updated 88 runs

Accelerated transcription, word-level timestamps and diarization with whisperX large-v3

Updated 93.5K runs

txt2img model based on photon-v1 checkpoint model

Updated 2.2K runs

Change eye (iris) color

Updated 156 runs

Updated 543 runs

Mixtral 8x22b v0.1 Zephyr Orpo 141b A35b v0.1

Updated 98 runs


Updated 169 runs

Midjourney v6 text-to-image quality model but Open and Decentralized

Updated 241 runs

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Updated 686 runs

GPU accelerated replay renderer / video data clipper for comma.ai connect's openpilot route data. SEE README.

Updated 1.9K runs

Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Updated 388 runs

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

Updated 164 runs

Updated 71 runs

A large, stereo MusicGen that acts as a useful tool for music producers

Updated 160 runs

Nous Hermes 2 Mixtral 8x7B DPO is a Nous Research model trained over the Mixtral 8x7B MoE LLM

Updated 28 runs

High resolution image Upscaler and Enhancer. Use at ClarityAI.cc. A free Magnific alternative. Twitter/X: @philz1337x

Updated 228.8K runs


Updated 132 runs

Best-in-class virtual try on in the wild

Updated 1.6K runs

Image generation, Added: inpaint_strength loras_custom_urls

Updated 403 runs

Updated 1K runs

Use a subset of https://github.com/barun-saha/slide-deck-ai to create powerpoint slides from a json description - using python-pptx (https://github.com/scanny/python-pptx)

Updated 72 runs

Generates Images in the Big Medium Style

Updated 323 runs

multilingual text2image latent diffusion model

Updated 8.6M runs

viⓍTTS vixTTS là mô hình tạo sinh giọng nói cho phép bạn sao chép giọng nói sang các ngôn ngữ khác nhau chỉ bằng cách sử dụng một đoạn âm thanh nhanh dài 6 giây

Updated 55 runs

StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

Updated 347 runs

Turn a face into 3D, emoji, pixel art, video game, claymation or toy

Updated 10.4M runs

EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via Expressive Masked Audio Gesture Modeling

Updated 24 runs

Free Lunch towards Style-Preserving in Text-to-Image Generation by InstantX team, with ControlNet

Updated 238 runs

繁花 style 测试

Updated 75 runs

Free Lunch towards Style-Preserving in Text-to-Image Generation by InstantX team

Updated 516 runs

Updated 395 runs

MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual Tokens

Updated 82 runs

Newest balance-striking reranker model from BAAI. Outputs rank scores for query-doc pairs. FP16 inference enabled.

Updated 106 runs

Open Sora Plan Text To Video

Updated 716 runs

Domain Consistent Resolution Adapter for Diffusion Models: generating consistent images with resolutions outside of their trained domain

Updated 1.1K runs

Updated 161 runs