Explore

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. It's fast (under 2 minutes), cheap (under $2), and gives you a warm, runnable model plus LoRA weights to download.

Official models

Official models are always on, maintained, and have predictable pricing.

View all official models

I want to…

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Upscale images

Upscaling models that create high-quality images from low-quality images

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

SDVN10-Anime

Updated 345 runs

Improved background remover 2.0 - GroundingDino + SAM + Inpainting SDXL + Controlnet Canny

Updated 205 runs

Automatic Speech Recognition with Word-level Timestamps & Diarization

Updated 4.3K runs

Towards Photo-Realistic Image Colorization via Dual Decoders

Updated 361.4K runs

Source: Neuronovo/neuronovo-7B-v0.3 ✦ Quant: TheBloke/neuronovo-7B-v0.3-AWQ ✦ Neuronovo/neuronovo-7B-v0.3 model represents an advanced and fine-tuned version of a large language model, initially based on CultriX/MistralTrix-v1.

Updated 42 runs

Yuan2.0 is a new generation LLM developed by IEIT System, enhanced the model's understanding of semantics, mathematics, reasoning, code, knowledge, and other aspects.

Updated 388 runs

SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution

Updated 81.4K runs

Pheme generates a variety of conversational voices in 16 kHz for phone-call applications

Updated 525 runs

90s anime

Updated 21.3K runs

multilingual-e5-large: A multi-language text embedding model

Updated 21.4M runs

multilingual-e5-base: A multi-language text embedding model

Updated 10 runs

multilingual-e5-small: A multi-language text embedding model

Updated 15 runs

Amused is a lightweight text to image model based off of the muse architecture. Amused is particularly useful in applications that require a lightweight and fast model such as generating many images quickly at once.

Updated 196 runs

Super-fast, 0.6s per image. LCM with img2img, large batching and canny controlnet

Updated 1.4M runs

Cheaper model SwinIR: Image Restoration Using Swin Transformer (analogue of the popular model: jingyunliang/swinir)

Updated 857 runs

A SOTA Nous Research finetune of 200k Yi-34B fine tuned on the Capybara dataset.

Updated 1.9K runs

Video toolkit – convert, make GIFs, extract audio

Updated 8K runs

Diffusion Models for Image Morphing

Updated 1.2K runs

A 34 billion parameter Llama tuned for coding and conversation

Updated 155.8K runs

A 7 billion parameter Llama tuned for coding and conversation

Updated 65.8K runs

(Academic and Non-commercial use only) Pixel-Aware Stable Diffusion for Realistic Image Super-resolution and Personalized Stylization

Updated 40.8K runs

Lob RealVis XL

Updated 60.6K runs

whisper-large-v3, incredibly fast, with video transcription

Updated 128.5K runs

RESEARCH/NON-COMMERCIAL USE ONLY: diffusion-based audio-driven expressive talking head generation

Updated 1K runs

SDXL LoRA finetuned on diamond watches

Updated 54 runs

SDXL LoRA finetuned on Vermeer paintings

Updated 89 runs

SDXL using DeepCache

Updated 3.9K runs

Chest X ray

Updated 2.6K runs

Anydoor: zero-shot object-level image customization

Updated 2.1K runs

RealVisXl V3 with multi-controlnet, lora loading, img2img, inpainting

Updated 1.6M runs

Source: pipizhao/Pandalyst-7B-V1.2 ✦ Quant: TheBloke/Pandalyst-7B-v1.2-AWQ ✦ Pandalyst: A large language model for mastering data analysis using pandas

Updated 20 runs

Honeycomb NLQ Generator

Updated 41 runs

RESEARCH/NON-COMMERCIAL USE ONLY: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models

Updated 127.5K runs

fastai lesson 1 - bird or forest

Updated 229 runs

This is the chat model finetuned on top of TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T

Updated 643 runs

🗣️ Nvidia + Suno.ai's speech-to-text conversion with high accuracy and efficiency 📝

Updated 1.9K runs

SDXL LoRA finetuned on Basquiat Paintings

Updated 173 runs

Nougat: Neural Optical Understanding for Academic Documents

Updated 234 runs

Versatile Audio Super-resolution at Scale which upsamples audio files to 48khz. Longer audio input is possible with this model

Updated 2.5K runs

Detecting Twenty-thousand Classes using Image-level Supervision

Updated 282 runs

Source: TinyLlama/TinyLlama-1.1B-Chat-v1.0 ✦ Quant: TheBloke/TinyLlama-1.1B-Chat-v1.0-AWQ ✦ The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Updated 109 runs

SDXL LoRA I trained on chihuahua images

Updated 25 runs

PyTSMod is an open-source library for Time-Scale Modification(eg. time-stretching) algorithms, by Sangeon Yong at MAC Lab, KAIST.

Updated 175 runs

Zero-shot classifier which classifies text into categories of your choosing. Returns a dictionary of the most likely class and all class likelihoods.

Updated 3.6K runs

Nous Hermes 2 - SOLAR 10.7B is the flagship Nous Research model on the SOLAR 10.7B base model..

Updated 70.6K runs

Nous Hermes 2 - SOLAR 10.7B is the flagship Nous Research model on the SOLAR 10.7B base model.

Updated 9K runs

Personalized Image Animator

Updated 103.4K runs

Source: Arc53/docsgpt-7b-mistral ✦ Quant: TheBloke/docsgpt-7B-mistral-AWQ ✦ DocsGPT is optimized for Documentation (RAG), fine-tuned for providing answers that are based on context

Updated 77 runs

GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration

Updated 4.3M runs