Explore

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Train the model on a small set of example images so it can recognize and generate new concepts, such as specific styles, characters, or objects. Training is fast (under 2 minutes) and cheap (under $2), and it gives you a warm, runnable model plus LoRA weights to download.
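As a rough illustration of what starting such a training run could look like, the sketch below only builds the HTTP request for Replicate's training endpoint and sends nothing. The trainer name `replicate/fast-flux-trainer`, the `VERSION_ID` placeholder, the destination model name, and the input names `input_images` and `trigger_word` are assumptions for illustration; check the trainer's own page for the actual inputs before using them.

```python
import json

API_ROOT = "https://api.replicate.com/v1"


def build_training_request(owner, name, version_id, destination,
                           input_images_url, trigger_word, token):
    """Return (url, headers, body) for a training request. Nothing is sent."""
    # Trainings are created against a specific version of a trainer model.
    url = f"{API_ROOT}/models/{owner}/{name}/versions/{version_id}/trainings"
    headers = {
        "Authorization": f"Bearer {token}",
        "Content-Type": "application/json",
    }
    body = json.dumps({
        # Model that will receive the trained weights (hypothetical name).
        "destination": destination,
        # Input names are assumptions; the trainer defines the real schema.
        "input": {
            "input_images": input_images_url,
            "trigger_word": trigger_word,
        },
    })
    return url, headers, body


if __name__ == "__main__":
    url, headers, body = build_training_request(
        owner="replicate",                    # assumed trainer owner
        name="fast-flux-trainer",             # assumed trainer name
        version_id="VERSION_ID",              # placeholder, not a real version
        destination="your-username/your-flux-lora",
        input_images_url="https://example.com/images.zip",
        trigger_word="TOK",
        token="YOUR_API_TOKEN",
    )
    print("POST", url)
    print(body)
```

Keeping request construction separate from sending makes it easy to inspect the payload first; the resulting URL, headers, and body can then be passed to any HTTP client.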

Featured models

black-forest-labs / flux-kontext-max

A premium text-based image editing model that delivers maximum performance and improved typography generation for transforming images through natural language prompts

1.8M runs

black-forest-labs / flux-kontext-pro

A state-of-the-art text-based image editing model that delivers high-quality outputs with excellent prompt following and consistent results for transforming images through natural language

5.6M runs

runwayml / gen4-image

Runway's Gen-4 Image model with references. Use up to 3 reference images to create the exact image you need. Capture every angle.

4.2K runs

black-forest-labs / flux-kontext-dev

Open-weight version of FLUX.1 Kontext

58.2K runs

bytedance / seedream-3

A text-to-image model with support for native high-resolution (2K) image generation

38.3K runs

bytedance / seedance-1-pro

A pro version of Seedance that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 1080p resolution

22.8K runs

bytedance / seedance-1-lite

A video generation model that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 720p resolution

30.3K runs

kwaivgi / kling-v2.1

Use Kling v2.1 to generate 5s and 10s videos in 720p and 1080p resolution from a starting image (image-to-video)

39.5K runs

ideogram-ai / ideogram-v3-quality

The highest quality Ideogram v3 model. v3 creates images with stunning realism, creative designs, and consistent styles

127.4K runs

Official models

Official models are always on, maintained, and have predictable pricing.

black-forest-labs / flux-kontext-max

Generate images, Edit images, Use a face to make images, and Use the FLUX family of models

1.8M runs

black-forest-labs / flux-kontext-pro

Generate images, Edit images, Use a face to make images, and Use the FLUX family of models

5.6M runs

luma / modify-video

Modify a video with style transfer and prompt-based editing

243 runs

runwayml / gen4-image

Runway's Gen-4 Image model with references. Use up to 3 reference images to create the exact image you need. Capture every angle.

4.2K runs

flux-kontext-apps / restyle-video-frame

Use flux-kontext-pro to change the first or last frame of a video. The result is useful as input for restyling an entire video

190 runs

luma / reframe-image

Change the aspect ratio of any photo using AI (not cropping)

2.6K runs

black-forest-labs / flux-schnell-lora

The fastest image generation model tailored for fine-tuned use

1.9M runs

openai / gpt-4.1-nano

Use LLMs

115.6K runs

openai / gpt-4.1-mini

Use LLMs

773K runs

openai / gpt-4o

Use LLMs

109K runs

meta / llama-guard-4-12b

69 runs

openai / gpt-4o-mini

Use LLMs

1.6M runs

resemble-ai / chatterbox

Generate speech

11.4K runs

I want to…

Generate images

Models that generate images from text prompts

Generate videos

Models that create and edit videos

Edit images

Tools for editing images.

Upscale images

Upscaling models that create high-quality images from low-quality images

Generate speech

Convert text to speech

Transcribe speech

Models that convert speech to text

Use LLMs

Models that can understand and generate text

Caption videos

Models that generate text from videos

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Generate music

Models to generate and modify music

Caption images

Models that generate text from images

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Use handy tools

Toolbelt-type models for videos and images.

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Extract text from images

Optical character recognition (OCR) and text extraction

Chat with images

Ask language models about images

Sing with voices

Voice-to-voice cloning and musical prosody

Get embeddings

Models that generate embeddings from inputs

Use a face to make images

Make realistic images of people instantly

Remove backgrounds

Models that remove backgrounds from images and videos

Try for free

Get started with these models without adding a credit card. Whether you're making videos, generating images, or upscaling photos, these are great starting points.

Use the FLUX family of models

The FLUX family of text-to-image models from Black Forest Labs

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Popular models

bytedance/sdxl-lightning-4step

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 3 months, 2 weeks ago 1B runs

openai/whisper

Convert speech in audio to text

Updated 7 months ago 96.3M runs

prunaai/flux.1-dev

This is the fastest Flux Dev endpoint in the world. Contact us for more at pruna.ai

Updated 2 weeks, 5 days ago 3.9M runs

fofr/any-comfyui-workflow

Run any ComfyUI workflow. Guide: https://github.com/replicate/cog-comfyui

Updated 6 days, 14 hours ago 5M runs

xinntao/gfpgan

Practical face restoration algorithm for *old photos* or *AI-generated faces*

Updated 2 years, 9 months ago 33.2M runs

beautyyuyanli/multilingual-e5-large

multilingual-e5-large: A multi-language text embedding model

Updated 1 year, 5 months ago 22.3M runs

jaaari/kokoro-82m

Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)

Updated 5 months ago 32.7M runs

vaibhavs10/incredibly-fast-whisper

whisper-large-v3, incredibly fast, powered by Hugging Face Transformers! 🤗

Updated 1 year, 4 months ago 9M runs

Latest models

fictions-ai/autocaption

Automatically add captions to a video

Updated 1 year, 6 months ago 46K runs

bawgz/stable-dripfusion-2

Updated 1 year, 6 months ago 263 runs

musicly-ai/singing_voice_conversion

This is the Replicate version of singing_voice_conversion from Amphion

Updated 1 year, 6 months ago 573 runs

charlesmccarthy/animagine-xl

Animagine XL 2.0 is an advanced latent text-to-image diffusion model designed to create high-resolution, detailed anime images.

Updated 1 year, 6 months ago 9.3K runs

moayedhajiali/elasticdiffusion

ElasticDiffusion: Training-free Arbitrary Size Image Generation

Updated 1 year, 6 months ago 170 runs

zsxkib/patch-fusion

Super High Quality Depth Maps 🗺️: An End-to-End Tile-Based Framework 🏗️ for High-Resolution Monocular Metric Depth Estimation 🔍📏

Updated 1 year, 6 months ago 374 runs

lucataco/open-dalle-v1.1

A unique fusion that showcases exceptional prompt adherence and semantic understanding. It seems to be a step above base SDXL and a step closer to DALLE-3 in terms of prompt comprehension

Updated 1 year, 6 months ago 127.4K runs

lucataco/diffusion-motion-transfer

Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer

Updated 1 year, 6 months ago 178 runs

kcaverly/nous-hermes-2-yi-34b-gguf

Nous Hermes 2 - Yi-34B is a state-of-the-art Yi fine-tune, trained on GPT-4-generated synthetic data

Updated 1 year, 6 months ago 11.6K runs

charlesmccarthy/terminus-xl-otaku-v1

Terminus XL Otaku is a latent diffusion model that uses a zero-terminal SNR noise schedule and a velocity prediction objective at training and inference time.

Updated 1 year, 6 months ago 42 runs

usamaehsan/controlnet-x-majic-mix-realistic-x-ip-adapter

Works with inpainting, multi-ControlNet or single ControlNet, with or without IP-Adapter

Updated 1 year, 6 months ago 23.8K runs

cjwbw/faster-diffusion

Rethinking the Role of UNet Encoder in Diffusion Models

Updated 1 year, 6 months ago 133 runs

meepo-pro-player/winter-wyvern

Updated 1 year, 6 months ago 269.9K runs

charlesmccarthy/terminus-xl-gamma-v2

Terminus XL Gamma is a new state-of-the-art latent diffusion model that uses a zero-terminal SNR noise schedule and a velocity prediction objective at training and inference time.

Updated 1 year, 6 months ago 279 runs

tomasmcm/sam-7b

Source: SuperAGI/SAM ✦ Quant: TheBloke/SAM-AWQ ✦ SAM (Small Agentic Model), a 7B model that demonstrates impressive reasoning abilities despite its smaller size

Updated 1 year, 6 months ago 78 runs

fofr/sdxl-multi-controlnet-lora

Multi-controlnet, lora loading, img2img, inpainting

Updated 1 year, 6 months ago 211.7K runs

jd7h/luciddreamer

High-Fidelity Text-to-3D Generation via Interval Score Matching

Updated 1 year, 6 months ago 71 runs

intentface/poro-34b-gguf-checkpoint

Try out akx/Poro-34B-gguf (Q5_K). This is the 1000B checkpoint model

Updated 1 year, 6 months ago 28 runs

cloversid099/deepfake

DeepFake AI

Updated 1 year, 6 months ago 63.3K runs

lucataco/singing_voice_conversion

Amphion Singing Voice Conversion: DiffWaveNetSVC

Updated 1 year, 6 months ago 980 runs

zelenioncode/custum_model_safetonsors

DreamBooth safetensors model using RealVisXL

Updated 1 year, 6 months ago 757 runs

fofr/realvisxl-v3

Amazing photorealism with RealVisXL_V3.0, based on SDXL, trainable

Updated 1 year, 6 months ago 747.3K runs

sakemin/all-in-one-music-structure-analyzer

Cog implementation of mir-aidj(Taejun Kim)'s 'All-In-One Music Structure Analyzer'

Updated 1 year, 6 months ago 26.2K runs

lucataco/ip-adapter-faceid

(Research only) IP-Adapter-FaceID can generate images in various styles conditioned on a face with only text prompts

Updated 1 year, 6 months ago 30.8K runs

culturecloud/dreamshaper-xl-turbo

DreamShaper is a general-purpose SD model that aims to do everything well: photos, art, anime, manga. It's designed to compete with other general-purpose models and pipelines like Midjourney and DALL-E.

Updated 1 year, 6 months ago 1.4K runs

fermatresearch/dpo-sdxl-controlnet-lora

DPO-SDXL Canny controlnet with LoRA support.

Updated 1 year, 6 months ago 774 runs

leandroamaral/segmentanything

Segment Anything MASK

Updated 1 year, 6 months ago 1.3K runs

lucataco/dreamshaper-xl-turbo

DreamShaper is a general-purpose SD model that aims to do everything well: photos, art, anime, manga. It's designed to match Midjourney and DALL-E.

Updated 1 year, 6 months ago 223.2K runs

lucataco/dpo-sdxl

Direct Preference Optimization (DPO) is a method to align diffusion models to human preferences by directly optimizing on human comparison data

Updated 1 year, 6 months ago 2.2K runs

zust-ai/zust-diffusion

auto1111_ds8

Updated 1 year, 6 months ago 61.8K runs

lucataco/seamless_communication

FacebookResearch/SeamlessM4T v2 - Massively Multilingual & Multimodal Machine Translation

Updated 1 year, 6 months ago 806 runs

anotherjesse/sdxl-lcm-testing

Updated 1 year, 6 months ago 365 runs

kcaverly/openchat-3.5-1210-gguf

The "Overall Best Performing Open Source 7B Model" for Coding + Generalization or Mathematical Reasoning

Updated 1 year, 6 months ago 26.3K runs

carcruz97/scaling-model-v3

Updated 1 year, 6 months ago 64 runs

tomasmcm/prometheus-13b-v1.0

Source: kaist-ai/prometheus-13b-v1.0 ✦ Quant: TheBloke/prometheus-13B-v1.0-AWQ ✦ An alternative to GPT-4 when evaluating LLMs & Reward models for RLHF

Updated 1 year, 6 months ago 54K runs