Explore

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Train the model to recognize and generate new concepts, such as specific styles, characters, or objects, from a small set of example images. It's fast (under 2 minutes), cheap (under $2), and gives you a warm, runnable model plus LoRA weights to download.


Featured models

black-forest-labs / flux-kontext-max

A premium text-based image editing model that delivers maximum performance and improved typography generation for transforming images through natural language prompts

10.2K runs

black-forest-labs / flux-kontext-pro

A state-of-the-art text-based image editing model that delivers high-quality outputs with excellent prompt following and consistent results for transforming images through natural language

25.2K runs

google / imagen-4

Preview of Google's Imagen-4 flagship model. As a preview, this model might change.

76K runs

replicate / fast-flux-trainer

Train subjects or styles faster than ever

1.5K runs

anthropic / claude-4-sonnet

Claude Sonnet 4 is a significant upgrade to 3.7, delivering superior coding and reasoning while responding more precisely to your instructions

21.8K runs

pixverse / pixverse-v4.5

Quickly make 5s or 8s videos at 540p, 720p or 1080p. It offers enhanced motion and prompt coherence, and handles complex actions well.

14.9K runs

prunaai / vace-14b

This is the VACE-14B model, optimised with Pruna AI. Wan2.1 VACE is an all-in-one model for video creation and editing.

4.1K runs

minimax / speech-02-hd

Text-to-Audio (T2A) that offers voice synthesis, emotional expression, and multilingual capabilities. Optimized for high-fidelity applications like voiceovers and audiobooks.

42.5K runs

ideogram-ai / ideogram-v3-turbo

Turbo is the fastest and cheapest Ideogram v3. v3 creates images with stunning realism, creative designs, and consistent styles

70.5K runs

Official models

Official models are always on, maintained, and have predictable pricing.

flux-kontext-apps / text-removal

Remove all text from an image with FLUX.1 Kontext

85 runs

black-forest-labs / flux-kontext-max

Generate images, Use the FLUX family of models, and Edit images

10.2K runs

black-forest-labs / flux-kontext-pro

Generate images, Use the FLUX family of models, Edit images, and Use a face to make images

25.2K runs

flux-kontext-apps / cartoonify

Turn your image into a cartoon with FLUX.1 Kontext [pro]

170 runs

flux-kontext-apps / depth-of-field

Bring your subjects into focus with FLUX.1 Kontext [pro]

132 runs

flux-kontext-apps / impossible-scenarios

Experience impossible adventures and extreme scenarios from a single image

314 runs

flux-kontext-apps / multi-image-kontext

An experimental FLUX Kontext model that can combine two input images

1.1K runs

flux-kontext-apps / iconic-locations

Put yourself in an iconic location around the world from a single image

292 runs

leonardoai / phoenix-1.0

Leonardo AI’s first foundational model produces images up to 5 megapixels (fast, quality and ultra modes)

320 runs

leonardoai / motion-2.0

Generate videos

101 runs

flux-kontext-apps / portrait-series

Create a series of portrait photos from a single image

884 runs

flux-kontext-apps / professional-headshot

Create a professional headshot photo from any single image

812 runs

flux-kontext-apps / change-haircut

Quickly change someone's hair style and hair color, powered by FLUX.1 Kontext [pro]

959 runs

google / lyria-2

Lyria 2 is a music generation model that produces 48kHz stereo audio through text-based prompts

2.3K runs

luma / ray-flash-2-720p

Generate videos

17.3K runs

luma / ray-2-720p

Generate videos

19K runs

luma / ray-flash-2-540p

Generate videos

13.9K runs


I want to…

Generate images

Models that generate images from text prompts

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest-quality open-source video generation model.

Generate videos

Models that create and edit videos

Caption images

Models that generate text from images

Transcribe speech

Models that convert speech to text

Use the FLUX family of models

The FLUX family of text-to-image models from Black Forest Labs

Edit images

Tools for manipulating images.

Use a face to make images

Make realistic images of people instantly

Get embeddings

Models that generate embeddings from inputs

Generate speech

Convert text to speech

Generate music

Models to generate and modify music

Generate text

Models that can understand and generate text

Use handy tools

Toolbelt-type models for videos and images.

Upscale images

Upscaling models that create high-quality images from low-quality images

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Caption videos

Models that generate text from videos

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Remove backgrounds

Models that remove backgrounds from images and videos

Detect objects

Models that detect or segment objects in images and videos.

Sing with voices

Voice-to-voice cloning and musical prosody

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Chat with images

Ask language models about images

Extract text from images

Optical character recognition (OCR) and text extraction

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Popular models

bytedance/sdxl-lightning-4step

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 2 months, 1 week ago 966.1M runs

beautyyuyanli/multilingual-e5-large

multilingual-e5-large: A multi-language text embedding model

Updated 1 year, 4 months ago 19.9M runs

jaaari/kokoro-82m

Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)

Updated 4 months ago 27.1M runs

openai/whisper

Convert speech in audio to text

Updated 6 months ago 90M runs

vaibhavs10/incredibly-fast-whisper

whisper-large-v3, incredibly fast, powered by Hugging Face Transformers! 🤗

Updated 1 year, 3 months ago 7M runs

xinntao/gfpgan

Practical face restoration algorithm for *old photos* or *AI-generated faces*

Updated 2 years, 8 months ago 30.4M runs

bytedance/hyper-flux-8step

Hyper FLUX 8-step by ByteDance

Updated 2 months, 1 week ago 12.7M runs

philz1337x/clarity-upscaler

High resolution image Upscaler and Enhancer. Use at ClarityAI.co. A free Magnific alternative. Twitter/X: @philz1337x

Updated 11 months ago 14.5M runs

Latest models

lucataco/sdxl-lightning-multi-controlnet

SDXL Lightning multi-controlnet, img2img & inpainting

Updated 1 year, 3 months ago 9.4K runs

lucataco/dreamshaper-xl-lightning

dreamshaper-xl-lightning is a Stable Diffusion model that has been fine-tuned on SDXL

Updated 1 year, 3 months ago 119.6K runs

datacte/proteus-v0.4

ProteusV0.4: The Style Update

Updated 1 year, 3 months ago 111.3K runs

datong-new/rvc

Updated 1 year, 3 months ago 195 runs

adirik/bunny-phi-2-siglip

Lightweight multimodal model for visual question answering, reasoning and captioning

Updated 1 year, 3 months ago 7.8K runs

zust-ai/supir

Updated 1 year, 3 months ago 212.9K runs

magpai-app/chroma-key

Simple video chroma keying

Updated 1 year, 3 months ago 49 runs

adirik/multilingual-e5-small

Multilingual E5-small language embedding model

Updated 1 year, 3 months ago 52 runs

adirik/multilingual-e5-base

Multilingual E5-base language embedding model

Updated 1 year, 3 months ago 45 runs

adirik/multilingual-e5-large

Multilingual E5-large language embedding model

Updated 1 year, 3 months ago 538 runs

simonzeng7108/tea-seg

Tea Segmentation Demo

Updated 1 year, 3 months ago 29 runs

proactive-ingredient/nexus_raven

Function calling LLM that surpasses the state-of-the-art in function calling capabilities

Updated 1 year, 3 months ago 65 runs

cjwbw/opencodeinterpreter-ds-6.7b

OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement

Updated 1 year, 3 months ago 117 runs

mohamad1998630/controlnet

Updated 1 year, 3 months ago 82 runs

lucataco/animate-diff-vid2vid

AnimateDiff video to video

Updated 1 year, 3 months ago 648 runs

collectiveai-team/speaker-diarization-3

Segments an audio recording based on who is speaking

Updated 1 year, 3 months ago 3K runs

usamaehsan/multi-controlnet-x-ip-adapter-vision-v2

Updated 1 year, 3 months ago 5.1K runs

cjwbw/supir-v0f

Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild. This is the SUPIR-v0F model and does NOT use LLaVA-13b.

Updated 1 year, 3 months ago 15.2K runs

cjwbw/supir-v0q

Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild. This is the SUPIR-v0Q model and does NOT use LLaVA-13b.

Updated 1 year, 3 months ago 108.8K runs

cjwbw/supir

Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild. This version uses LLaVA-13b for captioning.

Updated 1 year, 3 months ago 187.4K runs

lucataco/depth-anything-video-sbs

POC implementation of Depth-anything to produce a 3D SBS video

Updated 1 year, 3 months ago 198 runs

adirik/e5-mistral-7b-instruct

E5-mistral-7b-instruct language embedding model

Updated 1 year, 3 months ago 634 runs

fofr/image-merge-sdxl

Merge two images together with a prompt

Updated 1 year, 3 months ago 6.2K runs

hamelsmu/honeycomb-2

Honeycomb NLQ Generator

Updated 1 year, 3 months ago 181 runs

datacte/proteus-v0.4-lightning

ProteusV0.4: The Style Update - enhances stylistic capabilities, similar to Midjourney's approach, rather than advancing prompt comprehension

Updated 1 year, 3 months ago 131.6K runs

vessl-ai/floyd-cpu-custom-cog-test

hello-world from cog example

Updated 1 year, 3 months ago 34 runs

brewwh/cog-a1111-ui

A collection of anime stable diffusion models with VAEs and LORAs.

Updated 1 year, 3 months ago 3.7K runs

magpai-app/cog-ffprobe

Get the width, height, and duration in seconds from a video

Updated 1 year, 3 months ago 220 runs

google-deepmind/gemma-7b

7B base version of Google’s Gemma model

Updated 1 year, 3 months ago 7.7K runs

google-deepmind/gemma-2b

2B base version of Google’s Gemma model

Updated 1 year, 3 months ago 2.5K runs

google-deepmind/gemma-7b-it

7B instruct version of Google’s Gemma model

Updated 1 year, 3 months ago 88.5K runs

google-deepmind/gemma-2b-it

2B instruct version of Google’s Gemma model

Updated 1 year, 3 months ago 133.5K runs

jd7h/dreamcraft3d

DreamCraft3D is a text and image to 3D model. DreamCraft3D uses DeepFloyd IF and Stable Zero123, which are non-commercial, research-only models. Please make sure you read and abide by the relevant licenses before using it.

Updated 1 year, 3 months ago 581 runs