Explore

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. It's fast (under 2 minutes), cheap (under $2), and gives you a warm, runnable model plus LoRA weights to download.

Get started Learn more

Featured models

google / imagen-4

Preview of Google's Imagen-4 flagship model. As a preview, this model might change.

64.5K runs

replicate / fast-flux-trainer

Train subjects or styles faster than ever

1.2K runs

google / lyria-2

Lyria 2 is a music generation model that produces 48kHz stereo audio through text-based prompts

2.1K runs

anthropic / claude-4-sonnet

Claude Sonnet 4 is a significant upgrade to 3.7, delivering superior coding and reasoning while responding more precisely to your instructions

18.2K runs

pixverse / pixverse-v4.5

Quickly make 5s or 8s videos at 540p, 720p or 1080p. It has enhanced motion, prompt coherence and handles complex actions well.

12.8K runs

openai / gpt-4.1

OpenAI's Flagship GPT model for complex tasks.

2.7K runs

prunaai / vace-14b

This is VACE-14B model optimised with pruna ai. Wan2.1 VACE is an all-in-one model for video creation and editing.

3.8K runs

minimax / speech-02-hd

Text-to-Audio (T2A) that offers voice synthesis, emotional expression, and multilingual capabilities. Optimized for high-fidelity applications like voiceovers and audiobooks.

40.7K runs

ideogram-ai / ideogram-v3-turbo

Turbo is the fastest and cheapest Ideogram v3. v3 creates images with stunning realism, creative designs, and consistent styles

66.6K runs

Official models

Official models are always on, maintained, and have predictable pricing.

leonardoai / phoenix-1.0

Leonardo AI’s first foundational model produces images up to 5 megapixels (fast, quality and ultra modes)

165 runs

google / lyria-2

Lyria 2 is a music generation model that produces 48kHz stereo audio through text-based prompts

2.1K runs

luma / ray-flash-2-720p

Generate videos

17.2K runs

luma / ray-2-720p

Generate videos

18.9K runs

luma / ray-flash-2-540p

Generate videos

13.7K runs

luma / ray-2-540p

Generate videos

3.2K runs

luma / ray

Generate videos

34.5K runs

google / veo-2

Generate videos

56.5K runs

black-forest-labs / flux-dev-lora

Generate images

1.8M runs

anthropic / claude-4-sonnet

Claude Sonnet 4 is a significant upgrade to 3.7, delivering superior coding and reasoning while responding more precisely to your instructions

18.2K runs

kwaivgi / kling-v1.6-pro

Generate videos

337.4K runs

kwaivgi / kling-v1.6-standard

Generate videos

635.5K runs

pixverse / pixverse-v4.5

Generate videos

12.8K runs

pixverse / pixverse-v4

Generate videos

13.6K runs

openai / o4-mini

Generate text

1K runs

openai / o1

Generate text

2K runs

View all official models

I want to…

Generate images

Models that generate images from text prompts

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Generate videos

Models that create and edit videos

Caption images

Models that generate text from images

Transcribe speech

Models that convert speech to text

Edit images

Tools for manipulating images.

Use a face to make images

Make realistic images of people instantly

Get embeddings

Models that generate embeddings from inputs

Generate speech

Convert text to speech

Generate music

Models to generate and modify music

Generate text

Models that can understand and generate text

Use handy tools

Toolbelt-type models for videos and images.

Upscale images

Upscaling models that create high-quality images from low-quality images

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Caption videos

Models that generate text from videos

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Remove backgrounds

Models that remove backgrounds from images and videos

Detect objects

Models that detect or segment objects in images and videos.

Sing with voices

Voice-to-voice cloning and musical prosody

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Chat with images

Ask language models about images

Extract text from images

Optical character recognition (OCR) and text extraction

Use the FLUX family of models

The FLUX family of text-to-image models from Black Forest Labs

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Popular models

bytedance/sdxl-lightning-4step

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 2 months, 1 week ago 964.6M runs

beautyyuyanli/multilingual-e5-large

multilingual-e5-large: A multi-language text embedding model

Updated 1 year, 4 months ago 19.6M runs

jaaari/kokoro-82m

Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)

Updated 3 months, 4 weeks ago 26.9M runs

openai/whisper

Convert speech in audio to text

Updated 6 months ago 89.7M runs

vaibhavs10/incredibly-fast-whisper

whisper-large-v3, incredibly fast, powered by Hugging Face Transformers! 🤗

Updated 1 year, 3 months ago 7M runs

xinntao/gfpgan

Practical face restoration algorithm for *old photos* or *AI-generated faces*

Updated 2 years, 8 months ago 30.3M runs

bytedance/hyper-flux-8step

Hyper FLUX 8-step by ByteDance

Updated 2 months, 1 week ago 12.7M runs

philz1337x/clarity-upscaler

High resolution image Upscaler and Enhancer. Use at ClarityAI.co. A free Magnific alternative. Twitter/X: @philz1337x

Updated 11 months ago 14.5M runs

Latest models

zsxkib/sonic

Generates realistic talking face animations from a portrait image and audio using the CVPR 2025 Sonic model

Updated 1 month, 3 weeks ago 1.6K runs

zsxkib/infinite-you

Transform your portrait photos into any style or setting while preserving your facial identity

Updated 1 month, 3 weeks ago 6.4K runs

lucataco/wan-2.1-1.3b-vid2vid

Wan 2.1 1.3b Video to Video. Wan is a powerful visual generation model developed by Tongyi Lab of Alibaba Group

Updated 1 month, 3 weeks ago 652 runs

zsxkib/create-video-dataset

Easily create video datasets with auto-captioning for Hunyuan-Video LoRA finetuning

Updated 1 month, 3 weeks ago 602 runs

zsxkib/mmaudio-t4

Cost-optimized MMAudio V2 (T4 GPU): Add sound to video using this version running on T4 hardware for lower cost. Synthesizes high-quality audio from video content.

Updated 1 month, 3 weeks ago 279 runs

zsxkib/mmaudio

Add sound to video using the MMAudio V2 model. An advanced AI model that synthesizes high-quality audio from video content, enabling seamless video-to-audio transformation.

Updated 1 month, 3 weeks ago 492.9K runs

ostris/flex-redux

A redux adapter trained from scratch on Flex.1-alpha, that also works with FLUX.1-dev

Updated 1 month, 3 weeks ago 177 runs

sruthiselvaraj/indicparlertts

Indic Parler-TTS Pretrained is a multilingual Indic extension of Parler-TTS Mini.

Updated 1 month, 3 weeks ago 37 runs

nvidia/sana-sprint-1.6b

SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation

Updated 1 month, 3 weeks ago 172.4K runs

lightweight-ai/model1

flux_schnell model img2img inference

Updated 1 month, 3 weeks ago 116.3K runs

black-forest-labs/flux-schnell-lora

The fastest image generation model tailored for fine-tuned use

Updated 1 month, 4 weeks ago 1.3M runs

black-forest-labs/flux-fill-dev

Open-weight inpainting model for editing and extending images. Guidance-distilled from FLUX.1 Fill [pro].

Updated 1 month, 4 weeks ago 435K runs

black-forest-labs/flux-1.1-pro-ultra

FLUX1.1 [pro] in ultra and raw modes. Images are up to 4 megapixels. Use raw mode for realism.

Updated 1 month, 4 weeks ago 13M runs

black-forest-labs/flux-1.1-pro

Faster, better FLUX Pro. Text-to-image model with excellent image quality, prompt adherence, and output diversity.

Updated 1 month, 4 weeks ago 35.8M runs

black-forest-labs/flux-pro

State-of-the-art image generation with top of the line prompt following, visual quality, image detail and output diversity.

Updated 1 month, 4 weeks ago 11.7M runs

black-forest-labs/flux-fill-pro

Professional inpainting and outpainting model with state-of-the-art performance. Edit or extend images with natural, seamless results.

Updated 1 month, 4 weeks ago 1.4M runs

black-forest-labs/flux-canny-pro

Professional edge-guided image generation. Control structure and composition using Canny edge detection

Updated 1 month, 4 weeks ago 259.1K runs

black-forest-labs/flux-depth-pro

Professional depth-aware image generation. Edit images while preserving spatial relationships.

Updated 1 month, 4 weeks ago 186K runs

ostris/flux-dev-lora-trainer

Fine-tune FLUX.1-dev using ai-toolkit

Updated 1 month, 4 weeks ago 638.1K runs

scademyai/bertiment

Simple binary sentiment analysis with BERT

Updated 1 month, 4 weeks ago 43 runs

aaronjmars/triposg

TripoSG unofficial implementation

Updated 1 month, 4 weeks ago 248 runs

aisha-ai-official/projectil-v3

Updated 1 month, 4 weeks ago 2.2K runs

aisha-ai-official/flux.1dev-uncensored-jibmix

Updated 1 month, 4 weeks ago 1K runs

goodguy1963/sdxl-finetunes-img2img

Updated 1 month, 4 weeks ago 43 runs

fire/trellis

For the paper "Structured 3D Latents for Scalable and Versatile 3D Generation".

Updated 2 months ago 269 runs

d3vshoaib/andro-upscaler

A model Flux.1-dev-Controlnet-Upscaler by www.androcoders.in

Updated 2 months ago 108 runs

wavespeedai/wan-2.1-t2v-480p

Accelerated inference for Wan 2.1 14B text to video, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.

Updated 2 months ago 100.2K runs

wavespeedai/wan-2.1-t2v-720p

Accelerated inference for Wan 2.1 14B text to video with high resolution, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.

Updated 2 months ago 31.4K runs

wavespeedai/wan-2.1-i2v-480p

Accelerated inference for Wan 2.1 14B image to video, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.

Updated 2 months ago 228.4K runs

wavespeedai/wan-2.1-i2v-720p

Accelerated inference for Wan 2.1 14B image to video with high resolution, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.

Updated 2 months ago 45.7K runs

voodoohop/stable-diffusion-dance

Updated 2 months ago 14 runs

cedoysch/flux-fill-redux-try-on

Virtual fitting of clothes

Updated 2 months ago 1.5K runs

ttsds/whisperspeech

Updated 2 months ago 1.3K runs

noahgsolomon/yumemono

YUMEMONO

Updated 2 months ago 36 runs

deepseek-ai/deepseek-v3

DeepSeek-V3-0324 is the leading non-reasoning model, a milestone for open source

Updated 2 months ago 1.7M runs

goodguy1963/good-sdxl-models-plus-loras

BROKEN - DO NOT USE!

Updated 2 months ago 188 runs

cuuupid/idm-vton

Best-in-class clothing virtual try on in the wild (non-commercial use only)

Updated 2 months ago 780.5K runs

jichengdu/spark-tts

0.5B

Updated 2 months ago 205 runs

ttsds/gptsovits_1

Updated 2 months ago 240 runs

jichengdu/llasa

8B TTS

Updated 2 months ago 76 runs

recraft-ai/recraft-v3

Recraft V3 (code-named red_panda) is a text-to-image model with the ability to generate long texts, and images in a wide list of styles. As of today, it is SOTA in image generation, proven by the Text-to-Image Benchmark by Artificial Analysis

Updated 2 months ago 3.6M runs

recraft-ai/recraft-v3-svg

Recraft V3 SVG (code-named red_panda) is a text-to-image model with the ability to generate high quality SVG images including logotypes, and icons. The model supports a wide list of styles.

Updated 2 months ago 122.1K runs