Explore

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. It's fast (under 2 minutes), cheap (under $2), and gives you a warm, runnable model plus LoRA weights to download.

Official models

Official models are always on, maintained, and have predictable pricing.

View all official models

I want to…

Upscale images

Upscaling models that create high-quality images from low-quality images

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Try for free

Get started with these models without adding a credit card. Whether you're making videos, generating images, or upscaling photos, these are great starting points.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Latest models

Generates unrestricted images from text prompts using a fine-tuned Stable Diffusion model

Updated 89 runs

Open-weight version of FLUX.1 Kontext

Updated 22.9K runs

Recognise, describe and retrieve data within an image with great accuracy.

Updated 10 runs

[Quality Mode] Scaling Diffusion Models for High Resolution Textured 3D Assets Generation

Updated 430 runs

Updated 33 runs

Updated 31.3K runs

Run any ComfyUI workflow. Guide: https://github.com/replicate/cog-comfyui

Updated 4.4M runs

Fast endpoint for Flux Kontext, optimized with pruna framework

Updated 5.6K runs

Converts images or text into 512-dimensional vector embeddings.

Updated 5.2K runs

A text-to-image model with support for native high-resolution (2K) image generation

Updated 20K runs

A version of flux-dev, a text to image model, that supports fast fine-tuned lora inference

Updated 2.6M runs

A 12 billion parameter rectified flow transformer capable of generating images from text descriptions

Updated 21M runs

The fastest image generation model tailored for local development and personal use

Updated 387.1M runs

OmniGen2: a powerful and efficient unified multimodal model

Updated 391 runs

A pro version of Seedance that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 1080p resolution

Updated 12.5K runs

A video generation model that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 720p resolution

Updated 18.9K runs

The fastest image generation model tailored for fine-tuned use

Updated 1.8M runs

Generate vector embeddings from various content types including text, images, audio, and PDF files.

Updated 2 runs

Quickly detect nudity, violence, hentai, porn and more NSFW content in images.

Updated 2 runs

Translate text from one language to another with support for multiple text formats.

Updated 3 runs

Transform text into natural-sounding human-like AI voices with low latency and exceptional quality.

Updated 10 runs

Recognise objects within an image with great accuracy.

Updated 6 runs

Uses DINO to detect regions and further refines them with SAM. Returns masking data as RLE encoded JSON.

Updated 326 runs

Effortlessly search the Web and get access to high-quality results powered with AI.

Updated 4 runs

Generate an image based on the given text by employing AI models like Flux, Stable Diffusion, and other top models.

Updated 5 runs

Generate an image based on the given text by employing AI models like Flux, Stable Diffusion, and other top models.

Updated 11 runs

Translate text from one language to another with support for multiple text formats.

Updated 7 runs

OneFormer: One Transformer to Rule Universal Image Segmentation

Updated 38 runs

Fastest, most cost-effective GPT-4.1 model from OpenAI

Updated 48.9K runs

Fast, affordable version of GPT-4.1

Updated 770.3K runs

OpenAI's high-intelligence chat model

Updated 104.6K runs

Updated 56 runs

Low latency, low cost version of OpenAI's GPT-4o model

Updated 1.6M runs

Updated 158 runs

A project that I recycled for many of my classes as their Final Project assignments.

Updated 106 runs

[Turbo Mode] Scaling Diffusion Models for High Resolution Textured 3D Assets Generation

Updated 13.6K runs

Updated 283 runs

Document Image Parsing via Heterogeneous Anchor Prompting

Updated 59 runs

Generate expressive, natural speech. Features unique emotion control, instant voice cloning from short audio, and built-in watermarking.

Updated 8.8K runs

flip-safe-detector checks whether an image (such as a logo) contains visible or readable text that would be distorted or reversed if horizontally flipped. It is especially useful in automation pipelines, design tools, or print workflows where flipping an

Updated 15 runs

Converts any logo image to pure white while preserving transparency. Perfect for mockups, printing, or overlaying on dark backgrounds. Accepts PNG or JPG input and returns a whitened version of the logo in seconds.

Updated 11 runs

A premium version of Kling v2.1 with superb dynamics and prompt adherence. Generate 1080p 5s and 10s videos from text or an image

Updated 5.5K runs

Use Kling v2.1 to generate 5s and 10s videos in 720p and 1080p resolution from a starting image (image-to-video)

Updated 22.3K runs

Generate 6s videos with prompts or images. (Also known as Hailuo). Use a subject reference to make a video with a character and the S2V-01 model.

Updated 524K runs

An image-to-video (I2V) model specifically trained for Live2D and general animation use cases

Updated 125.9K runs

Generate videos with specific camera movements

Updated 39.6K runs

LBM: Latent Bridge Matching for Fast Image-to-Image Translation

Updated 20 runs

Generate expressive, natural speech with Resemble AI's Chatterbox.

Updated 1.2K runs

Sound on: Google’s flagship Veo 3 text to video model, with audio

Updated 95.9K runs

Latest Pony Realism Model. Try it with WEIGHTS on creatorframes.com

Updated 4.2K runs