Explore

Fine-tune FLUX fast

Customize FLUX.1 [dev] with the fast FLUX trainer on Replicate

Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. It's fast (under 2 minutes), cheap (under $2), and gives you a warm, runnable model plus LoRA weights to download.

Official models

Official models are always on, maintained, and have predictable pricing.

View all official models

I want to…

Upscale images

Upscaling models that create high-quality images from low-quality images

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Try for free

Get started with these models without adding a credit card. Whether you're making videos, generating images, or upscaling photos, these are great starting points.

Detect objects

Models that detect or segment objects in images and videos.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Latest models

The Qwen3 Embedding model series is specifically designed for text embedding and ranking tasks

Updated 6 runs

Voxtral builds upon Ministral-3B with powerful audio understanding capabilities

Updated 3 runs

Updated 33.9K runs

Use this fast version of Imagen 4 when speed and cost are more important than quality

Updated 130.3K runs

Use this ultra version of Imagen 4 when quality matters more than speed and cost

Updated 129.1K runs

Google's Imagen 4 flagship model

Updated 1.1M runs

A faster and cheaper Imagen 3 model, for when price or speed are more important than final image quality

Updated 265.3K runs

Google's highest quality text-to-image model, capable of generating images with detail, rich lighting and beauty

Updated 1.3M runs

Sound on: Google’s flagship Veo 3 text to video model, with audio

Updated 120K runs

A faster and cheaper version of Google’s Veo 3 video model, with audio

Updated 6K runs

🎧Advanced audio understanding with step-by-step reasoning📣

Updated 77 runs

An AI system that can create realistic images and art from a description in natural language.

Updated 2.8K runs

A multimodal image generation model that creates high-quality images. You need to bring your own verified OpenAI key to use this model. Your OpenAI account will be charged for usage.

Updated 201.9K runs

This model generates beautiful cinematic 2 megapixel images in 3-6 seconds and is derived from the wan2.1 model through optimisation techniques from the pruna package

Updated 1.2K runs

Updated 1 run

Updated 7 runs

Commercial-ready, trained entirely on licensed data, text-to-image model. With only 4B parameters provides exceptional aesthetics and text rendering. Evaluated to be on par to other leading models in the market

Updated 902 runs

Text-guided image editing model that preserves original details while making targeted modifications like lighting changes, object removal, and style conversion

Updated 8.6K runs

A pro version of Seedance that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 1080p resolution

Updated 93.5K runs

A video generation model that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 720p resolution

Updated 120.7K runs

Updated 42 runs

A premium text-based image editing model that delivers maximum performance and improved typography generation for transforming images through natural language prompts

Updated 2.7M runs

A state-of-the-art text-based image editing model that delivers high-quality outputs with excellent prompt following and consistent results for transforming images through natural language

Updated 9.8M runs

Granite-speech-3.3-8b is a compact and efficient speech-language model, specifically designed for automatic speech recognition (ASR) and automatic speech translation (AST).

Updated 13 runs

Recognise objects within an image with great accuracy.

Updated 11 runs

Dia 1.6B by Nari Labs, Generates realistic dialogue audio from text, including non-verbal cues and voice cloning

Updated 6.8K runs

Train subjects or styles faster than ever

Updated 24.7K runs

WhisperX that works with multiple chunks, download, processing and merging the results

Updated 8 runs

Granite-vision-3.3-2b is a compact and efficient vision-language model, specifically designed for visual document understanding, enabling automated content extraction from tables, charts, infographics, plots, diagrams, and more.

Updated 1.2K runs

Remove image background with custom model to better result.

Updated 417 runs

Fast endpoint for Flux Kontext, optimized with pruna framework

Updated 241.2K runs

Audio to text transcriptions

Updated 580 runs

This is the fastest Flux Dev endpoint in the world, contact us for more at pruna.ai

Updated 7.9M runs

This is an optimised version of the hidream-full model using the pruna ai optimisation toolkit!

Updated 23.4K runs

This is an optimised version of the hidream-l1-dev model using the pruna ai optimisation toolkit!

Updated 34.6K runs

This is an optimised version of the hidream-l1 model using the pruna ai optimisation toolkit!

Updated 1.2M runs

Realistic Inpainting with ControlNET (M-LSD + SEG)

Updated 535.4K runs

Simple tool to quickly trim a video

Updated 13 runs