Explore

Fine-tune FLUX

Customize FLUX.1 [dev] with Ostris's AI Toolkit on Replicate. Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. (Generated with davisbrown/flux-half-illustration.)

I want to…

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest-quality open-source video generation model.

Upscale images

Upscaling models that create high-quality images from low-quality images

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

Upload an image or video, and Video-LLaVa will give you a text description of what it "sees."

Updated 97 runs

without examination qwen2.5 32b

Updated 270 runs

FLUX.1 [dev] (LoRA) with several optimizations such as FP8 Quantization

Updated 76 runs

Clean Text from Manhwa/Manhua

Updated 8 runs

Interior Decoration Space Scaling - Second Use Case

Updated 70 runs

a model to get images

Updated 276 runs

This model is used to generate speech

Updated 35 runs

An F5-TTS model fine-tuned for Spanish

Updated 500 runs

Dreamlike Diffusion Model for Splurge Art

Updated 2.5K runs

From Sketch to Reality: Transforming Outlines into Lifelike Images

Updated 47.9K runs

baby transformer for blog post

Updated 30 runs

staging testing

Updated 258 runs

Document translation with contextual integrity.

Updated 57 runs

Align text to audio with exact word timings. All characters supported!

Updated 112.9K runs

Projection module trained to add vision capabilities to Llama 3 using SigLIP

Updated 5.6K runs

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

Updated 23 runs

Apple's monocular depth estimation foundation model (Depth Pro)

Updated 1.6K runs

OmniGen: Unified Image Generation

Updated 12.2K runs

Create audio clips from text

Updated 9 runs

Explorador FLUX.1-Dev LoRA

Updated 87 runs

Fine-tune StableDiffusion3.5-Large with Hugging Face Diffusers

Updated 572 runs

Run any Python code

Updated 6.6K runs

Ostris AI-Toolkit for StableDiffusion3.5-Large LoRA Training

Updated 273 runs

stability-ai/stable-diffusion-3.5-medium

2.5 billion parameter image model with improved MMDiT-X architecture

Updated 42.6K runs

Analyzes music to determine song structure, BPM, and downbeats, and demuxes audio

Updated 645 runs

Sayak Paul's cartoonizer, deployed to Replicate. Here's the model: https://huggingface.co/instruction-tuning-sd/cartoonizer

Updated 173 runs

flux.1-lite-8B-alpha by Freepik

Updated 320 runs

Stable Diffusion 3.5 Large - LoRA Explorer

Updated 1.9K runs

One shot portrait maker.

Updated 30.2K runs

Remove the background of a video and add your own

Updated 322 runs

Powerful text-to-video model that generates high-quality videos up to 6 seconds long at 15 FPS and 720p resolution from a simple text prompt

Updated 131 runs

fancyfeast/joytag

Updated 18.4K runs

F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching. Voice cloning

Updated 19.7K runs

ChatTTS is a text-to-speech model designed specifically for dialogue scenarios such as LLM assistants.

Updated 133 runs

stability-ai/stable-diffusion-3.5-large-turbo

A text-to-image model that generates high-resolution images with fine details. It supports various artistic styles and produces diverse outputs from the same prompt, with a focus on fewer inference steps

Updated 354.2K runs

stability-ai/stable-diffusion-3.5-large

A text-to-image model that generates high-resolution images with fine details. It supports various artistic styles and produces diverse outputs from the same prompt, thanks to Query-Key Normalization.

Updated 1.3M runs

ideogram-ai/ideogram-v2-turbo

A fast image model with state-of-the-art inpainting, prompt comprehension, and text rendering.

Updated 1.9M runs