Explore

Generated with davisbrown/flux-half-illustration

Fine-tune FLUX

Customize FLUX.1 [dev] with Ostris's AI Toolkit on Replicate. Train the model to recognize and generate new concepts using a small set of example images, for specific styles, characters, or objects. (Generated with davisbrown/flux-half-illustration.)

I want to…

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest quality open-source video generation model.

Upscale images

Upscaling models that create high-quality images from low-quality images

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Updated 61.3K runs

Updated 493 runs

Qwen 2: A 7 billion parameter language model from Alibaba Cloud, fine tuned for chat completions

Updated 1.7K runs

Qwen 2: A 1.5 billion parameter language model from Alibaba Cloud, fine tuned for chat completions

Updated 217 runs

A novel speech model for insane prosody.

Updated 470 runs

Qwen 2: A 0.5 billion parameter language model from Alibaba Cloud, fine tuned for chat completions

Updated 199 runs

good for video teaser backsound

Updated 58 runs

High resolution image Upscaler and Enhancer. Use at ClarityAI.co. A free Magnific alternative. Twitter/X: @philz1337x

Updated 13.5M runs

Updated 512 runs

✨Stable Diffusion 3 w/ ⚡InstantX's Canny, Pose, and Tile ControlNets🖼️

Updated 1.2K runs

A model for experimenting with all the SD3 settings. Non-commercial use only, unless you have a Stability AI Self Hosted License.

Updated 32.2K runs

Updated 181 runs

Stable Audio Open is an open-source model optimized for generating short audio samples, sound effects, and production elements using text prompts.

Updated 10.8K runs

Stable Diffusion 3 medium with added variability in outputs. Non-commercial use only, unless you have a Stability AI Self Hosted License.

Updated 20.2K runs

Transcribe saxophone solos directly from audio

Updated 185 runs

Real-Time Open-Vocabulary Object Detection using the xl weights

Updated 212.7K runs

MusicGen running on an a40 with 60 seconds max duration

Updated 834 runs

Updated 171 runs

Mobius, a diffusion model that pushes the boundaries of domain-agnostic debiasing and representation realignment

Updated 621 runs

DOVER video quality assessment tool, assigning videos both aesthetic and technical quality scores

Updated 27 runs

Generate Product photography backgrounds using Stable Diffusion

Updated 520 runs

DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation. Hologram optimized

Updated 343 runs

Transfer learning models for music classification by genres, moods, and instrumentation

Updated 10.2K runs

🫦 Realistic facial expression manipulation (lip-syncing) using audio or video

Updated 1K runs

Super fast clothing (and face) segmentation and masking with erosion and dilation capability, made for https://outfit.fm

Updated 16.1K runs

The best Pony-SDXL models! Current one is based on Pony Realism.

Updated 87.5K runs

Updated 181 runs

# Interior Decoration Space Scaling - First Use Case

Updated 66 runs

A tiny model for testing out Cog

Updated 1.1K runs

Updated 8.2K runs

Updated 1.6K runs

Create images of a given character in different poses

Updated 974.7K runs

Updated 172 runs

Real-Time High Quality Lip Synchronization with Latent Space Inpainting

Updated 2.5K runs

Turns 10 mp4 into 1

Updated 72 runs

An improved outpainting model that supports LoRA urls. This model uses PatchMatch to improve the mask quality.

Updated 80.6K runs

Llama-3-8B finetuned with ReFT to hyperfocus on New Jersey, the Garden State, the best state, the only state!

Updated 106 runs

🎙️Hololive text-to-speech and voice-to-voice (Japanese🇯🇵 + English🇬🇧)

Updated 854 runs

Upscaler and detailer for a selected area

Updated 4.8K runs

Convert LLM's coding to image generation

Updated 1.9K runs

epiCRealism v7-Final Destination. Top Realism Model on Civitai

Updated 1.7K runs

blue_pencil-XL meets ANIMAGINE XL 3.0 / ANIMAGINE XL 3.1, The top ranked model on Civitai

Updated 3.8K runs

Updated 118 runs

This is an implementation of the ChatTTS as a Cog model.

Updated 3.1K runs

Stylized Audio-Driven Single Image Talking Face Animation

Updated 130.4K runs

Recreate images with Emojis

Updated 203 runs

Fast and High-Quality Text-to-video Generation

Updated 4.6K runs

A PhotoBooth style transfer workflow that utilizes IPadapter Style, Canny, OpenPose, RemoveBackground, HumanSegmentation, Cloth Segmentation for initial input, and concludes with the application of DeepFake techniques.

Updated 181 runs