Explore

Generated with davisbrown/flux-half-illustration

Fine-tune FLUX

Customize FLUX.1 [dev] with Ostris's AI Toolkit on Replicate. Using a small set of example images, train the model to recognize and generate new concepts such as specific styles, characters, or objects.

I want to…

Make videos with Wan2.1

Generate videos with Wan2.1, the fastest and highest-quality open-source video generation model.

Upscale images

Upscaling models that create high-quality images from low-quality images.

Restore images

Models that improve or restore images by deblurring, colorizing, and removing noise.

Use official models

Official models are always on, maintained, and have predictable pricing.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate.

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

wan-video/wan-2.1-1.3b

Generate 5s 480p videos. Wan is an advanced and powerful visual generation model developed by Tongyi Lab of Alibaba Group.

Updated 14.5K runs

ideogram-ai/ideogram-v2a-turbo

Like Ideogram v2 turbo, but now faster and cheaper

Updated 187K runs

ideogram-ai/ideogram-v2a

Like Ideogram v2, but faster and cheaper

Updated 533.9K runs

In-Context LoRA with Image-to-Image and Inpainting to apply your logo to anything

Updated 8.3K runs

anthropic/claude-3.7-sonnet

The most intelligent Claude model and the first hybrid reasoning model on the market (claude-3-7-sonnet-20250219)

Updated 1.3M runs

Generate high-quality videos from text prompts using StepVideo

Updated 288 runs

minimax/video-01-director

Generate videos with specific camera movements

Updated 23.5K runs

Make your iPhone photos look like they were taken with an old digital camera

Updated 68 runs

Realistic text-to-image by TiwazM

Updated 1.7K runs

Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Updated 88 runs

Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Updated 26 runs

Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Updated 51 runs

Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Updated 20 runs

Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Updated 204 runs

A version of the DeepSeek-R1 model that has been post-trained by Perplexity to provide unbiased, accurate, and factual information

Updated 223 runs

ECCV2022 Quick background removal

Updated 42 runs

A good anime merge from 12 other models

Updated 1.1K runs

Great text-to-image model by Cagliostro Lab

Updated 2.9K runs

⚡️ Blazing fast audio transcription with speaker diarization | Whisper Large V3 Turbo | word & sentence level timestamps | prompt

Updated 1.2M runs

OmniParser is a screen parsing tool that converts general GUI screens into structured elements.

Updated 39.5K runs

Use a mask to inpaint the image or generate a prompt based on the mask.

Updated 42.7K runs

Place items in a scene without needing to train on them

Updated 2.5K runs