Explore

I want to…

Use official models

Official models are always on, maintained, and have predictable pricing.

Restore images

Models that improve or restore images by deblurring, colorizing, and removing noise.

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Detect objects

Models that detect or segment objects in images and videos.

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate.

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Latest models

Emu3-Gen for image generation

Updated 40 runs

Updated 1.7K runs

allenai/Molmo-7B-D-0924, answers questions about images and captions them

Updated 72.9K runs

🎼FluxMusic Text-to-Music Generation with Rectified Flow Transformer🎶

Updated 1.1K runs

Meta Llama 3.2 1B

Updated 1.5K runs

Meta Llama 3.2 1B

Updated 166 runs

Omni-Zero Couples: A diffusion pipeline for zero-shot stylized couples portrait creation.

Updated 3.8K runs

Bielik-11B-v2.3-Instruct is a generative text model made by SpeakLeash and Cyfronet featuring 11 billion parameters. It is a linear merge of the Bielik-11B-v2.0-Instruct, Bielik-11B-v2.1-Instruct, and Bielik-11B-v2.2-Instruct models.

Updated 1.1K runs

Implementation of tencent-ailab/IP-Adapter with ip-adapter-plus-face_sd15

Updated 141 runs

CogVLM2: Visual Language Models for Image and Video Understanding

Updated 614.3K runs

CogVLM2: Visual Language Models for Image and Video Understanding

Updated 480 runs

Quickly edit the expression of a face

Updated 57.7K runs

Seamless Speech Interaction with Large Language Models

Updated 58.3K runs

Ollama Qwen2.5 72b

Updated 704 runs

Image-to-Video Diffusion Models with An Expert Transformer

Updated 837 runs

Text-to-Video Diffusion Models with An Expert Transformer

Updated 239 runs

Explore how Flux Dev responds when you change the strengths of layers in the model. See readme for examples of how to select layers.

Updated 8.5K runs

Image Caption model

Updated 356 runs

FLUX.1-dev Inpainting ControlNet model

Updated 7.4K runs

Create lifelike interior designs with AI from text descriptions and image references.

Updated 1.6K runs

Run inpainting with Flux, compatible with Canny ControlNet, LoRAs and HyperFlux_8step

Updated 5.8K runs

An experimental FLUX-based model for creative research

Updated 37 runs

SD1.5 Canny controlnet with LoRA support.

Updated 491.8K runs

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 775.9M runs

⚡️FLUX PuLID: FLUX-dev based Pure and Lightning ID Customization via Contrastive Alignment🎭

Updated 938.7K runs

Controlling SD XL diffusion inference

Updated 7 runs

Interior remodelling that keeps windows, ceilings, and doors. Uses a depth ControlNet weighted to ignore existing furniture.

Updated 10.8K runs

Match a facial expression to a driving image, using LivePortrait as a base

Updated 35.2K runs

Updated 11 runs

Updated 8 runs

FalconAI's NSFW detection model, extended for videos

Updated 16.5K runs

Updated 205 runs

Video merging

Updated 1.3K runs

⚡️ Fast audio transcription | whisper large-v3 | speaker diarization | word & sentence level timestamps | prompt | hotwords

Updated 909.7K runs

Merge input images and audio into a keyframe video

Updated 5.1K runs

Video conversion toolkit

Updated 4 runs

Ollama Reflection 70b

Updated 1.6K runs

MiniCPM video understanding

Updated 390 runs

Fine-tuned version of the LLaMA-3.1-8B model, specifically optimized for tasks in finance, economics, trading, psychology, and social engineering.

Updated 47 runs

whisper-large-v3, incredibly fast, with speaker diarization, powered by Hugging Face Transformers! 🤗

Updated 127 runs

Updated 1.5K runs

Updated 121 runs

FLUX.1-Schnell LoRA Explorer

Updated 728.1K runs

Detect beats in music

Updated 12 runs

Compare NSFW models against inputs

Updated 104 runs

Chat with image or video.

Updated 493 runs

This project uses the Segment Anything 2 (SAM2) model to remove backgrounds from videos.

Updated 913 runs

Multi ControlNet Union Pro

Updated 70 runs

AI that transforms sketches into realistic images. Upload your drawing and describe it in the prompt. You can also adjust the ControlNet parameters and upscale the image to a higher resolution for better results.

Updated 1.5K runs

Kolors Model (Text2Img and Img2Img)

Updated 12.3K runs