Explore

Featured models

black-forest-labs / flux-canny-pro

Professional edge-guided image generation. Control structure and composition using Canny edge detection

black-forest-labs / flux-fill-pro

Professional inpainting and outpainting model with state-of-the-art performance. Edit or extend images with natural, seamless results.

8K runs

black-forest-labs / flux-1.1-pro-ultra

FLUX1.1 [pro] in ultra and raw modes. Images are up to 4 megapixels. Use raw mode for realism.

881.2K runs

black-forest-labs / flux-redux-dev

Open-weight image variation model. Create new versions while preserving key elements of your original.

5K runs

recraft-ai / recraft-v3

Recraft V3 (code-named red_panda) is a text-to-image model with the ability to generate long texts, and images in a wide list of styles. As of today, it is SOTA in image generation, proven by the Text-to-Image Benchmark by Artificial Analysis

277.5K runs

ibm-granite / granite-3.0-8b-instruct

Granite-3.0-8B-Instruct is a lightweight and open-source 8B parameter model is designed to excel in instruction following tasks such as summarization, problem-solving, text translation, reasoning, code tasks, function-calling, and more.

117.9K runs

I want to…

Generate images

Models that generate images from text prompts

Use a language model

Models that can understand and generate text

Upscale images

Upscaling models that create high-quality images from low-quality images

Caption images

Models that generate text from images

The FLUX family of models

The FLUX family of text-to-image models from Black Forest Labs

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Get embeddings

Models that generate embeddings from inputs

Extract text from images

Optical character recognition (OCR) and text extraction

Transcribe speech

Models that convert speech to text

Use handy tools

Toolbelt-type models for videos and images.

Chat with images

Ask language models about images

Edit images

Tools for manipulating images.

Use a face to make images

Make realistic images of people instantly

Flux fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Generate music

Models to generate and modify music

Generate videos

Models that create and edit videos

Generate speech

Convert text to speech

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Get structured data

Language models that support grammar-based decoding as well as jsonschema constraints.

Popular models

abiruyt/text-extract-ocr

A simple OCR Model that can easily extract text from an image.

Updated 1 year, 1 month ago 60.6M runs

bytedance/sdxl-lightning-4step

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 2 months, 1 week ago 574.6M runs

falcons-ai/nsfw_image_detection

Fine-Tuned Vision Transformer (ViT) for NSFW Image Classification

Updated 1 year ago 28.7M runs

zf-kbot/sd-inpaint

Fill in masked parts of images with Stable Diffusion

Updated 2 months, 2 weeks ago 7.4M runs

salesforce/blip

Generate image captions

Updated 2 years, 1 month ago 108.4M runs

yorickvp/llava-13b

Visual instruction tuning towards large language and vision models with GPT-4 level capabilities

Updated 4 months, 1 week ago 19.1M runs

andreasjansson/clip-features

Return CLIP features for the clip-vit-large-patch14 model

Updated 1 year, 8 months ago 64.3M runs

stability-ai/sdxl

A text-to-image generative AI model that creates beautiful images

Updated 6 months ago 70.6M runs

Latest models

adirik/syncdiffusion

Generate panoramic images with text prompts

Updated 9 months, 3 weeks ago 117 runs

jyoung105/honeybee

Locality-enhanced Projector for Multimodal LLM

Updated 9 months, 3 weeks ago 24 runs

siamakf/campfire-all-characters

Updated 9 months, 3 weeks ago 54 runs

meta/codellama-70b-python

A 70 billion parameter Llama tuned for coding with Python

Updated 9 months, 3 weeks ago 1.1K runs

jyoung105/imp

a family of multimodal small language models

Updated 9 months, 3 weeks ago 66 runs

spuuntries/flatdolphinmaid-8x7b-gguf

Undi95's FlatDolphinMaid 8x7B Mixtral Merge, GGUF Q5_K_M quantized by TheBloke.

Updated 9 months, 3 weeks ago 413.2K runs

fadighawanmeh/fadi-musicgen1

Generate Arab Maqam Melodic Improvisations (Taqasim)

Updated 9 months, 4 weeks ago 22 runs

msamogh/iiu-generator-llama2-7b-2

Updated 9 months, 4 weeks ago 14 runs

tgohblio/instant-id-albedobase-xl

InstantID : Zero-shot Identity-Preserving Generation in Seconds with ⚡️LCM-LoRA⚡️. Using AlbedoBase-XL v2.0 as base model.

Updated 9 months, 4 weeks ago 104.8K runs

lucataco/img-and-audio2video

Take an image and an audio file and create a video clip

Updated 9 months, 4 weeks ago 2.4K runs

daun-io/openroleplay.ai-animagine-v3

Fork of cagliostrolab/animagine-xl-3, an anime style Stable Diffusion XL

Updated 9 months, 4 weeks ago 5.5K runs

lucataco/watermark_detector

amrul-hzz's fine-tuned version of vit-base-patch16-224-in21k for watermark image detection

Updated 9 months, 4 weeks ago 275 runs

asiryan/proteus-v0.2

Proteus v0.2 Model (Text2Img, Img2Img and Inpainting)

Updated 10 months ago 13.5K runs

fofr/txt2img

Many models: RealVisXL, Juggernaut, Proteus, DreamShaper, etc.

Updated 10 months ago 10.5K runs

lebonze/museai

I fed the beast my oil paintings, made in the south of France. (version ec0d4305 is my fav)

Updated 10 months ago 4.2K runs

dsingal0/mixtral-single-gpu

Runs Mixtral 8x7B on a single A40 GPU

Updated 10 months ago 55 runs

sakemin/musicgen-remixer

Remix the music into another styles with MusicGen Chord

Updated 10 months ago 9.2K runs

lucataco/moondream1

(Research only) Moondream1 is a vision language model that performs on par with models twice its size

Updated 10 months ago 10.4K runs

datacte/proteus-v0.2

Proteus v0.2 shows subtle yet significant improvements over Version 0.1. It demonstrates enhanced prompt understanding that surpasses MJ6, while also approaching its stylistic capabilities.

Updated 10 months ago 8.2M runs

jyoung105/moondream

Tiny vision language model

Updated 10 months ago 293 runs

spuuntries/borealis-10.7b-dpo-gguf

Undi95's Borealis 10.7B Mistral DPO Finetune, GGUF Q5_K_M quantized by Undi95.

Updated 10 months ago 73 runs

grandlineai/instant-id-photorealistic

InstantID : Zero-shot Identity-Preserving Generation in Seconds. Using Juggernaut-XL v8 as the base model to encourage photorealism

Updated 10 months ago 28.8K runs

grandlineai/instant-id-artistic

InstantID : Zero-shot Identity-Preserving Generation in Seconds. Using Dreamshaper-XL as the base model to encourage artistic generations

Updated 10 months ago 1.9K runs

nateraw/musicgen-songstarter-v0.1

Generate song ideas!

Updated 10 months ago 578 runs

cjwbw/depth-anything

Highly practical solution for robust monocular depth estimation by training on a combination of 1.5M labeled images and 62M+ unlabeled images

Updated 10 months ago 4.6K runs

lucataco/siglip

SigLIP proposes to replace the loss function used in CLIP by a simple pairwise sigmoid loss

Updated 10 months ago 138 runs

lucataco/wizardcoder-33b-v1.1-gguf

WizardCoder: Empowering Code Large Language Models with Evol-Instruct

Updated 10 months ago 17K runs

cjwbw/tokenflow

Consistent Diffusion Features for Consistent Video Editing

Updated 10 months ago 2K runs

artificialguybr/nebul.redmond

Nebul.Redmond - Stable Diffusion SD XL Finetuned Model

Updated 10 months ago 16.2K runs

tencentarc/photomaker-style

Create photos, paintings and avatars for anyone in any style within seconds. (Stylization version)

Updated 10 months ago 863.8K runs

pollinations/amt

Video Smoother: AMT All-Pairs Multi-Field Transforms for Efficient Frame Interpolation

Updated 10 months ago 16K runs

meepo-pro-player/invoker

Updated 10 months ago 34.8K runs

sontungpytn/comfyui-lora-upscaler

Updated 10 months ago 85.4K runs

kcaverly/neuralbeagle14-7b-gguf

NeuralBeagle14-7B is (probably) the best 7B model you can find!

Updated 10 months ago 12.2K runs

voku682/video_style_transfer

Updated 10 months ago 263 runs

tomasmcm/sensei-7b-v1

Source: SciPhi/Sensei-7B-V1 ✦ Quant: TheBloke/Sensei-7B-V1-AWQ ✦ Sensei is specialized in performing RAG over detailed web search results

Updated 10 months ago 34 runs

tomasmcm/whiterabbitneo-13b

Source: WhiteRabbitNeo/WhiteRabbitNeo-13B-v1 ✦ TheBloke/WhiteRabbitNeo-13B-AWQ ✦ WhiteRabbitNeo is a model series that can be used for offensive and defensive cybersecurity

Updated 10 months ago 115 runs