Explore

Featured models

black-forest-labs / flux-canny-pro

Professional edge-guided image generation. Control structure and composition using Canny edge detection.

black-forest-labs / flux-fill-pro

Professional inpainting and outpainting model with state-of-the-art performance. Edit or extend images with natural, seamless results.

8.3K runs

black-forest-labs / flux-1.1-pro-ultra

FLUX1.1 [pro] in ultra and raw modes. Images are up to 4 megapixels. Use raw mode for realism.

887.7K runs

black-forest-labs / flux-redux-dev

Open-weight image variation model. Create new versions while preserving key elements of your original.

5K runs

recraft-ai / recraft-v3

Recraft V3 (code-named red_panda) is a text-to-image model that can generate long passages of text and images in a wide range of styles. As of today, it is SOTA in image generation, according to the Text-to-Image Benchmark by Artificial Analysis.

278K runs

ibm-granite / granite-3.0-8b-instruct

Granite-3.0-8B-Instruct is a lightweight, open-source 8B-parameter model designed to excel in instruction-following tasks such as summarization, problem-solving, text translation, reasoning, code tasks, function-calling, and more.

117.9K runs

I want to…

Generate images

Models that generate images from text prompts

Use a language model

Models that can understand and generate text

Upscale images

Upscaling models that create high-quality images from low-quality images

Caption images

Models that generate text from images

The FLUX family of models

The FLUX family of text-to-image models from Black Forest Labs

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Get embeddings

Models that generate embeddings from inputs

Extract text from images

Optical character recognition (OCR) and text extraction

Transcribe speech

Models that convert speech to text

Use handy tools

Toolbelt-type models for videos and images.

Chat with images

Ask language models about images

Edit images

Tools for manipulating images.

Use a face to make images

Make realistic images of people instantly

Flux fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Generate music

Models to generate and modify music

Generate videos

Models that create and edit videos

Generate speech

Convert text to speech

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Get structured data

Language models that support grammar-based decoding as well as JSON Schema constraints.

Popular models

abiruyt/text-extract-ocr

A simple OCR model that can easily extract text from an image.

Updated 1 year, 1 month ago 61.1M runs

bytedance/sdxl-lightning-4step

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 2 months, 1 week ago 574.8M runs

falcons-ai/nsfw_image_detection

Fine-Tuned Vision Transformer (ViT) for NSFW Image Classification

Updated 1 year ago 28.8M runs

salesforce/blip

Generate image captions

Updated 2 years, 1 month ago 108.4M runs

zf-kbot/sd-inpaint

Fill in masked parts of images with Stable Diffusion

Updated 2 months, 2 weeks ago 7.4M runs

nightmareai/real-esrgan

Real-ESRGAN with optional face correction and adjustable upscale

Updated 4 months ago 56.1M runs

andreasjansson/blip-2

Answers questions about images

Updated 1 year ago 26.4M runs

stability-ai/sdxl

A text-to-image generative AI model that creates beautiful images

Updated 6 months ago 70.6M runs

Latest models

lucataco/sdxl-deepcache

SDXL using DeepCache

Updated 10 months, 2 weeks ago 3.8K runs

hyuse202/sef

Chest X-ray

Updated 10 months, 2 weeks ago 2.5K runs

vkolagotla/bapubomma_ai

A LoRA fine-tuned version of SDXL trained on the artwork of the late legendary Indian artist Bapu

Updated 10 months, 2 weeks ago 230 runs

genkernel/headshot-public

Just an experiment. Nothing new here.

Updated 10 months, 2 weeks ago 2K runs

ali-vilab/anydoor

Anydoor: zero-shot object-level image customization

Updated 10 months, 2 weeks ago 1.9K runs

fofr/realvisxl-v3-multi-controlnet-lora

RealVisXl V3 with multi-controlnet, lora loading, img2img, inpainting

Updated 10 months, 2 weeks ago 944.5K runs

tomasmcm/pandalyst-7b-v1.2

Source: pipizhao/Pandalyst-7B-V1.2 ✦ Quant: TheBloke/Pandalyst-7B-v1.2-AWQ ✦ Pandalyst: A large language model for mastering data analysis using pandas

Updated 10 months, 2 weeks ago 18 runs

hamelsmu/honeycomb

Honeycomb NLQ Generator

Updated 10 months, 3 weeks ago 36 runs

ali-vilab/i2vgen-xl

RESEARCH/NON-COMMERCIAL USE ONLY: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models

Updated 10 months, 3 weeks ago 112.6K runs

anotherjesse/fastai-bird

fastai lesson 1 - bird or forest

Updated 10 months, 3 weeks ago 226 runs

lucataco/tinyllama-1.1b-chat-v1.0

This is the chat model finetuned on top of TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T

Updated 10 months, 3 weeks ago 377 runs

nvlabs/parakeet-rnnt-1.1b

🗣️ Nvidia + Suno.ai's speech-to-text conversion with high accuracy and efficiency 📝

Updated 10 months, 3 weeks ago 1.6K runs

archievilliers/sdxl-picasso

An SDXL fine-tune based on Picasso's work

Updated 10 months, 3 weeks ago 242 runs

georgedavila/sdxl-basquiat

SDXL LoRA finetuned on Basquiat Paintings

Updated 10 months, 3 weeks ago 166 runs

arusterholz-edu/bioshock

Updated 10 months, 3 weeks ago 45 runs

cudanexus/nougat

Nougat: Neural Optical Understanding for Academic Documents

Updated 10 months, 3 weeks ago 224 runs

matsuitran/sdxl-anime-schoolboy

Stable Diffusion XL fine-tuned in the theme of Anime Schoolboy

Updated 10 months, 3 weeks ago 338 runs

sakemin/audiosr-long-audio

Versatile Audio Super-resolution at Scale, which upsamples audio files to 48 kHz. Longer audio input is possible with this model.

Updated 10 months, 3 weeks ago 2.1K runs

cudanexus/detic

Detecting Twenty-thousand Classes using Image-level Supervision

Updated 10 months, 3 weeks ago 264 runs

matsuitran/sdxl-anime-schoolgirl

Stable Diffusion XL fine-tuned in the theme of Anime Schoolgirl

Updated 10 months, 3 weeks ago 246 runs

tomasmcm/tinyllama-1.1b-chat-v1.0

Source: TinyLlama/TinyLlama-1.1B-Chat-v1.0 ✦ Quant: TheBloke/TinyLlama-1.1B-Chat-v1.0-AWQ ✦ The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Updated 10 months, 3 weeks ago 104 runs

georgedavila/ay-chihuahua-sdxl-lora

SDXL LoRA I trained on chihuahua images

Updated 10 months, 3 weeks ago 24 runs

sakemin/pytsmod

PyTSMod is an open-source library for Time-Scale Modification (e.g. time-stretching) algorithms, by Sangeon Yong at MAC Lab, KAIST.

Updated 10 months, 3 weeks ago 171 runs

georgedavila/bart-large-mnli-classifier

Zero-shot classifier which classifies text into categories of your choosing. Returns a dictionary of the most likely class and all class likelihoods.

Updated 10 months, 3 weeks ago 410 runs

nateraw/nous-hermes-2-solar-10.7b

Nous Hermes 2 - SOLAR 10.7B is the flagship Nous Research model on the SOLAR 10.7B base model.

Updated 10 months, 3 weeks ago 65.3K runs

kcaverly/nous-hermes-2-solar-10.7b-gguf

Nous Hermes 2 - SOLAR 10.7B is the flagship Nous Research model on the SOLAR 10.7B base model.

Updated 10 months, 3 weeks ago 7.7K runs

nqvinh/sdxl-heroestd

Updated 10 months, 3 weeks ago 127 runs

open-mmlab/pia

Personalized Image Animator

Updated 10 months, 3 weeks ago 100.3K runs

tomasmcm/docsgpt-7b-mistral

Source: Arc53/docsgpt-7b-mistral ✦ Quant: TheBloke/docsgpt-7B-mistral-AWQ ✦ DocsGPT is optimized for Documentation (RAG), fine-tuned for providing answers that are based on context

Updated 10 months, 3 weeks ago 74 runs

alexgenovese/upscaler

GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration

Updated 10 months, 3 weeks ago 792.9K runs

batouresearch/open-dalle-1.1-lora

Better than SDXL at both prompt adherence and image quality, by dataautogpt3

Updated 10 months, 3 weeks ago 128.2K runs

fictions-ai/autocaption

Automatically add captions to a video

Updated 10 months, 4 weeks ago 25.9K runs

bawgz/stable-dripfusion-2

Updated 10 months, 4 weeks ago 259 runs

musicly-ai/singing_voice_conversion

This is the Replicate version of singing_voice_conversion from Amphion

Updated 10 months, 4 weeks ago 516 runs

charlesmccarthy/animagine-xl

Animagine XL 2.0 is an advanced latent text-to-image diffusion model designed to create high-resolution, detailed anime images.

Updated 10 months, 4 weeks ago 7.4K runs

moayedhajiali/elasticdiffusion

ElasticDiffusion: Training-free Arbitrary Size Image Generation

Updated 10 months, 4 weeks ago 168 runs

zsxkib/patch-fusion

Super High Quality Depth Maps 🗺️: An End-to-End Tile-Based Framework 🏗️ for High-Resolution Monocular Metric Depth Estimation 🔍📏

Updated 10 months, 4 weeks ago 331 runs

lucataco/open-dalle-v1.1

A unique fusion that showcases exceptional prompt adherence and semantic understanding. It seems to be a step above base SDXL and a step closer to DALLE-3 in terms of prompt comprehension.

Updated 10 months, 4 weeks ago 122.1K runs

lucataco/diffusion-motion-transfer

Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer

Updated 10 months, 4 weeks ago 172 runs

kcaverly/nous-hermes-2-yi-34b-gguf

Nous Hermes 2 - Yi-34B is a state-of-the-art Yi fine-tune, trained on GPT-4-generated synthetic data

Updated 10 months, 4 weeks ago 11.3K runs

charlesmccarthy/terminus-xl-otaku-v1

Terminus XL Otaku is a latent diffusion model that uses a zero-terminal SNR noise schedule and a velocity prediction objective at training and inference time.

Updated 11 months ago 39 runs

usamaehsan/controlnet-x-majic-mix-realistic-x-ip-adapter

Works with inpainting, with multi-ControlNet or single-ControlNet, and with or without IP-Adapter

Updated 11 months ago 23.7K runs

cjwbw/faster-diffusion

Rethinking the Role of UNet Encoder in Diffusion Models

Updated 11 months ago 132 runs

charlesmccarthy/terminus-xl-gamma-v2

Terminus XL Gamma is a new state-of-the-art latent diffusion model that uses a zero-terminal SNR noise schedule and a velocity prediction objective at training and inference time.

Updated 11 months ago 271 runs

illscience/dreaminaudio

Fine-tune of MusicGen with tracks from my record label Dream In Audio.

Updated 11 months ago 127 runs

tomasmcm/sam-7b

Source: SuperAGI/SAM ✦ Quant: TheBloke/SAM-AWQ ✦ SAM (Small Agentic Model), a 7B model that demonstrates impressive reasoning abilities despite its smaller size

Updated 11 months ago 76 runs