Explore

Featured models

black-forest-labs / flux-canny-pro

Professional edge-guided image generation. Control structure and composition using Canny edge detection

black-forest-labs / flux-fill-pro

Professional inpainting and outpainting model with state-of-the-art performance. Edit or extend images with natural, seamless results.

8.1K runs

black-forest-labs / flux-1.1-pro-ultra

FLUX1.1 [pro] in ultra and raw modes. Images are up to 4 megapixels. Use raw mode for realism.

884.6K runs

black-forest-labs / flux-redux-dev

Open-weight image variation model. Create new versions while preserving key elements of your original.

5K runs

recraft-ai / recraft-v3

Recraft V3 (code-named red_panda) is a text-to-image model with the ability to generate long texts, and images in a wide list of styles. As of today, it is SOTA in image generation, proven by the Text-to-Image Benchmark by Artificial Analysis

277.7K runs

ibm-granite / granite-3.0-8b-instruct

Granite-3.0-8B-Instruct is a lightweight and open-source 8B parameter model is designed to excel in instruction following tasks such as summarization, problem-solving, text translation, reasoning, code tasks, function-calling, and more.

117.9K runs

I want to…

Generate images

Models that generate images from text prompts

Use a language model

Models that can understand and generate text

Upscale images

Upscaling models that create high-quality images from low-quality images

Caption images

Models that generate text from images

The FLUX family of models

The FLUX family of text-to-image models from Black Forest Labs

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Get embeddings

Models that generate embeddings from inputs

Extract text from images

Optical character recognition (OCR) and text extraction

Transcribe speech

Models that convert speech to text

Use handy tools

Toolbelt-type models for videos and images.

Chat with images

Ask language models about images

Edit images

Tools for manipulating images.

Use a face to make images

Make realistic images of people instantly

Flux fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Generate music

Models to generate and modify music

Generate videos

Models that create and edit videos

Generate speech

Convert text to speech

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Get structured data

Language models that support grammar-based decoding as well as jsonschema constraints.

Popular models

abiruyt/text-extract-ocr

A simple OCR Model that can easily extract text from an image.

Updated 1 year, 1 month ago 60.8M runs

bytedance/sdxl-lightning-4step

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 2 months, 1 week ago 574.7M runs

falcons-ai/nsfw_image_detection

Fine-Tuned Vision Transformer (ViT) for NSFW Image Classification

Updated 1 year ago 28.7M runs

salesforce/blip

Generate image captions

Updated 2 years, 1 month ago 108.4M runs

zf-kbot/sd-inpaint

Fill in masked parts of images with Stable Diffusion

Updated 2 months, 2 weeks ago 7.4M runs

nightmareai/real-esrgan

Real-ESRGAN with optional face correction and adjustable upscale

Updated 4 months ago 56.1M runs

andreasjansson/blip-2

Answers questions about images

Updated 1 year ago 26.4M runs

stability-ai/sdxl

A text-to-image generative AI model that creates beautiful images

Updated 6 months ago 70.6M runs

Latest models

dhanushreddy291/manmaru-mix-v3

Manmaru mix v3.0

Updated 10 months, 1 week ago 695 runs

tomasmcm/digital-socrates-13b

Source: allenai/digital-socrates-13b ✦ Quant: TheBloke/digital-socrates-13B-AWQ ✦ Digital Socrates is an open-source, automatic explanation-critiquing model

Updated 10 months, 1 week ago 17 runs

tomasmcm/towerinstruct-7b-v0.1

Source: Unbabel/TowerInstruct-7B-v0.1 ✦ Quant: TheBloke/TowerInstruct-7B-v0.1-AWQ ✦ This model is trained to handle several translation-related tasks, such as general machine translation, gramatical error correction, and paraphrase generation

Updated 10 months, 1 week ago 186 runs

csslc/ccsr

Improving the Stability of Diffusion Models for Content Consistent Super-Resolution

Updated 10 months, 1 week ago 3.2K runs

datacte/proteus-v0.1

ProteusV0.1 uses OpenDalleV1.1 as a base and further refines prompt adherence and stylistic capabilities to a measurable degree

Updated 10 months, 1 week ago 6.6K runs

batouresearch/sdxl-improved-refiner

Great image quality, good old SDXL with a new and improved Tile refiner.

Updated 10 months, 1 week ago 768 runs

chenxwh/video-retalking

Audio-based Lip Synchronization for Talking Head Video

Updated 10 months, 1 week ago 26K runs

dhanushreddy291/sdvn10-anime

SDVN10-Anime

Updated 10 months, 1 week ago 330 runs

gustavo-kuze/sdxl-mate

Updated 10 months, 1 week ago 199 runs

bluematter/sdxl-bluejeans

SDXL Trained on blue jeans

Updated 10 months, 1 week ago 250 runs

ness-ai/akabane-all-02

赤羽全域のモデルです！最新版なのでこちらをお使いください！

Updated 10 months, 1 week ago 1K runs

alexgenovese/bg-remover

Improved background remover 2.0 - GroundingDino + SAM + Inpainting SDXL + Controlnet Canny

Updated 10 months, 1 week ago 205 runs

erium/whisperx

Automatic Speech Recognition with Word-level Timestamps & Diarization

Updated 10 months, 1 week ago 3.2K runs

piddnad/ddcolor

Towards Photo-Realistic Image Colorization via Dual Decoders

Updated 10 months, 1 week ago 154.8K runs

tomasmcm/neuronovo-7b-v0.3

Source: Neuronovo/neuronovo-7B-v0.3 ✦ Quant: TheBloke/neuronovo-7B-v0.3-AWQ ✦ Neuronovo/neuronovo-7B-v0.3 model represents an advanced and fine-tuned version of a large language model, initially based on CultriX/MistralTrix-v1.

Updated 10 months, 1 week ago 37 runs

ieit-yuan/yuan2.0-2b

Yuan2.0 is a new generation LLM developed by IEIT System, enhanced the model's understanding of semantics, mathematics, reasoning, code, knowledge, and other aspects.

Updated 10 months, 1 week ago 384 runs

cswry/seesr

SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution

Updated 10 months, 1 week ago 48.3K runs

lucataco/pheme

Pheme generates a variety of conversational voices in 16 kHz for phone-call applications

Updated 10 months, 2 weeks ago 403 runs

farbodmehr/fcm4

Updated 10 months, 2 weeks ago 69 runs

cuuupid/sdxl-meow

make meow emojis!

Updated 10 months, 2 weeks ago 53 runs

internetcommunitycompany/lora-niji

90s anime

Updated 10 months, 2 weeks ago 21.3K runs

farbodmehr/fcm3

Updated 10 months, 2 weeks ago 183 runs

geeklab-ltd/app-icon-generator

Generates game icons, For full use: appiconlab.com

Updated 10 months, 2 weeks ago 214.3K runs

beautyyuyanli/multilingual-e5-large

multilingual-e5-large: A multi-language text embedding model

Updated 10 months, 2 weeks ago 10.5M runs

beautyyuyanli/multilingual-e5-base

multilingual-e5-base: A multi-language text embedding model

Updated 10 months, 2 weeks ago 5 runs

beautyyuyanli/multilingual-e5-small

multilingual-e5-small: A multi-language text embedding model

Updated 10 months, 2 weeks ago 11 runs

farbodmehr/fcm2

Updated 10 months, 2 weeks ago 40 runs

vidalfer/game-music-generator

Gerador de música para games condicionado a emoção feito para a disciplina de Residência em IA do Bacharelado em Inteligência Artificial-UFG

Updated 10 months, 2 weeks ago 179 runs

dhanushreddy291/amused-text-to-image

Amused is a lightweight text to image model based off of the muse architecture. Amused is particularly useful in applications that require a lightweight and fast model such as generating many images quickly at once.

Updated 10 months, 2 weeks ago 192 runs