Explore

Featured models

black-forest-labs / flux-canny-pro

Professional edge-guided image generation. Control structure and composition using Canny edge detection

black-forest-labs / flux-fill-pro

Professional inpainting and outpainting model with state-of-the-art performance. Edit or extend images with natural, seamless results.

9.7K runs

black-forest-labs / flux-1.1-pro-ultra

FLUX1.1 [pro] in ultra and raw modes. Images are up to 4 megapixels. Use raw mode for realism.

937.6K runs

black-forest-labs / flux-redux-dev

Open-weight image variation model. Create new versions while preserving key elements of your original.

5.5K runs

recraft-ai / recraft-v3

Recraft V3 (code-named red_panda) is a text-to-image model with the ability to generate long texts, and images in a wide list of styles. As of today, it is SOTA in image generation, proven by the Text-to-Image Benchmark by Artificial Analysis

284.4K runs

davisbrown / flux-half-illustration

Flux lora, use "in the style of TOK" to trigger generation, creates half photo half illustrated elements

21.7K runs

I want to…

Generate images

Models that generate images from text prompts

Use a language model

Models that can understand and generate text

Upscale images

Upscaling models that create high-quality images from low-quality images

Caption images

Models that generate text from images

The FLUX family of models

The FLUX family of text-to-image models from Black Forest Labs

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Get embeddings

Models that generate embeddings from inputs

Extract text from images

Optical character recognition (OCR) and text extraction

Transcribe speech

Models that convert speech to text

Use handy tools

Toolbelt-type models for videos and images.

Chat with images

Ask language models about images

Edit images

Tools for manipulating images.

Use a face to make images

Make realistic images of people instantly

Flux fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Generate music

Models to generate and modify music

Generate videos

Models that create and edit videos

Generate speech

Convert text to speech

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Get structured data

Language models that support grammar-based decoding as well as jsonschema constraints.

Popular models

abiruyt/text-extract-ocr

A simple OCR Model that can easily extract text from an image.

Updated 1 year, 1 month ago 63.9M runs

bytedance/sdxl-lightning-4step

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 2 months, 1 week ago 575.9M runs

falcons-ai/nsfw_image_detection

Fine-Tuned Vision Transformer (ViT) for NSFW Image Classification

Updated 1 year ago 29M runs

openai/whisper

Convert speech in audio to text

Updated 4 hours ago 45.6M runs

salesforce/blip

Generate image captions

Updated 2 years, 1 month ago 108.5M runs

zf-kbot/sd-inpaint

Fill in masked parts of images with Stable Diffusion

Updated 2 months, 2 weeks ago 7.5M runs

andreasjansson/clip-features

Return CLIP features for the clip-vit-large-patch14 model

Updated 1 year, 8 months ago 64.4M runs

yorickvp/llava-13b

Visual instruction tuning towards large language and vision models with GPT-4 level capabilities

Updated 4 months, 1 week ago 19.2M runs

Latest models

zsxkib/animatediff-prompt-travel

🎨AnimateDiff Prompt Travel🧭 Seamlessly Navigate and Animate Between Text-to-Image Prompts for Dynamic Visual Narratives

Updated 1 year, 1 month ago 5.6K runs

abiruyt/text-extract-ocr

A simple OCR Model that can easily extract text from an image.

Updated 1 year, 1 month ago 63.9M runs

lunaon2023/repro31

Updated 1 year, 1 month ago 345 runs

jd7h/xmem

Video object segmentation for short and long videos

Updated 1 year, 1 month ago 26 runs

hudsongraeme/roadster

An SDXL fine-tune on the Tesla Roadster 2.0

Updated 1 year, 1 month ago 76 runs

lucataco/video-crafter

Open diffusion model for high-quality video generation

Updated 1 year, 1 month ago 10K runs

luosiallen/latent-consistency-model

Synthesizing High-Resolution Images with Few-Step Inference

Updated 1 year, 1 month ago 1.1M runs

anotherjesse/imagebind_batch

Batch mode for text & image embeddings

Updated 1 year, 1 month ago 68 runs

andreasjansson/musicgen-gabba

Updated 1 year, 1 month ago 25 runs

cjwbw/scalecrafter

Tuning-free Higher-Resolution Visual Generation with Diffusion Models

Updated 1 year, 1 month ago 1.1K runs

fofr/musicgen-slipknot

MusicGen fine tuned on Slipknot

Updated 1 year, 1 month ago 164 runs

fofr/musicgen-tron

MusicGen fine-tuned on Tron

Updated 1 year, 1 month ago 176 runs

fofr/musicgen-sonic

MusicGen fine-tuned on Sonic the Hedgehog 2

Updated 1 year, 1 month ago 316 runs

fofr/musicgen-choral

MusicGen fine-tuned on chamber choir music

Updated 1 year, 1 month ago 4.6K runs

adirik/deforum-kandinsky-2-2

Generate videos from text prompts with Kandinsky-2.2

Updated 1 year, 1 month ago 7.3K runs

andreasjansson/musicgen-bacharach-vox

MusicGen fine-tuned on Burt Bacharach's top hits

Updated 1 year, 1 month ago 52 runs

peter65374/sam-vit

SAM(Segment Anything) ViT-H image encoder

Updated 1 year, 1 month ago 278.4K runs

venkr/whisperx-diarization

Whisper-Large-V2 + Pyannote 3.0 diarization via WhisperX

Updated 1 year, 1 month ago 82 runs

replicategithubwc/anime-pastel-dream

Anime Pastel Dream Model For Splurge Art

Updated 1 year, 1 month ago 4.5K runs

replicategithubwc/neurogen

Neurogen Model for Splurge Art

Updated 1 year, 1 month ago 2.8K runs

replicategithubwc/dreamlike-anime

Dreamlike Anime 1.0 for Splurge Art

Updated 1 year, 1 month ago 5.1K runs

replicategithubwc/dreamlike-photoreal

Dreamlike Photoreal Model for Splurge Art

Updated 1 year, 1 month ago 2.9K runs

cjwbw/show-1

Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

Updated 1 year, 1 month ago 967 runs

nateraw/nous-hermes-llama2-awq

TheBloke/Nous-Hermes-Llama2-AWQ served with vLLM

Updated 1 year, 1 month ago 7.2K runs

bzikst/wav2vec2-large-xlsr-53-gender-recognition-librispeech

Gender recognition for audio files

Updated 1 year, 1 month ago 3.3K runs

andreasjansson/musicgen-bacharach-novox

Updated 1 year, 1 month ago 27 runs

samueltof/sdxl-gamo

Updated 1 year, 1 month ago 64 runs

peter65374/cog-resnet

cog-resnet example trial

Updated 1 year, 1 month ago 8 runs

sakemin/musicgen-toad

MusicGen fine-tuned on cover-songs by Toad from 'Super Mario' series. Text token : "by toad"

Updated 1 year, 1 month ago 47 runs

skripnik/call-transcriber

Make a transcription of a phone call

Updated 1 year, 1 month ago 10 runs

fofr/musicgen-mario

MusicGen fine-tuned on 8bit Super Mario Bros (1985)

Updated 1 year, 1 month ago 158 runs

sakemin/musicgen-newjeans-vocals

MusicGen trained on NewJeans with vocals

Updated 1 year, 1 month ago 241 runs

mjamroz/eva

Trained on plants

Updated 1 year, 1 month ago 28 runs

zeke/zisper

My own personal copy of daanelson/whisperx

Updated 1 year, 1 month ago 311 runs

lucataco/qwen-vl-chat

A multimodal LLM-based AI assistant, which is trained with alignment techniques. Qwen-VL-Chat supports more flexible interaction, such as multi-round question answering, and creative capabilities.

Updated 1 year, 1 month ago 791K runs