Explore – Replicate

Explore

Featured models

minimax / video-01

Generate 6s videos with prompts or images. (Also known as Hailuo)

black-forest-labs / flux-fill-pro

Professional inpainting and outpainting model with state-of-the-art performance. Edit or extend images with natural, seamless results.

black-forest-labs / flux-1.1-pro-ultra

FLUX1.1 [pro] in ultra and raw modes. Images are up to 4 megapixels. Use raw mode for realism.

black-forest-labs / flux-redux-dev

Open-weight image variation model. Create new versions while preserving key elements of your original.

recraft-ai / recraft-v3

Recraft V3 (code-named red_panda) is a text-to-image model with the ability to generate long texts, and images in a wide list of styles. As of today, it is SOTA in image generation, proven by the Text-to-Image Benchmark by Artificial Analysis

davisbrown / flux-half-illustration

Flux lora, use "in the style of TOK" to trigger generation, creates half photo half illustrated elements

I want to…

Generate images

Models that generate images from text prompts

Use a language model

Models that can understand and generate text

Upscale images

Upscaling models that create high-quality images from low-quality images

Caption images

Models that generate text from images

The FLUX family of models

The FLUX family of text-to-image models from Black Forest Labs

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Get embeddings

Models that generate embeddings from inputs

Extract text from images

Optical character recognition (OCR) and text extraction

Transcribe speech

Models that convert speech to text

Use handy tools

Toolbelt-type models for videos and images.

Chat with images

Ask language models about images

Edit images

Tools for manipulating images.

Use a face to make images

Make realistic images of people instantly

Flux fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Generate music

Models to generate and modify music

Generate videos

Models that create and edit videos

Generate speech

Convert text to speech

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Get structured data

Language models that support grammar-based decoding as well as jsonschema constraints.

Popular models

bytedance/sdxl-lightning-4step

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 2 months, 1 week ago 578.3M runs

abiruyt/text-extract-ocr

A simple OCR Model that can easily extract text from an image.

Updated 1 year, 1 month ago 67.1M runs

falcons-ai/nsfw_image_detection

Fine-Tuned Vision Transformer (ViT) for NSFW Image Classification

Updated 1 year ago 29.6M runs

openai/whisper

Convert speech in audio to text

Updated 2 days, 4 hours ago 46M runs

andreasjansson/clip-features

Return CLIP features for the clip-vit-large-patch14 model

Updated 1 year, 8 months ago 64.5M runs

zf-kbot/sd-inpaint

Fill in masked parts of images with Stable Diffusion

Updated 2 months, 3 weeks ago 7.8M runs

salesforce/blip

Generate image captions

Updated 2 years, 2 months ago 108.8M runs

nightmareai/real-esrgan

Real-ESRGAN with optional face correction and adjustable upscale

Updated 4 months ago 56.2M runs

Latest models

jack000/glid-3-xl

A 1.4B parameter text2im model from CompVis, finetuned on CLIP text embeds and curated data.

Updated 2 years, 6 months ago 45.5K runs

phamquiluan/facial-expression-recognition

Facial Expression Recognition using Residual Masking Network

Updated 2 years, 6 months ago 15.2K runs

nkolkin13/neuralneighborstyletransfer

Transfer the texture/style of one image onto another

Updated 2 years, 6 months ago 7.6K runs

elazarg/nakdimon

A simple Hebrew Diacritizer

Updated 2 years, 6 months ago 125 runs

microsoft/kid

Updated 2 years, 6 months ago 269 runs

bencevans/megadetector-v4.1

Detect Animals, Vehicles and Humans in Camera Trap Imagery

Updated 2 years, 6 months ago 563 runs

yxuansu/magic

Plugging Visual Controls in Text Generation

Updated 2 years, 6 months ago 1.4K runs

zeke/cog-markdown-example

Updated 2 years, 6 months ago 15 runs

cszn/scunet

Practical Blind Denoising via Swin-Conv-UNet and Data Synthesis

Updated 2 years, 6 months ago 23.5K runs

andreasjansson/music-inpainting-bert

Music inpainting of melody and chords

Updated 2 years, 6 months ago 8.5K runs

megvii-research/nafnet

Nonlinear Activation Free Network for Image Restoration

Updated 2 years, 7 months ago 1.3M runs

retrocirce/zero_shot_audio_source_separation

Zero shot Sound separation by arbitrary query samples

Updated 2 years, 7 months ago 41.5K runs

google-research/maxim

Multi-Axis MLP for Image Processing

Updated 2 years, 7 months ago 474.7K runs

ariel415el/simplify_contours

Detect and simplify the contours of a binary image

Updated 2 years, 7 months ago 221 runs

codeslake/refvsr-cvpr2022

Super-resolves an LR video frame (ultra-wide) using a reference video frame (wide-angle)

Updated 2 years, 7 months ago 14.3K runs

dribnet/homage1

Homage to the Pixel: text prompt to 6 color squares

Updated 2 years, 7 months ago 9.4K runs

williamyang1991/gp-unit

[CVPR 2022] Unsupervised Image-to-Image Translation with Generative Prior

Updated 2 years, 7 months ago 1.2K runs

pixray/api

bare pixray for API use

Updated 2 years, 8 months ago 11.6K runs

wty-ustc/hairclip

Design Your Hair by Text and Reference Image

Updated 2 years, 8 months ago 282.8K runs

ghadjeres/deepbach

A Steerable Model for Bach Chorales Generation

Updated 2 years, 8 months ago 844 runs

wendison/vqmivc

One-shot (any-to-any) Voice Conversion

Updated 2 years, 8 months ago 6.3K runs

csyxwei/orojar

Online demo for "Orthogonal Jacobian Regularization for Unsupervised Disentanglement in Image Generation"

Updated 2 years, 8 months ago 1.9K runs

zeke/haiku-progressive-image

A model for testing pydantic cog that yields images one word at a time.

Updated 2 years, 8 months ago 126 runs

paper11667/clipstyler

Image Style Transfer with Text Condition

Updated 2 years, 8 months ago 25.6K runs

kvfrans/clipdraw

Synthesize drawings to match a text prompt

Updated 2 years, 8 months ago 5.5K runs

zeke/haiku-image

A model for testing pydantic cog that generates images.

Updated 2 years, 8 months ago 409 runs

zeke/haiku-progressive

A test model that generates Haiku (and yields output one word a time)

Updated 2 years, 8 months ago 104 runs

cjwbw/clip-guided-diffusion

Clip-Guided Diffusion Model for Image Generation

Updated 2 years, 8 months ago 4.5K runs

afiaka87/laionide-v4

GLIDE-text2im w/ humans and experimental style prompts.

Updated 2 years, 8 months ago 9.2K runs

andreasjansson/counter

Updated 2 years, 8 months ago 565 runs

zeke/pydantic-pixray

A fork of pixray/pixray for trying out Cog's new Predictor API

Updated 2 years, 8 months ago 58 runs

salesforce/albef

Grad-CAM visualizations for Align before Fuse

Updated 2 years, 8 months ago 3.6K runs

bfirsh/wave-u-net-pytorch

Updated 2 years, 8 months ago 75 runs

laion-ai/laionide-v2

GLIDE from OpenAI finetuned on roughly 30M more samples. See `laionide-v3` for the latest.

Updated 2 years, 9 months ago 3.8K runs

jxmorris12/piano-transcription

Transcribes piano audio and makes it into a cool video

Updated 2 years, 9 months ago 218 runs

meta/mask2former

Masked-attention Mask Transformer for Universal Image Segmentation

Updated 2 years, 9 months ago 658 runs

yuguochencuc/db-aiat

Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement

Updated 2 years, 9 months ago 358 runs

yoadtew/arithmetic

Updated 2 years, 9 months ago 94 runs

yoadtew/test

Updated 2 years, 9 months ago 147 runs

xl-sr/projected_gan

Generate Pokemons with Projected GAN

Updated 2 years, 9 months ago 9.7K runs

adirik/stylemc-old

Text-Guided Image Generation and Manipulation

Updated 2 years, 9 months ago 824 runs

meta/omnivore

A Single Model for Many Visual Modalities

Updated 2 years, 9 months ago 247 runs

meta/swag

Supervised Weakly from hashtAGs

Updated 2 years, 10 months ago 294 runs

vganapati/mnist-classification

Classify numerical digits.

Updated 2 years, 10 months ago 115 runs

music-and-culture-technology-lab/omnizart

democratizing automatic music transcription

Updated 2 years, 11 months ago 3K runs

dribnet/pixray-text2pixel-0x42

Uses pixray to generate an image from text prompt

Updated 2 years, 11 months ago 148.4K runs

gnobitab/fusedream

Training-Free Text-to-Image Generation

Updated 2 years, 11 months ago 2.4K runs

dribnet/pixray-tiler-future

Updated 2 years, 11 months ago 1.7K runs

dribnet/bex-research-portfolio-code

Updated 2 years, 11 months ago 750 runs

hohsiangwu/wav2clip

Image generation from Wav2CLIP through VQGAN-CLIP

Updated 2 years, 11 months ago 896 runs