Explore

Featured models

anthropic / claude-3.5-sonnet

Anthropic's most intelligent language model to date, with a 200K token context window and image understanding (claude-3-5-sonnet-20241022)

6.8K runs

minimax / video-01-director

Generate videos with specific camera movements

1.1K runs

google / imagen-3-fast

A faster and cheaper Imagen 3 model, for when price or speed are more important than final image quality

25K runs

google / imagen-3

Google's highest quality text-to-image model, capable of generating images with detail, rich lighting and beauty

57.2K runs

deepseek-ai / deepseek-r1

A reasoning model trained with reinforcement learning, on par with OpenAI o1

311.4K runs

tencent / hunyuan-video

A state-of-the-art text-to-video generation model capable of creating high-quality videos with realistic motion from text descriptions

63.6K runs

playht / play-dialog

End-to-end AI speech model designed for natural-sounding conversational speech synthesis, with support for context-aware prosody, intonation, and emotional expression.

4.9K runs

zsxkib / mmaudio

Add sound to video. An advanced AI model that synthesizes high-quality audio from video content, enabling seamless video-to-audio transformation

118.6K runs

recraft-ai / recraft-v3

Recraft V3 (code-named red_panda) is a text-to-image model with the ability to generate long texts, and images in a wide list of styles. As of today, it is SOTA in image generation, proven by the Text-to-Image Benchmark by Artificial Analysis

1.2M runs

I want to…

Generate images

Models that generate images from text prompts

Generate videos

Models that create and edit videos

Caption images

Models that generate text from images

Transcribe speech

Models that convert speech to text

Generate text

Models that can understand and generate text

Upscale images

Upscaling models that create high-quality images from low-quality images

Use official models

Official models are always on, maintained, and have predictable pricing.

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Generate speech

Convert text to speech

Caption videos

Models that generate text from videos

Remove backgrounds

Models that remove backgrounds from images and videos

Use handy tools

Toolbelt-type models for videos and images.

Detect objects

Models that detect or segment objects in images and videos.

Generate music

Models to generate and modify music

Sing with voices

Voice-to-voice cloning and musical prosody

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Chat with images

Ask language models about images

Use a face to make images

Make realistic images of people instantly

Extract text from images

Optical character recognition (OCR) and text extraction

Get embeddings

Models that generate embeddings from inputs

Use the FLUX family of models

The FLUX family of text-to-image models from Black Forest Labs

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Edit images

Tools for manipulating images.

Popular models

bytedance/sdxl-lightning-4step

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 5 months ago 775.8M runs

salesforce/blip

Generate image captions

Updated 2 years, 4 months ago 150.3M runs

openai/whisper

Convert speech in audio to text

Updated 2 months, 3 weeks ago 65.5M runs

falcons-ai/nsfw_image_detection

Fine-Tuned Vision Transformer (ViT) for NSFW Image Classification

Updated 1 year, 2 months ago 52.8M runs

andreasjansson/clip-features

Return CLIP features for the clip-vit-large-patch14 model

Updated 1 year, 11 months ago 77.2M runs

abiruyt/text-extract-ocr

A simple OCR Model that can easily extract text from an image.

Updated 1 year, 3 months ago 89.8M runs

nightmareai/real-esrgan

Real-ESRGAN with optional face correction and adjustable upscale

Updated 6 months, 3 weeks ago 61.3M runs

stability-ai/sdxl

A text-to-image generative AI model that creates beautiful images

Updated 8 months, 3 weeks ago 75.8M runs

Latest models

dribnet/pixray-text2pixel

Turn any description into pixel art

Updated 2 years, 3 months ago 104K runs

andreasjansson/fmsynth

Implicit neural differentiable FM synthesizer

Updated 2 years, 3 months ago 390 runs

dribnet/pixray-genesis

Updated 2 years, 3 months ago 160.7K runs

dribnet/pixray-tiler

Turn any description into wallpaper tiles

Updated 2 years, 3 months ago 21.3K runs

dribnet/pixray

Pixray with custom settings

Updated 2 years, 3 months ago 59K runs

dribnet/pixray-api

Uses pixray with raw settings.

Updated 2 years, 3 months ago 29.2K runs

dribnet/8bidoug

A pixray tool for 24x24 pixelart

Updated 2 years, 3 months ago 1.6K runs

devxpy/glid-3-xl-stable

Stable diffusion, but with more powerful in-painting & out-painting capabilities

Updated 2 years, 3 months ago 13.4K runs

cjwbw/stable-diffusion-v1-5

stable-diffusion with v1-5 checkpoint

Updated 2 years, 3 months ago 35.5K runs

cjwbw/stable-diffusion-aesthetic-gradients

Stable Diffusion with Aesthetic Gradients

Updated 2 years, 3 months ago 356 runs

pschaldenbrand/text2video

Method for generating bizarre looking videos from a series of language descriptions of the video. From the Bot Intelligence Group at CMU: Peter Schaldenbrand, Zhixuan Liu, & Jean Oh

Updated 2 years, 3 months ago 8.4K runs

arielreplicate/dichotomous_image_segmentation

Highly Accurate Dichotomous Image Segmentation (ECCV 2022)

Updated 2 years, 3 months ago 4.5K runs

laion-ai/laionide

Generate images from text quickly. See https://replicate.com/afiaka87/laionide-v2 for a new checkpoint.

Updated 2 years, 4 months ago 7.3K runs

afiaka87/clip-guided-diffusion

Generate image from text by guiding a denoising diffusion model. Inference is somewhat slow.

Updated 2 years, 4 months ago 43K runs

afiaka87/pyglide

The predecessor to DALLE-2, GLIDE (filtered) with faster PRK/PLMS sampling.

Updated 2 years, 4 months ago 18.6K runs

laion-ai/laionide-v3

GLIDE finetuned on LAION5B, then more on curated datasets.

Updated 2 years, 4 months ago 62K runs

orpatashnik/styleclip

Text-Driven Manipulation of StyleGAN Imagery

Updated 2 years, 4 months ago 1.3M runs

bfirsh/segformer-b0-finetuned-ade-512-512

Updated 2 years, 4 months ago 1.1M runs

methexis-inc/img2aestheticscore

Updated 2 years, 4 months ago 237.7K runs

cjwbw/waifu-diffusion

Stable Diffusion on Danbooru images

Updated 2 years, 4 months ago 1.1M runs

anotherjesse/facial-landmark-detection

mediapipe facial landmark detection demo by Marlene Mhangami

Updated 2 years, 4 months ago 434 runs

eleazhong/strotss

image to image style transfer using STROTSS loss

Updated 2 years, 4 months ago 8.9K runs

bytedance/piano-transcription

high-resolution piano transcription system: detects piano notes from audio

Updated 2 years, 4 months ago 4.2K runs

cjwbw/stable-diffusion

stable-diffusion with negative prompts and more schedulers

Updated 2 years, 4 months ago 65.3K runs

pvitoria/chromagan

An Adversarial Approach for Picture Colorization

Updated 2 years, 4 months ago 337.8K runs

annahung31/emopia

Emotion-conditioned music generation using a transformer-based model.

Updated 2 years, 4 months ago 3.2K runs

ekgren/structureddreaming

Styledreams -- CLIP x Stylegan2

Updated 2 years, 4 months ago 4.1K runs

hzwer/iccv2019-learningtopaint

Teach Machines to Paint

Updated 2 years, 4 months ago 2.1K runs

iigroup/tedigan

Text-Guided Diverse Face Image Manipulation

Updated 2 years, 4 months ago 8.6K runs

pschaldenbrand/style-clip-draw

Styled text-to-drawing synthesis method.

Updated 2 years, 4 months ago 2.1K runs

allenhung1025/looptest

Four-bar drum loop generation

Updated 2 years, 4 months ago 53.3K runs

yoadtew/zero-shot-image-to-text

image to text generation

Updated 2 years, 4 months ago 6.6K runs

longguangwang/arbsr

Scale-Arbitrary Super-Resolution

Updated 2 years, 4 months ago 21.2K runs

jiupinjia/stylized-neural-painting-oil

Image to oil painting generation

Updated 2 years, 4 months ago 6.3K runs

ouhenio/stylegan3-clip

stylegan3 + clip

Updated 2 years, 4 months ago 6.9K runs

yuanxunlu/livespeechportraits

Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation

Updated 2 years, 4 months ago 9.8K runs

wonjongg/stylecarigan

Caricature Generation via StyleGAN Feature Map Modulation

Updated 2 years, 4 months ago 5.4K runs

raoumer/srrescgan

Intelligent image scaling to 4x resolution

Updated 2 years, 4 months ago 40.7K runs

huage001/adaattn

Arbitrary Neural Style Transfer

Updated 2 years, 4 months ago 357.4K runs

codeslake/ifan-defocus-deblur

Removes defocus blur in an image

Updated 2 years, 4 months ago 120.1K runs

cjwbw/whisper-downloadable-subtitles

Added downloadable subtitles for openai/whisper

Updated 2 years, 4 months ago 2.2K runs

jingyunliang/hcflow-sr

Image Super-Resolution

Updated 2 years, 4 months ago 222.3K runs

xinntao/esrgan

Image 4x super-resolution

Updated 2 years, 4 months ago 79.4K runs

kyrick/prompt-parrot

Prompt Parrot generates text2image prompts from finetuned distilgpt2

Updated 2 years, 4 months ago 251.6K runs

google-research/frame-interpolation

Frame Interpolation for Large Scene Motion

Updated 2 years, 4 months ago 271.1K runs

harmonai/dance-diffusion

Tools to train a generative model on arbitrary audio samples

Updated 2 years, 4 months ago 5.2K runs

tengfei-wang/hfgi

High-Fidelity GAN Inversion for Image Attribute Editing

Updated 2 years, 4 months ago 22.1K runs

eladrich/pixel2style2pixel

a StyleGAN Encoder for Image-to-Image Translation

Updated 2 years, 4 months ago 31.9K runs

yuval-alaluf/restyle_encoder

ReStyle: A Residual-Based StyleGAN Encoder via Iterative Refinement

Updated 2 years, 4 months ago 94.6K runs

mchong6/jojogan

JoJoGAN: One Shot Face Stylization

Updated 2 years, 4 months ago 496.1K runs
