Explore

Featured models

anthropic / claude-3.5-sonnet

Anthropic's most intelligent language model to date, with a 200K token context window and image understanding (claude-3-5-sonnet-20241022)

6.8K runs

minimax / video-01-director

Generate videos with specific camera movements

1.1K runs

google / imagen-3-fast

A faster and cheaper Imagen 3 model, for when price or speed are more important than final image quality

25.1K runs

google / imagen-3

Google's highest quality text-to-image model, capable of generating images with detail, rich lighting and beauty

57.3K runs

deepseek-ai / deepseek-r1

A reasoning model trained with reinforcement learning, on par with OpenAI o1

311.7K runs

tencent / hunyuan-video

A state-of-the-art text-to-video generation model capable of creating high-quality videos with realistic motion from text descriptions

63.6K runs

playht / play-dialog

End-to-end AI speech model designed for natural-sounding conversational speech synthesis, with support for context-aware prosody, intonation, and emotional expression.

4.9K runs

zsxkib / mmaudio

Add sound to video. An advanced AI model that synthesizes high-quality audio from video content, enabling seamless video-to-audio transformation

118.6K runs

recraft-ai / recraft-v3

Recraft V3 (code-named red_panda) is a text-to-image model with the ability to generate long texts, and images in a wide list of styles. As of today, it is SOTA in image generation, proven by the Text-to-Image Benchmark by Artificial Analysis

1.2M runs

I want to…

Generate images

Models that generate images from text prompts

Generate videos

Models that create and edit videos

Caption images

Models that generate text from images

Transcribe speech

Models that convert speech to text

Generate text

Models that can understand and generate text

Upscale images

Upscaling models that create high-quality images from low-quality images

Use official models

Official models are always on, maintained, and have predictable pricing.

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Generate speech

Convert text to speech

Caption videos

Model s that generate text from videos

Remove backgrounds

Models that remove backgrounds from images and videos

Use handy tools

Toolbelt-type models for videos and images.

Detect objects

Models that detect or segment objects in images and videos.

Generate music

Models to generate and modify music

Sing with voices

Voice-to-voice cloning and musical prosody

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Chat with images

Ask language models about images

Use a face to make images

Make realistic images of people instantly

Extract text from images

Optical character recognition (OCR) and text extraction

Get embeddings

Models that generate embeddings from inputs

Use the FLUX family of models

The FLUX family of text-to-image models from Black Forest Labs

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Edit images

Tools for manipulating images.

Popular models

bytedance/sdxl-lightning-4step

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 5 months ago 775.8M runs

salesforce/blip

Generate image captions

Updated 2 years, 4 months ago 150.3M runs

openai/whisper

Convert speech in audio to text

Updated 2 months, 3 weeks ago 65.5M runs

falcons-ai/nsfw_image_detection

Fine-Tuned Vision Transformer (ViT) for NSFW Image Classification

Updated 1 year, 2 months ago 52.8M runs

andreasjansson/clip-features

Return CLIP features for the clip-vit-large-patch14 model

Updated 1 year, 11 months ago 77.2M runs

abiruyt/text-extract-ocr

A simple OCR Model that can easily extract text from an image.

Updated 1 year, 3 months ago 89.8M runs

nightmareai/real-esrgan

Real-ESRGAN with optional face correction and adjustable upscale

Updated 6 months, 3 weeks ago 61.3M runs

stability-ai/sdxl

A text-to-image generative AI model that creates beautiful images

Updated 8 months, 3 weeks ago 75.8M runs

Latest models

meta/omnivore

A Single Model for Many Visual Modalities

Updated 3 years ago 247 runs

meta/swag

Supervised Weakly from hashtAGs

Updated 3 years ago 294 runs

vganapati/mnist-classification

Classify numerical digits.

Updated 3 years, 1 month ago 115 runs

music-and-culture-technology-lab/omnizart

democratizing automatic music transcription

Updated 3 years, 2 months ago 3K runs

dribnet/pixray-text2pixel-0x42

Uses pixray to generate an image from text prompt

Updated 3 years, 2 months ago 148.4K runs

gnobitab/fusedream

Training-Free Text-to-Image Generation

Updated 3 years, 2 months ago 2.4K runs

dribnet/pixray-tiler-future

Updated 3 years, 2 months ago 1.7K runs

dribnet/bex-research-portfolio-code

Updated 3 years, 2 months ago 750 runs

hohsiangwu/wav2clip

Image generation from Wav2CLIP through VQGAN-CLIP

Updated 3 years, 2 months ago 896 runs

mistake0316/style-transfer-clip

Guide Style Transfer with CLIP loss

Updated 3 years, 2 months ago 1.3K runs

l3das/l3das22_challenge

Baseline models demo of the IEEE L3DAS22 Challenge

Updated 3 years, 3 months ago 228 runs

andreasjansson/cog-markdown-example

Simple example of a Cog model that produces Markdown output

Updated 3 years, 3 months ago 23 runs

huan/ding-dong

Huan's first cog with Replicate.AI

Updated 3 years, 3 months ago 35 runs

menghanxia/reversiblehalftoning

Deep Halftoning with Reversible Binary Pattern

Updated 3 years, 3 months ago 364 runs

afiaka87/mannequin-gan-3-electric-boogaloo-2

Guide a StyleGAN3 trained on pictures of mannequins with CLIP.

Updated 3 years, 3 months ago 850 runs

zeke/clip-age-prediction

Guesses your age based on a photo

Updated 3 years, 3 months ago 421 runs

andreasjansson/clip-age-predictor

Age prediction using CLIP

Updated 3 years, 3 months ago 528 runs

netease-gameai/spatchgan-selfie2anime

Selfie to anime

Updated 3 years, 3 months ago 3.1K runs

cjwbw/clip-guided-diffusion-pokemon

Generates pokemon sprites from prompt

Updated 3 years, 4 months ago 4.9K runs

avivga/zerodim

Disentangled face manipulation using CLIP-based annotations

Updated 3 years, 4 months ago 1.8K runs

avivga/overlord

Scaling-up Disentanglement for Image Translation

Updated 3 years, 4 months ago 984 runs

dribnet/clipit

Image generation with CLIP + VQGAN / PixelDraw

Updated 3 years, 4 months ago 6.7K runs

meta/ic_gan

Instance-Conditioned GAN

Updated 3 years, 4 months ago 26.7K runs

bfirsh/vqgan-clip

Generates images with VQGAN and CLIP

Updated 3 years, 4 months ago 6.6K runs

f90/wave-u-net-pytorch

Extracts "bass", "drums", "other" and "vocals" tracks from mixed audio track

Updated 3 years, 5 months ago 170 runs

x4nth055/emotion-recognition-using-speech

Building and training Speech Emotion Recognizer that predicts human emotions using Python, Sci-kit learn and Keras

Updated 3 years, 5 months ago 244 runs

minivision-ai/photo2cartoon

Photo to cartoon translation

Updated 3 years, 5 months ago 4K runs

zkx06111/wsrglow

A Glow-based Waveform Generative Model for Audio Super-Resolution. Intelligently upsamples audio by 2x resolution

Updated 3 years, 5 months ago 1.5K runs

minzwon/sota-music-tagging-models

PyTorch implementation of state-of-the-art music tagging models 🎶

Updated 3 years, 5 months ago 1.9K runs

huiyegit/t2i_cl

Text-to-image synthesis using contrastive learning

Updated 3 years, 5 months ago 1.2K runs

odegeasslbc/fastgan-pytorch

A Fast and Stable GAN for Small and High Resolution Imagesets

Updated 3 years, 6 months ago 3.3K runs

Featured models

{"icon": "SealCheck", "tooltipText": "Official models are always on, maintained, and have predictable pricing.", "anchorClass": "group-focus:text-r8-gray-1", "__flags": {"show-open-ai-api-instructions": false, "use-tiptap-text-input": false}} anthropic / claude-3.5-sonnet

{"icon": "SealCheck", "tooltipText": "Official models are always on, maintained, and have predictable pricing.", "anchorClass": "group-focus:text-r8-gray-1", "__flags": {"show-open-ai-api-instructions": false, "use-tiptap-text-input": false}} minimax / video-01-director

{"icon": "SealCheck", "tooltipText": "Official models are always on, maintained, and have predictable pricing.", "anchorClass": "group-focus:text-r8-gray-1", "__flags": {"show-open-ai-api-instructions": false, "use-tiptap-text-input": false}} google / imagen-3-fast

{"icon": "SealCheck", "tooltipText": "Official models are always on, maintained, and have predictable pricing.", "anchorClass": "group-focus:text-r8-gray-1", "__flags": {"show-open-ai-api-instructions": false, "use-tiptap-text-input": false}} google / imagen-3

{"icon": "SealCheck", "tooltipText": "Official models are always on, maintained, and have predictable pricing.", "anchorClass": "group-focus:text-r8-gray-1", "__flags": {"show-open-ai-api-instructions": false, "use-tiptap-text-input": false}} deepseek-ai / deepseek-r1