Explore

Featured models

anthropic / claude-3.5-sonnet

Anthropic's most intelligent language model to date, with a 200K token context window and image understanding (claude-3-5-sonnet-20241022)

6.8K runs

minimax / video-01-director

Generate videos with specific camera movements

1.1K runs

google / imagen-3-fast

A faster and cheaper Imagen 3 model, for when price or speed is more important than final image quality

25.1K runs

google / imagen-3

Google's highest quality text-to-image model, capable of generating images with detail, rich lighting and beauty

57.3K runs

deepseek-ai / deepseek-r1

A reasoning model trained with reinforcement learning, on par with OpenAI o1

311.9K runs

tencent / hunyuan-video

A state-of-the-art text-to-video generation model capable of creating high-quality videos with realistic motion from text descriptions

63.6K runs

playht / play-dialog

End-to-end AI speech model designed for natural-sounding conversational speech synthesis, with support for context-aware prosody, intonation, and emotional expression.

4.9K runs

zsxkib / mmaudio

Add sound to video. An advanced AI model that synthesizes high-quality audio from video content, enabling seamless video-to-audio transformation

118.6K runs

recraft-ai / recraft-v3

Recraft V3 (code-named red_panda) is a text-to-image model with the ability to generate long texts, and images in a wide list of styles. As of today, it is SOTA in image generation, proven by the Text-to-Image Benchmark by Artificial Analysis

1.2M runs

I want to…

Generate images

Models that generate images from text prompts

Generate videos

Models that create and edit videos

Caption images

Models that generate text from images

Transcribe speech

Models that convert speech to text

Generate text

Models that can understand and generate text

Upscale images

Upscaling models that create high-quality images from low-quality images

Use official models

Official models are always on, maintained, and have predictable pricing.

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Enhance videos

Models that enhance videos with super-resolution, sound effects, motion capture and other useful production effects.

Generate speech

Convert text to speech

Caption videos

Models that generate text from videos

Remove backgrounds

Models that remove backgrounds from images and videos

Use handy tools

Toolbelt-type models for videos and images.

Detect objects

Models that detect or segment objects in images and videos.

Generate music

Models to generate and modify music

Sing with voices

Voice-to-voice cloning and musical prosody

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Chat with images

Ask language models about images

Use a face to make images

Make realistic images of people instantly

Extract text from images

Optical character recognition (OCR) and text extraction

Get embeddings

Models that generate embeddings from inputs

Use the FLUX family of models

The FLUX family of text-to-image models from Black Forest Labs

Use FLUX fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Control image generation

Guide image generation with more than just text. Use edge detection, depth maps, and sketches to get the results you want.

Edit images

Tools for manipulating images.

Popular models

bytedance/sdxl-lightning-4step

SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps

Updated 5 months ago 775.8M runs

salesforce/blip

Generate image captions

Updated 2 years, 4 months ago 150.3M runs

openai/whisper

Convert speech in audio to text

Updated 2 months, 3 weeks ago 65.5M runs

falcons-ai/nsfw_image_detection

Fine-Tuned Vision Transformer (ViT) for NSFW Image Classification

Updated 1 year, 2 months ago 52.8M runs

andreasjansson/clip-features

Return CLIP features for the clip-vit-large-patch14 model

Updated 1 year, 11 months ago 77.2M runs

abiruyt/text-extract-ocr

A simple OCR Model that can easily extract text from an image.

Updated 1 year, 3 months ago 89.8M runs

nightmareai/real-esrgan

Real-ESRGAN with optional face correction and adjustable upscale

Updated 6 months, 3 weeks ago 61.3M runs

stability-ai/sdxl

A text-to-image generative AI model that creates beautiful images

Updated 8 months, 3 weeks ago 75.8M runs

Latest models

cjwbw/mindall-e

Text-to-image generation

Updated 2 years, 6 months ago 1.8K runs

cjwbw/vq-diffusion

VQ-Diffusion for Text-to-Image Synthesis

Updated 2 years, 6 months ago 20.7K runs

pollinations/adampi

Create a 3D photo from single in-the-wild 2D images

Updated 2 years, 6 months ago 5.7K runs

ravespaceio/min-dalle

Updated 2 years, 6 months ago 6.8K runs

nightmareai/latent-viz

Visualize the encoded latents of an image

Updated 2 years, 6 months ago 72.7K runs

laion-ai/ongo

Generate a painting using text.

Updated 2 years, 6 months ago 133.6K runs

laion-ai/erlich

Generate a logo using text.

Updated 2 years, 6 months ago 348.8K runs

afiaka87/glid-3-xl

CompVis `latent-diffusion text2im` finetuned for inpainting.

Updated 2 years, 6 months ago 8K runs

cjwbw/compositional-vsual-generation-with-composable-diffusion-models-pytorch

Composable Diffusion

Updated 2 years, 6 months ago 845 runs

cjwbw/micromotion-stylegan

Decoding Micromotion in Low-dimensional Latent Spaces from StyleGAN

Updated 2 years, 6 months ago 8.2K runs

cjwbw/clip-gen

Language-Free Training of a Text-to-Image Generator with CLIP

Updated 2 years, 6 months ago 957 runs

cjwbw/bigcolor

Colorization using a Generative Color Prior for Natural Images

Updated 2 years, 6 months ago 553.6K runs

cjwbw/global_tracking_transformers

Global Tracking Transformers

Updated 2 years, 6 months ago 148 runs

cjwbw/vqfr

Blind Face Restoration with Vector-Quantized Dictionary and Parallel Decoder

Updated 2 years, 6 months ago 140K runs

cjwbw/diffae

Image Manipulation with Diffusion Autoencoders

Updated 2 years, 6 months ago 17.1K runs

kuprel/min-dalle

Fast, minimal port of DALL·E Mini to PyTorch

Updated 2 years, 6 months ago 504.8K runs

afiaka87/tortoise-tts

Generate speech from text, clone voices from mp3 files. From James Betker AKA "neonbjb".

Updated 2 years, 6 months ago 167.3K runs

tencentarc/vqfr

Blind Face Restoration with Vector-Quantized Dictionary and Parallel Decoder

Updated 2 years, 6 months ago 180.4K runs

ml6/julius

Generate a collection of logos based on your text input. Use longer and more detailed inputs for better results. The first time it takes a few minutes to load the model. Subsequent generations are much faster.

Updated 2 years, 6 months ago 4.4K runs

jarrentwu1031/ccpl

Contrastive Coherence Preserving Loss for Versatile Style Transfer

Updated 2 years, 6 months ago 1.9K runs

nightmareai/k-diffusion

CLIP Guided latent k-diffusion

Updated 2 years, 6 months ago 7.4K runs

nightmareai/disco-diffusion

Generate images using a variety of techniques - Powered by Discoart

Updated 2 years, 6 months ago 64.5K runs

nightmareai/majesty-diffusion

Generate images from text using CLIP guided latent diffusion

Updated 2 years, 6 months ago 8.3K runs

nightmareai/cogvideo

Text-to-video generation

Updated 2 years, 6 months ago 32.8K runs

sanzgiri/cartoonify_video

Cartoonifies a video

Updated 2 years, 6 months ago 14.4K runs

sanzgiri/cartoonify

Cartoonifies an image

Updated 2 years, 6 months ago 4.4K runs

nicholascelestin/real-esrgan-nitroviper

DO NOT USE - Broken - Only Public For API Usage & Debugging

Updated 2 years, 7 months ago 5.4K runs

nightmareai/latent-sr

Upscale images with the latent diffusion superresolution model

Updated 2 years, 7 months ago 115.3K runs

laion-ai/deep-image-diffusion-prior

Generate an image using text by visualizing CLIP features.

Updated 2 years, 7 months ago 1.1K runs

evilstreak/clipdraw-interactive

Morphs vector paths towards a text prompt

Updated 2 years, 7 months ago 183.9K runs

laion-ai/puck

Generate retro videogame art using text.

Updated 2 years, 7 months ago 4.9K runs

wyhsirius/lia

Learning to Animate Images via Latent Space Navigation

Updated 2 years, 7 months ago 20.3K runs

fenglinglwb/large-hole-image-inpainting

MAT: Mask-Aware Transformer for Large Hole Image Inpainting

Updated 2 years, 7 months ago 16.9K runs

renyurui/controllable-person-synthesis

Human pose manipulation for fashion

Updated 2 years, 7 months ago 3.5K runs

davidgillsjo/srw-net

Semantic Room Wireframe

Updated 2 years, 7 months ago 3.1K runs

nightmareai/arf-svox2

Artistic Radiance Fields - Transfer the style of an image to a 3D scene (NeRF)

Updated 2 years, 7 months ago 16K runs

storymy/take-off-eyeglasses

Remove eyeglasses and shadows from photo

Updated 2 years, 7 months ago 35.9K runs

nicholascelestin/latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Updated 2 years, 7 months ago 5.6K runs

winycg/anchor_net

Localizing Semantic Patches for Accelerating Image Classification

Updated 2 years, 7 months ago 220 runs

wzx0826/lbnet

Lightweight Bimodal Network for Single-Image Super-Resolution via Symmetric CNN and Recursive Transformer

Updated 2 years, 7 months ago 7K runs

afiaka87/ldm-autoedit

Updated 2 years, 8 months ago 1.5K runs

nicholascelestin/glid-3

Generate images quickly with GLID-3 (non-xl)

Updated 2 years, 8 months ago 3.9K runs

nicholascelestin/dalle-mega

Made public only for API calls. Use min-dalle instead; it's superior.

Updated 2 years, 8 months ago 12.1K runs

borisdayma/dalle-mini

Generate images from a text prompt

Updated 2 years, 8 months ago 58.3K runs

vis-opt-group/sci

Low-Light Image Enhancement

Updated 2 years, 8 months ago 12K runs

elleo/uk-petition-generator

Generate petitions suitable for sending to the UK government

Updated 2 years, 8 months ago 108 runs

j-min/clip-caption-reward

Fine-grained Image Captioning with CLIP Reward

Updated 2 years, 8 months ago 296.1K runs

pixray/text2image-future

pixray text2image (future branch)

Updated 2 years, 8 months ago 24.9K runs

cjwbw/face-align-cog

Face alignment using stylegan-encoding

Updated 2 years, 8 months ago 4.7K runs

mchong6/gans-n-roses

Convert image or video of your face to anime

Updated 2 years, 8 months ago 4.8K runs
