Explore

I want to…

Upscale images

Upscaling models that create high-quality images from low-quality images

Restore images

Models that improve or restore images by deblurring, colorization, and removing noise

Flux fine-tunes

Browse the diverse range of fine-tunes the community has custom-trained on Replicate

Make 3D stuff

Models that generate 3D objects, scenes, radiance fields, textures and multi-views.

Latest models

A 1.4B parameter text2im model from CompVis, finetuned on CLIP text embeds and curated data.

Updated 45.5K runs

Facial Expression Recognition using Residual Masking Network

Updated 15.2K runs

Transfer the texture/style of one image onto another

Updated 7.6K runs

A simple Hebrew Diacritizer

Updated 125 runs

Updated 269 runs

Detect Animals, Vehicles and Humans in Camera Trap Imagery

Updated 563 runs

Plugging Visual Controls in Text Generation

Updated 1.4K runs

Updated 15 runs

Practical Blind Denoising via Swin-Conv-UNet and Data Synthesis

Updated 23.5K runs

Music inpainting of melody and chords

Updated 8.5K runs

Nonlinear Activation Free Network for Image Restoration

Updated 1.3M runs

Zero shot Sound separation by arbitrary query samples

Updated 41.5K runs

Multi-Axis MLP for Image Processing

Updated 474.7K runs

Detect and simplify the contours of a binary image

Updated 221 runs

Super-resolves an LR video frame (ultra-wide) using a reference video frame (wide-angle)

Updated 14.3K runs

Homage to the Pixel: text prompt to 6 color squares

Updated 9.4K runs

[CVPR 2022] Unsupervised Image-to-Image Translation with Generative Prior

Updated 1.2K runs

bare pixray for API use

Updated 11.6K runs

Design Your Hair by Text and Reference Image

Updated 282.8K runs

A Steerable Model for Bach Chorales Generation

Updated 844 runs

One-shot (any-to-any) Voice Conversion

Updated 6.3K runs

Online demo for "Orthogonal Jacobian Regularization for Unsupervised Disentanglement in Image Generation"

Updated 1.9K runs

A model for testing pydantic cog that yields images one word at a time.

Updated 126 runs

Image Style Transfer with Text Condition

Updated 25.6K runs

Synthesize drawings to match a text prompt

Updated 5.5K runs

A model for testing pydantic cog that generates images.

Updated 409 runs

A test model that generates Haiku (and yields output one word a time)

Updated 104 runs

Clip-Guided Diffusion Model for Image Generation

Updated 4.5K runs

GLIDE-text2im w/ humans and experimental style prompts.

Updated 9.2K runs

Updated 565 runs

A fork of pixray/pixray for trying out Cog's new Predictor API

Updated 58 runs

Grad-CAM visualizations for Align before Fuse

Updated 3.6K runs

Updated 75 runs

GLIDE from OpenAI finetuned on roughly 30M more samples. See `laionide-v3` for the latest.

Updated 3.8K runs

Transcribes piano audio and makes it into a cool video

Updated 218 runs

Masked-attention Mask Transformer for Universal Image Segmentation

Updated 658 runs

Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement

Updated 358 runs

Updated 94 runs

Updated 147 runs

Generate Pokemons with Projected GAN

Updated 9.7K runs

Text-Guided Image Generation and Manipulation

Updated 824 runs

A Single Model for Many Visual Modalities

Updated 247 runs

Supervised Weakly from hashtAGs

Updated 294 runs

Classify numerical digits.

Updated 115 runs

democratizing automatic music transcription

Updated 3K runs

Uses pixray to generate an image from text prompt

Updated 148.4K runs

Training-Free Text-to-Image Generation

Updated 2.4K runs

Updated 1.7K runs

Updated 750 runs

Image generation from Wav2CLIP through VQGAN-CLIP

Updated 896 runs