Explore

andreasjansson/cantable-diffuguesion
Bach chorale generation and harmonization

sczhou/lednet
Joint Low-light Enhancement and Deblurring in the Dark

stability-ai/stable-diffusion
A latent text-to-image diffusion model capable of generating photo-realistic images given any text input

riffusion/riffusion
Stable Diffusion for real-time music generation

openai/whisper
Convert speech in audio to text

lambdal/text-to-pokemon
Generate Pokémon from a text description
Collections
Diffusion models
Image and video generation models trained with diffusion processes
stability-ai/stable-diffusion, cjwbw/waifu-diffusion, tommoore515/material_stable_diffusion, cjwbw/anything-v3.0, cjwbw/stable-diffusion-high-resolution...
Image to text
Models that generate text prompts and captions from images
salesforce/blip, methexis-inc/img2prompt, rmokady/clip_prefix_caption, pharmapsychotic/clip-interrogator, j-min/clip-caption-reward...
Super resolution
Upscaling models that create high-quality images from low-quality images
jingyunliang/swinir, nightmareai/real-esrgan, cjwbw/swin2sr, cjwbw/rudalle-sr, nightmareai/latent-sr...
Style transfer
Models that take a content image and a style reference to produce a new image
paper11667/clipstyler, ptran1203/pytorch-animegan, ariel415el/gpdm, jiupinjia/stylized-neural-painting-oil, huage001/adaattn...
ML makeovers
Models that let you change facial features
orpatashnik/styleclip, yuval-alaluf/sam, rinongal/stylegan-nada, yuval-alaluf/restyle_encoder, eladrich/pixel2style2pixel...
Image restoration
Models that improve or restore images by deblurring, colorization, and removing noise
tencentarc/gfpgan, jingyunliang/swinir, microsoft/bringing-old-photos-back-to-life, yangxy/gpen, google-research/maxim...
Text to image
Models that generate images from text prompts
stability-ai/stable-diffusion, pixray/text2image, cjwbw/waifu-diffusion, kuprel/min-dalle, laion-ai/erlich...
Popular models
andreasjansson/clip-features
Return CLIP features for the clip-vit-large-patch14 model
tencentarc/gfpgan
Practical face restoration algorithm for old photos or AI-generated faces
prompthero/openjourney
Stable Diffusion fine-tuned on Midjourney v4 images
stability-ai/stable-diffusion-inpainting
Fill in masked parts of images with Stable Diffusion
devxpy/cog-wav2lip
sczhou/codeformer
Robust face restoration algorithm for old photos or AI-generated faces
Latest models
arielreplicate/crestereo
High-accuracy depth maps from pairs of stereo images
arielreplicate/gscorecam-clip-analyzer
Shows what CLIP looks at in an image, given a text prompt
pollinations/real-basicvsr-video-superresolution
RealBasicVSR: Investigating Tradeoffs in Real-World Video Super-Resolution
cjwbw/cutler
Cut and Learn for Unsupervised Object Detection and Instance Segmentation
pollinations/tune-a-video
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
arielreplicate/yolox
High-performance, lightweight object detection models
jagilley/stable-diffusion-depth2img
Create variations of an image while preserving shape and depth