Explore

openai/whisper
Convert speech in audio to text

cloneofsimo/lora
LoRA Inference model with Stable Diffusion

jagilley/controlnet-canny
Modify images using canny edge detection

stability-ai/stable-diffusion
A latent text-to-image diffusion model capable of generating photo-realistic images given any text input

salesforce/blip-2
Answers questions about images

timothybrooks/instruct-pix2pix
Edit images with human instructions
Collections
Audio generation
Models to generate and modify audio
riffusion/riffusion, allenhung1025/looptest, haoheliu/audio-ldm, andreasjansson/cantable-diffuguesion, harmonai/dance-diffusion...
ControlNet
Control diffusion models
jagilley/controlnet-scribble, jagilley/controlnet-hough, jagilley/controlnet-canny, jagilley/controlnet-hed, jagilley/controlnet-depth2img...
Diffusion models
Image and video generation models trained with diffusion processes
stability-ai/stable-diffusion, cjwbw/anything-v3-better-vae, cjwbw/waifu-diffusion, cjwbw/anything-v4.0, tommoore515/material_stable_diffusion...
Image restoration
Models that improve or restore images by deblurring, colorization, and removing noise
tencentarc/gfpgan, jingyunliang/swinir, microsoft/bringing-old-photos-back-to-life, cjwbw/bigcolor, yangxy/gpen...
Image to text
Models that generate text prompts and captions from images
salesforce/blip, methexis-inc/img2prompt, pharmapsychotic/clip-interrogator, rmokady/clip_prefix_caption, j-min/clip-caption-reward...
ML makeovers
Models that let you change facial features
orpatashnik/styleclip, yuval-alaluf/sam, rinongal/stylegan-nada, yuval-alaluf/restyle_encoder, mchong6/jojogan...
Style transfer
Models that take a content image and a style reference to produce a new image
paper11667/clipstyler, huage001/adaattn, ptran1203/pytorch-animegan, ariel415el/gpdm, jiupinjia/stylized-neural-painting-oil...
Super resolution
Upscaling models that create high-quality images from low-quality images
jingyunliang/swinir, nightmareai/real-esrgan, mv-lab/swin2sr, cjwbw/rudalle-sr, jingyunliang/hcflow-sr...
Text to image
Models that generate images from text prompts
stability-ai/stable-diffusion, pixray/text2image, cjwbw/waifu-diffusion, kuprel/min-dalle, laion-ai/erlich...
Videos
Models that create and edit videos
deforum/deforum_stable_diffusion, andreasjansson/stable-diffusion-animation, nateraw/stable-diffusion-videos, nightmareai/cogvideo, arielreplicate/stable_diffusion_infinite_zoom...
Popular models
tencentarc/gfpgan
Practical face restoration algorithm for *old photos* or *AI-generated faces*
prompthero/openjourney
Stable Diffusion fine tuned on Midjourney v4 images.
jagilley/controlnet-scribble
Generate detailed images from scribbled drawings
jingyunliang/swinir
Image Restoration Using Swin Transformer
stability-ai/stable-diffusion-inpainting
Fill in masked parts of images with Stable Diffusion
sczhou/codeformer
Robust face restoration algorithm for old photos / AI-generated faces
andreasjansson/clip-features
Return CLIP features for the clip-vit-large-patch14 model
Latest models
philz1337/controlnet-deliberate
Modify images with canny edge detection and Deliberate model
cjwbw/unidiffuser
One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale
adithram/inkpunk-diffusion
Replicate port of https://huggingface.co/Envvi/Inkpunk-Diffusion. Finetuned Stable Diffusion model trained on dreambooth. Vaguely inspired by Gorillaz, FLCL, and Yoji Shinkawa.
nvlabs/prismer
A Vision-Language Model with An Ensemble of Experts
cloneofsimo/realistic_vision_v1.3
daanelson/real-esrgan-a100
Real-ESRGAN for image upscaling on an A100
dingman081130/jdmodel
andreasjansson/clip-features
Return CLIP features for the clip-vit-large-patch14 model
daanelson/flan-t5
A general model for language tasks like classification, summarization, and more.
cloneofsimo/analog_diffusion_lora
cloneofsimo/gta5_lora
cloneofsimo/inkpunk_lora
cloneofsimo/portraitplus_lora
https://huggingface.co/wavymulder/portraitplus
cloneofsimo/openjourney_v2_lora
cjwbw/supermarionation
Finetuned Stable-diffusion from Gerry Anderson Supermarionation
daanelson/mixture-of-diffusers
Generate an image by specifying a different text prompt for each region
omerbt/multidiffusion
Fusing Diffusion Paths for Controlled Image Generation
olegmelnik/deliberate
jagilley/controlnet
Modify images with a prompt while preserving their structure
jagilley/controlnet-pose
Modify images with humans using pose detection
jagilley/controlnet-canny
Modify images using canny edge detection
workroomprds/bartbooth1
Exploring the training and use of DreamBooth, with Bart as a subject
workroomprds/jamesbooth1
To explore stablebooth (training and prompts) trained on pictures of James!
cjwbw/pastel-mix
high-quality highly detailed anime stylized latent diffusion model
cjwbw/sd-x2-latent-upscaler
Stable Diffusion x2 latent upscaler
stability-ai/stable-diffusion
A latent text-to-image diffusion model capable of generating photo-realistic images given any text input
anotherjesse/dreambooth-batch
flores-o/sd-x2-latent-upscaler
matt-bornstein/nspm1
Fine-tuned model based on Nick St. Pierre's latest Midjourney model!
cjwbw/t2i-adapter
Learning Adapters towards Controllable for Text-to-Image Diffusion Models
mtg/music-classifiers
Transfer learning models for music classification by genres, moods, and instrumentation
daanelson/yolox
High performance and lightweight object detection models
daanelson/plug_and_play_image_translation
Edit an image using features from diffusion models