Explore

openai/whisper
Convert speech in audio to text

cloneofsimo/lora
LoRA Inference model with Stable Diffusion

jagilley/controlnet-canny
Modify images using canny edge detection

stability-ai/stable-diffusion
A latent text-to-image diffusion model capable of generating photo-realistic images given any text input

salesforce/blip-2
Answers questions about images

timothybrooks/instruct-pix2pix
Edit images with human instructions
Collections
Audio generation
Models to generate and modify audio
riffusion/riffusion, allenhung1025/looptest, haoheliu/audio-ldm, andreasjansson/cantable-diffuguesion, harmonai/dance-diffusion...
ControlNet
Control diffusion models
jagilley/controlnet-scribble, jagilley/controlnet-hough, jagilley/controlnet-canny, jagilley/controlnet-hed, jagilley/controlnet-depth2img...
Diffusion models
Image and video generation models trained with diffusion processes
stability-ai/stable-diffusion, cjwbw/anything-v3-better-vae, cjwbw/waifu-diffusion, cjwbw/anything-v4.0, tommoore515/material_stable_diffusion...
Image restoration
Models that improve or restore images by deblurring, colorization, and removing noise
tencentarc/gfpgan, jingyunliang/swinir, microsoft/bringing-old-photos-back-to-life, cjwbw/bigcolor, yangxy/gpen...
Image to text
Models that generate text prompts and captions from images
salesforce/blip, methexis-inc/img2prompt, pharmapsychotic/clip-interrogator, rmokady/clip_prefix_caption, j-min/clip-caption-reward...
ML makeovers
Models that let you change facial features
orpatashnik/styleclip, yuval-alaluf/sam, rinongal/stylegan-nada, yuval-alaluf/restyle_encoder, mchong6/jojogan...
Style transfer
Models that take a content image and a style reference to produce a new image
paper11667/clipstyler, huage001/adaattn, ptran1203/pytorch-animegan, ariel415el/gpdm, jiupinjia/stylized-neural-painting-oil...
Super resolution
Upscaling models that create high-quality images from low-quality images
jingyunliang/swinir, nightmareai/real-esrgan, mv-lab/swin2sr, cjwbw/rudalle-sr, jingyunliang/hcflow-sr...
Text to image
Models that generate images from text prompts
stability-ai/stable-diffusion, pixray/text2image, cjwbw/waifu-diffusion, kuprel/min-dalle, laion-ai/erlich...
Videos
Models that create and edit videos
deforum/deforum_stable_diffusion, andreasjansson/stable-diffusion-animation, nateraw/stable-diffusion-videos, nightmareai/cogvideo, arielreplicate/stable_diffusion_infinite_zoom...
Popular models
tencentarc/gfpgan
Practical face restoration algorithm for *old photos* or *AI-generated faces*
prompthero/openjourney
Stable Diffusion fine tuned on Midjourney v4 images.
jagilley/controlnet-scribble
Generate detailed images from scribbled drawings
jingyunliang/swinir
Image Restoration Using Swin Transformer
sczhou/codeformer
Robust face restoration algorithm for old photos / AI-generated faces
stability-ai/stable-diffusion-inpainting
Fill in masked parts of images with Stable Diffusion
nightmareai/real-esrgan
Real-ESRGAN with optional face correction and adjustable upscale
Latest models
cloneofsimo/lora-advanced-training
LoRA model trainer, advanced version
cloneofsimo/lora-training
LoRA model trainer with presets for faces, objects, and styles
jagilley/controlnet-scribble
Generate detailed images from scribbled drawings
andreasjansson/tile-morph
Create tileable animations with seamless transitions
cloneofsimo/fad_v0_lora
LoRA, fp16 Foto-Assisted-Diffusion-FAD_V0
cjwbw/hard-prompts-made-easy
Gradient-Based Discrete Optimization for Prompt Tuning and Discovery
pollinations/3d-photo-inpainting
3D Photography using Context-aware Layered Depth Inpainting
daanelson/motion_diffusion_model
A diffusion model for generating human motion video from a text prompt
pollinations/modnet
A deep learning approach to remove background & adding new background image
pollinations/lucid-sonic-dreams-xl
Lucid Sonic Dreams syncs StyleGAN XL -generated visuals to music
tencentarc/animesr
Real-World Super-Resolution Models for Animation Videos
tonyllondon/goal_celebration_v1
A crazy model which maybe does nothing, maybe does something, but its supposed to be about football.
cjwbw/dreambooth-avatar
Dreambooth finetuning of Stable Diffusion (v1.5.1) on Avatar art style by Lambda Labs
cjwbw/gta5_artwork_diffusion
GTA5 Artwork Diffusion via Dreambooth
pollinations/lucid-sonic-dreams
Lucid Sonic Dreams syncs GAN-generated visuals to music
facebookresearch/cutler
Cut and Learn for unsupervised object detection and instance segmentation
daanelson/attend-and-excite
Attention-Based Semantic Guidance for Text-to-Image Diffusion Models
cjwbw/magifactory-t-shirt-diffusion
Generate t-shirt logos with stable-dfffusion
daanelson/stable-diffusion-long-prompts
img2img Stable Diffusion, but with longer prompts
haoheliu/audio-ldm
Text-to-audio generation with latent diffusion models
pollinations/stable-diffusion-dance
Audio Reactive Stable Diffusion
arielreplicate/deoldify_video
Add colours to old video footage.
mbentley124/openjourney-img2img
This is the OpenJourney img2img model
cjwbw/distilgpt2-stable-diffusion-v2
Descriptive stable diffusion prompts generation using GPT2
pollinations/real-basicvsr-video-superresolution
RealBasicVSR: Investigating Tradeoffs in Real-World Video Super-Resolution
arielreplicate/crestereo
High accuracy depth maps from pairs of stereo images
arielreplicate/gscorecam-clip-analyzer
Shows what CLIP looks at in an image given text
pollinations/tune-a-video
About Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
jagilley/stable-diffusion-depth2img
Create variations of an image while preserving shape and depth
22-hours/vintedois-diffusion
Generate beautiful images with simple prompts
timothybrooks/instruct-pix2pix
Edit images with human instructions
arielreplicate/instruct-pix2pix
Edit images with human instructions
cjwbw/anything-v4.0
high-quality, highly detailed anime-style Stable Diffusion models
pwntus/stable-diffusion-depth2img
Create variations of an image while preserving shape and depth.
pollinations/rife-video-interpolation
charlesfrye/text-recognizer-gpu
Detects one paragraph of text in an image.
stability-ai/stable-diffusion-inpainting
Fill in masked parts of images with Stable Diffusion
cjwbw/point-e
Point-E: A System for Generating 3D Point Clouds from Complex Prompts
cjwbw/anything-v3-better-vae
high-quality, highly detailed anime style stable-diffusion with better VAE