
clip-guided-diffusion
Generate image from text by guiding a denoising diffusion model. Inference is somewhat slow.

pyglide
The predecessor to DALLE-2, GLIDE (filtered) with faster PRK/PLMS sampling.

sd-aesthetic-guidance
Use stable diffusion and aesthetic CLIP embeddings to guide boring outputs to be more aesthetically pleasing.

retrieval-augmented-diffusion
Generate 768px images from text using CompVis `retrieval-augmented-diffusion`

glid-3-xl
CompVis `latent-diffusion text2im` finetuned for inpainting.
tortoise-tts
Generate speech from text, clone voices from mp3 files. From James Betker AKA "neonbjb".

ldm-autoedit

laionide-v4
GLIDE-text2im w/ humans and experimental style prompts.

mannequin-gan-3-electric-boogaloo-2
Guide a StyleGAN3 trained on pictures of mannequins with CLIP.