Stable Diffusion XL specifically trained on Inpainting by huggingface
Generate sounds from a text prompt
Get the image embeddings from segement anything