Get individual instruments/vocals from any music file
Create variations of an image while preserving shape and depth
Upscale images with Stable Diffusion
Modify images using canny edge detection
Modify images using HED maps
Generate detailed images from scribbled drawings
Modify images using semantic segmentation
Modify images using M-LSD line detection
Modify images using depth maps
Modify images using normal maps
Modify images of people using pose detection
Modify images with a prompt while preserving their structure
Change voice for spoken text
This model is not yet booted but is ready for API calls. Your first API call will boot the model and may take longer; subsequent responses will be fast.
This model runs on an NVIDIA T4 GPU.
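Because the first API call boots the model, a client may want a longer timeout for that call than for later ones. The following is a minimal sketch of that idea, not the platform's client: the `ColdStartClient` class, the `send` callable, and both timeout values are hypothetical names and numbers chosen for illustration, and `send` stands in for whatever function actually performs the request.

```python
from typing import Any, Callable


class ColdStartClient:
    """Use a generous timeout until one call has succeeded, then a short one.

    `send` is a hypothetical callable taking (payload, timeout_seconds);
    the default timeouts are illustrative assumptions, not documented limits.
    """

    def __init__(
        self,
        send: Callable[[Any, float], Any],
        cold_timeout: float = 300.0,
        warm_timeout: float = 30.0,
    ) -> None:
        self._send = send
        self._cold_timeout = cold_timeout
        self._warm_timeout = warm_timeout
        self._booted = False  # flips to True after the first successful call

    def call(self, payload: Any) -> Any:
        # Pick the timeout based on whether the model has already answered once.
        timeout = self._warm_timeout if self._booted else self._cold_timeout
        result = self._send(payload, timeout)
        # Only mark the model as booted if the request did not raise.
        self._booted = True
        return result
```

In use, `send` would wrap the real HTTP request; if the first call raises, the client keeps using the longer timeout on the next attempt.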