Generates speech from text
Text-Guided Image Generation and Manipulation
PyTorch version of Lightweight OpenPose as introduced in "Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose"
Modify images using line art
Modify images using canny edges
Modify images using sketches
Modify images using human pose
Modify images using depth maps
Inst-Inpaint: Instructing to Remove Objects with Diffusion Models
Zero-shot / open vocabulary object detection
Generate videos from text prompts with Kandinsky-2.2
Detect everything with language!
Generates 3D assets from images
Generate 3D assets using text descriptions
Detects objects in an image
Performs speaker identity verification
Generate texture for your mesh with text prompts
Kosmos-G: Generating Images in Context with Multimodal Large Language Models
Edit real or generated images
Editable image generation with MasaCtrl-SDXL
This model runs on T4.