high-resolution piano transcription system: detects piano notes from audio
🥯ByteDance Seed's Bagel Unified multimodal AI that generates images, edits images, and understands images in one 7B parameter model🥯
Document Image Parsing via Heterogeneous Anchor Prompting
⚡️FLUX PuLID: FLUX-dev based Pure and Lightning ID Customization via Contrastive Alignment🎭
Hyper FLUX 16-step by ByteDance
Hyper FLUX 8-step by ByteDance
LatentSync: generate high-quality lip sync animations
📖 PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Domain Consistent Resolution Adapter for Diffusion Models: generating consistent images with resolutions outside of their trained domain
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps
A video generation model that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 720p resolution
A pro version of Seedance that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 1080p resolution
A text-to-image model with support for native high-resolution (2K) image generation
This model is cold. You'll get a fast response if the model is warm and already running, and a slower response if the model is cold and starting up.
This model runs on T4. View more.