Hyper FLUX 16-step by ByteDance
high-resolution piano transcription system: detects piano notes from audio
SDXL-Lightning by ByteDance: a fast text-to-image model that makes high-quality images in 4 steps
Domain Consistent Resolution Adapter for Diffusion Models: generating consistent images with resolutions outside of their trained domain
📖 PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Hyper FLUX 8-step by ByteDance
⚡️FLUX PuLID: FLUX-dev based Pure and Lightning ID Customization via Contrastive Alignment🎭
LatentSync: generate high-quality lip sync animations
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
🥯ByteDance Seed's Bagel Unified multimodal AI that generates images, edits images, and understands images in one 7B parameter model🥯
A video generation model that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 720p resolution
A pro version of Seedance that offers text-to-video and image-to-video support for 5s or 10s videos, at 480p and 1080p resolution
Document Image Parsing via Heterogeneous Anchor Prompting
A text-to-image model with support for native high-resolution (2K) image generation
Text-guided image editing model that preserves original details while making targeted modifications like lighting changes, object removal, and style conversion
This model is warm. You'll get a fast response if the model is warm and already running, and a slower response if the model is cold and starting up.
This model runs on H100. View more.