xiankgx / controlnet-tile-image-detailer

xiankgx / voice-tools
Text-to-speech with OpenAI or MetaVoice-1B and voice clone with OpenVoice v2

xiankgx / musetalk
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting

xiankgx / panda-70m-video-captioning
Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers

xiankgx / sdxl-evolution-0.1
Genetic algorithm like mixing of SDXL models

xiankgx / lcm-lora-sd-img2img
LCM-LoRA Lycon Dreamshaper8 img2img

xiankgx / saliency-crop
Saliency cropping using TranSalNet

xiankgx / video-retalking
VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing in the Wild

xiankgx / short-to-long-video-diffusion
SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction

xiankgx / lipsick
Fast, High Quality, Low Resource Lipsync Tool