zsxkib/pulid
📖 PuLID: Pure and Lightning ID Customization via Contrastive Alignment
zsxkib/talknet-asd
🗣️ TalkNet-ASD: Detect who is speaking in a video
zsxkib/instant-id
Make realistic images of real people instantly
zsxkib/flash-face
FlashFace: Human Image Personalization with High-fidelity Identity Preservation
zsxkib/prototype-model
A test model (instantid)
zsxkib/animate-diff-scene-assembler
Dkamacho’s Scene Assembler
zsxkib/yolo-world
Real-Time Open-Vocabulary Object Detection
zsxkib/uform-gen
🖼️ Super fast 1.5B Image Captioning/VQA Multimodal LLM (Image-to-Text) 🖋️
zsxkib/moore-animateanyone
Unofficial Re-Trained AnimateAnyone (Image + DWPose Video → Animated Video of Image)
zsxkib/trocr-base-handwritten
🖋️➡️📱Converts handwritten text images into digital text
zsxkib/patch-fusion
Super High Quality Depth Maps 🗺️: An End-to-End Tile-Based Framework 🏗️ for High-Resolution Monocular Metric Depth Estimation 🔍📏
zsxkib/tortoise-then-rvc
zsxkib/create-rvc-dataset
Create your own Realistic Voice Cloning (RVC v2) dataset using a YouTube link
zsxkib/realistic-voice-cloning
Create song covers with any RVC v2 trained AI voice from audio files.
zsxkib/stable-diffusion-safety-checker
Identifies NSFW images
zsxkib/animatediff-illusions
Monster Labs' ControlNet QR Code Monster v2 for SD-1.5 on top of AnimateDiff Prompt Travel (Motion Module SD 1.5 v2)
zsxkib/film-frame-interpolation-for-large-motion
FILM: Frame Interpolation for Large Motion (ECCV 2022)
zsxkib/prototype-model2
zsxkib/animatediff-prompt-travel
🎨AnimateDiff Prompt Travel🧭 Seamlessly Navigate and Animate Between Text-to-Image Prompts for Dynamic Visual Narratives
zsxkib/diffbir
✨DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior
zsxkib/st-mfnet
📽️ Increase Framerate 🎬 ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation
zsxkib/animate-diff
🎨 AnimateDiff (w/ MotionLoRAs for Panning, Zooming, etc): Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
zsxkib/draggan
🐲 DragGAN 🐉 - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold"
zsxkib/lil-flan-bias-logits-warper
Logit Warping via Biases for Google's FLAN-T5-small
zsxkib/clip-age-predictor
Age prediction using CLIP - Patched version of `https://replicate.com/andreasjansson/clip-age-predictor` that works with the new version of cog!
zsxkib/emotion2color
Transform your text into a beautiful two-tone color gradient that represents your emotions.
zsxkib/hello-world
A "Hello World" model for me to get to grips with `cog` and Replicate
zsxkib/aya-101
📚 Aya, an LLM by Cohere capable of understanding and generating content in 101 languages 🗣️
zsxkib/illuminati-diffusion
🧿 Illuminati Diffusion w/ Textual Inversion Embeddings 🧬
zsxkib/animate-diff-prompt-walking
zsxkib/open-sora