lucataco

Luis C.
demofusion-enhance
Image to Image enhancer using DemoFusion

vid2openpose
Video to OpenPose

magic-animate-openpose
MagicAnimate using an OpenPose input video

playground-v2
Playground v2 is a diffusion-based text-to-image generative model trained from scratch. Try out all 3 models here

vid2densepose
Convert your videos to DensePose and use it with MagicAnimate

style-aligned
GoogleAI: Style Aligned Image Generation via Shared Attention

magic-animate
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

cross-image-attention
Given two images depicting a source structure and a target appearance, generate an image merging the structure of one image with the appearance of the other

pixart-xl-2
PixArt-Alpha 1024px is a transformer-based text-to-image diffusion system trained on text embeddings from T5

pixart-lcm-xl-2
PixArt-Alpha LCM is a transformer-based text-to-image diffusion system trained on text embeddings from T5

demofusion
DemoFusion: Democratising High-Resolution Image Generation With No 💰

sdxl-img-blend
SDXL Image Blending

interpany-clearer
InterpAny-Clearer: Clearer anytime frame interpolation & Manipulated interpolation

xtts-v2
Coqui XTTS-v2: Multilingual Text To Speech Voice Cloning

controlnet-tile
Controlnet v1.1 - Tile Version

real-esrgan-video
Real-ESRGAN Video Upscaler

seine
Image-to-video - SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction

svd
Research-only StabilityAI's Stable Video Diffusion

nsfw_image_detection
Falcons.ai Fine-Tuned Vision Transformer (ViT) for NSFW Image Classification

animate-diff-sdxl-lcm
Animate Your Personalized Text-to-Image Diffusion Models with SDXL and LCM

vseq2vseq
Text to video diffusion model with variable length frame conditioning for infinite length video

dreamshaper7-img2img-lcm
Dreamshaper-7 img2img with LCM LoRA for faster inference

realvisxl2-lcm
RealvisXL-v2.0 with LCM LoRA - requires fewer steps (4 to 8 instead of the original 40 to 50)

modelscope-facefusion
Auto fuse a user's face onto the template image, with a similar appearance to the user

ip_adapter-face-inpaint
A combination of ip_adapter SDv1.5 and mediapipe-face to inpaint a face

sdxl-niji-se
SDXL_Niji_Special Edition

sdxl-lcm-zeke
A fine-tuned SDXL-LCM LoRA based on the photos of Zeke

sdxl-lcm
Latent Consistency Model (LCM) for SDXL: distills the original model into a version that requires fewer steps (4 to 8 instead of the original 25 to 50)

ip_adapter-sdxl-face
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate SDXL images with an image prompt

sdxl-lcm-loras
POC of SDXL-LCM LoRA combined with Replicate LoRA for 4-second inference times

lcm-ssd-1b
Latent Consistency Model (LCM) for SSD-1B: an LCM-distilled version that reduces the number of inference steps needed to only 2 to 8

ip_adapter-face
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate SDv1.5 images with an image prompt

realvisxl-v2.0
Implementation of SDXL RealVisXL_V2.0

realvisxl2-lora-inference
POC to run inference on Realvisxl2 LoRAs

realvisxl2-lora-training
POC to train Realvisxl2 LoRAs

ssd-1b
Segmind Stable Diffusion Model (SSD-1B) is a distilled 50% smaller version of SDXL, offering a 60% speedup while maintaining high-quality text-to-image generation capabilities

ssd-lora-inference
POC to run inference on SSD-1B LoRAs

ssd-lora-training
POC to train SSD-1B LoRAs for cheaper & faster training

ssd-1b-txt2img_batch
Batch mode for Segmind Stable Diffusion Model (SSD-1B) txt2img

realvisxl-v2-img2img
Implementation of SDXL RealVisXL_V2.0 img2img

thinkdiffusionxl
ThinkDiffusionXL is a go-to model capable of amazing photorealism that's also versatile enough to generate high-quality images across a variety of styles and subjects without needing to be a prompting genius

kosmos-2
Grounding Multimodal Large Language Models to the World

ssd-1b-img2img
Segmind Stable Diffusion Model (SSD-1B) img2img

sdxl
SDXL v1.0 - A text-to-image generative AI model that creates beautiful images

realvisxl-v1-img2img
Implementation of SDXL RealVisXL_V1.0 img2img

dolphin-2.2.1-mistral-7b
Mistral-7B-v0.1 fine-tuned for chat with the Dolphin dataset (an open-source implementation of Microsoft's Orca)

dolphin-2.1-mistral-7b
Mistral-7B-v0.1 fine-tuned for chat with the Dolphin dataset (an open-source implementation of Microsoft's Orca)

mistrallite
MistralLite is a fine-tuned Mistral-7B-v0.1 language model, with enhanced capabilities for processing long context (up to 32K tokens)

bakllava
BakLLaVA-1 is a Mistral 7B base augmented with the LLaVA 1.5 architecture

hotshot-xl
😊 Hotshot-XL is an AI text-to-GIF model trained to work alongside Stable Diffusion XL

fuyu-8b
Fuyu-8B is a multi-modal text and image transformer trained by Adept AI

video-crafter
Open diffusion model for high-quality video generation

sdxl-inpainting
SDXL Inpainting is a latent diffusion model developed by the HF Diffusers team. Keeps input aspect ratio

qwen-vl-chat
A multimodal LLM-based AI assistant, which is trained with alignment techniques. Qwen-VL-Chat supports more flexible interaction, such as multi-round question answering, and creative capabilities.

comfyui-sdxl-txt2img
Using a ComfyUI workflow to run SDXL text2img

sadtalker
Stylized Audio-Driven Single Image Talking Face Animation

sdxl-controlnet
SDXL ControlNet - Canny

mistral-7b-v0.1
Mistral-7B-v0.1 is a pretrained generative text model that outperforms Llama 2 13B on all benchmarks

animate-diff
Animate Your Personalized Text-to-Image Diffusion Models

illusion-diffusion-hq
Monster Labs QrCode ControlNet on top of SD Realistic Vision v5.1

remove-bg
Remove background from an image

realvisxl-v1.0
Implementation of SDXL RealVisXL_V1.0

sdxl-controlnet-depth
SDXL ControlNet - Depth

phi-1.5
microsoft/phi-1.5 was trained using the same data sources as phi-1, augmented with a new data source that consists of various NLP synthetic texts

clip-interrogator
CLIP Interrogator (for faster inference)

sdxl-panoramic
360 Panorama SDXL image with inpainted wrapping seam

codeformer
Robust face restoration algorithm for old photos/AI-generated faces - (A40 GPU)

blueprint
An SDXL fine-tune based on blueprints

ms-img2vid
Turn any image into a video

wizardcoder-python-34b-v1.0
Empowering Code Large Language Models with Evol-Instruct

idefics-80b
IDEFICS 80b Quantized

idefics-9b
IDEFICS 9b Quantized

realistic-vision-v5-openpose
Realistic Vision V5 with OpenPose

spider-gwen-style
SDXL fine-tune on Spider-Gwen style

realistic-vision-v5
Realistic Vision v5.0 with VAE

sdxl-controlnet-openpose
SDXL ControlNet - OpenPose

realistic-vision-v5-inpainting
Realistic Vision v5.0 Inpainting

realistic-vision-v5-img2img
Realistic Vision v5.0 Image 2 Image

realistic-vision-v5.1
Implementation of Realistic Vision v5.1 with VAE

sdxl-clip-interrogator
CLIP Interrogator for SDXL optimizes text prompts to match a given image

upstage-llama-2-70b-instruct-v2
Upstage/Llama-2-70B-instruct-v2 - GPTQ

glaive-function-calling-v1
2.7B param open source chat model trained on Glaive’s synthetic data generation platform

gfpgan
Practical face restoration algorithm for *old photos* or *AI-generated faces* (for larger images)

freewilly2
Stability AI's FreeWilly2

llama-2-70b-chat
Meta's Llama 2 70b Chat - GPTQ

llama-2-13b-chat
Meta's Llama 2 13b Chat - GPTQ

llama-2-7b-chat
Meta's Llama 2 7b Chat - GPTQ

speaker-diarization
Segments an audio recording based on who is speaking (on A100)

rivers-stable-diffusion-upscaler
RiversHaveWings Stable Diffusion Upscaler

real-esrgan
Real-ESRGAN with optional face correction and adjustable upscale (for larger images)

wsrglow
A working wsrglow model

stable-diffusion-image-variation
Image Variations with Stable Diffusion

realistic-vision-v4.0
Realistic Vision V4.0

realistic-vision-v3.0
Realistic Vision V3.0 with VAE

instruct-glaive
sahil2801/replit-code-instruct-glaive

replit-code-v1-3b
replit/replit-code-v1-3b

xgen-7b-8k-base
Salesforce/xgen-7b-8k-base

vicuna-33b-v1.3
lmsys/vicuna-33b-v1.3

vicuna-13b-v1.3
lmsys/vicuna-13b-v1.3

vicuna-7b-v1.3
lmsys/vicuna-7b-v1.3

codegen2-1b
Salesforce/codegen2-1B

mpt-30b-chat
mosaicml/mpt-30b-chat in 8bit

tiny-starcoder-py
bigcode/tiny_starcoder_py

wizardcoder-15b-v1.0
WizardLM/WizardCoder-15B-V1.0

wizardcoder-15b-v1
WizardLM/WizardCoder-15B-V1.0 in 4bit

shiba-diffusion
Shiba stable diffusion model