demofusion-enhance

Image to Image enhancer using DemoFusion

Updated 14 runs

vid2openpose

Video to OpenPose

Updated 22 runs

magic-animate-openpose

MagicAnimate using an OpenPose input video

Updated 99 runs

playground-v2

Playground v2 is a diffusion-based text-to-image generative model trained from scratch. Try out all 3 models here

Updated 102 runs

vid2densepose

Convert your videos to DensePose and use it with MagicAnimate

Updated 421 runs

style-aligned

GoogleAI: Style Aligned Image Generation via Shared Attention

Updated 459 runs

magic-animate

MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Updated 8K runs

cross-image-attention

Given two images depicting a source structure and a target appearance, generate an image merging the structure of one image with the appearance of the other

Updated 217 runs

pixart-xl-2

PixArt-Alpha 1024px is a transformer-based text-to-image diffusion system trained on text embeddings from T5

Updated 1.3K runs

pixart-lcm-xl-2

PixArt-Alpha LCM is a transformer-based text-to-image diffusion system trained on text embeddings from T5

Updated 1.7K runs

demofusion

DemoFusion: Democratising High-Resolution Image Generation With No 💰

Updated 2.4K runs

sdxl-img-blend

SDXL Image Blending

Updated 473 runs

interpany-clearer

InterpAny-Clearer: Clearer anytime frame interpolation & Manipulated interpolation

Updated 982 runs

xtts-v2

Coqui XTTS-v2: Multilingual Text To Speech Voice Cloning

Updated 8K runs

controlnet-tile

Controlnet v1.1 - Tile Version

Updated 419 runs

real-esrgan-video

Real-ESRGAN Video Upscaler

Updated 11.9K runs

seine

Image-to-video - SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction

Updated 3K runs

svd

Research-only StabilityAI's Stable Video Diffusion

Updated 6.9K runs

nsfw_image_detection

Falcons.ai Fine-Tuned Vision Transformer (ViT) for NSFW Image Classification

Updated 27.4K runs

animate-diff-sdxl-lcm

Animate Your Personalized Text-to-Image Diffusion Models with SDXL and LCM

Updated 226 runs

vseq2vseq

Text to video diffusion model with variable length frame conditioning for infinite length video

Updated 327 runs

dreamshaper7-img2img-lcm

Dreamshaper-7 img2img with LCM LoRA for faster inference

Updated 1.5K runs

realvisxl2-lcm

RealvisXL-v2.0 with LCM LoRA - requires fewer steps (4 to 8 instead of the original 40 to 50)

Updated 79.6K runs

modelscope-facefusion

Auto fuse a user's face onto the template image, with a similar appearance to the user

Updated 2.4K runs

ip_adapter-face-inpaint

A combination of ip_adapter SDv1.5 and mediapipe-face to inpaint a face

Updated 470 runs

sdxl-niji-se

SDXL_Niji_Special Edition

Updated 3.6K runs

sdxl-lcm-zeke

A fine-tuned SDXL-LCM LoRA based on the photos of Zeke

Updated 211 runs

sdxl-lcm

Latent Consistency Model (LCM): SDXL, distills the original model into a version that requires fewer steps (4 to 8 instead of the original 25 to 50)

Updated 129.1K runs

ip_adapter-sdxl-face

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate SDXL images with an image prompt

Updated 10.4K runs

sdxl-lcm-loras

POC of SDXL-LCM LoRA combined with Replicate LoRA for 4 second inference times

Updated 296 runs

lcm-ssd-1b

Latent Consistency Model (LCM): SSD-1B, is a LCM distilled version that reduces the number of inference steps needed to only 2 - 8 steps

Updated 1.4K runs

ip_adapter-face

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate SDv1.5 images with an image prompt

Updated 358 runs

realvisxl-v2.0

Implementation of SDXL RealVisXL_V2.0

Updated 30.2K runs

realvisxl2-lora-inference

POC to run inference on Realvisxl2 LoRAs

Updated 1.1K runs

realvisxl2-lora-training

POC to train Realvisxl2 LoRAs

Updated 99 runs

ssd-1b

Segmind Stable Diffusion Model (SSD-1B) is a distilled 50% smaller version of SDXL, offering a 60% speedup while maintaining high-quality text-to-image generation capabilities

Updated 195.8K runs

ssd-lora-inference

POC to run inference on SSD-1B LoRAs

Updated 998 runs

ssd-lora-training

POC to train SSD-1B LoRAs for cheaper & faster training

Updated 167 runs

ssd-1b-txt2img_batch

Batch mode for Segmind Stable Diffusion Model (SSD-1B) txt2img

Updated 250 runs

realvisxl-v2-img2img

Implementation of SDXL RealVisXL_V2.0 img2img

Updated 3.3K runs

thinkdiffusionxl

ThinkDiffusionXL is a go-to model capable of amazing photorealism that's also versatile enough to generate high-quality images across a variety of styles and subjects without needing to be a prompting genius

Updated 4.5K runs

kosmos-2

Grounding Multimodal Large Language Models to the World

Updated 1.1K runs

ssd-1b-img2img

Segmind Stable Diffusion Model (SSD-1B) img2img

Updated 1.4K runs

sdxl

SDXL v1.0 - A text-to-image generative AI model that creates beautiful images

Updated 72.4K runs

realvisxl-v1-img2img

Implementation of SDXL RealVisXL_V1.0 img2img

Updated 2K runs

dolphin-2.2.1-mistral-7b

Mistral-7B-v0.1 fine tuned for chat with the Dolphin dataset (an open-source implementation of Microsoft's Orca)

Updated 2.9K runs

dolphin-2.1-mistral-7b

Mistral-7B-v0.1 fine tuned for chat with the Dolphin dataset (an open-source implementation of Microsoft's Orca)

Updated 4.6K runs

mistrallite

MistralLiteA is a fine-tuned Mistral-7B-v0.1 language model, with enhanced capabilities of processing long context (up to 32K tokens)

Updated 508 runs

bakllava

BakLLaVA-1 is a Mistral 7B base augmented with the LLaVA 1.5 architecture

Updated 865 runs

hotshot-xl

😊 Hotshot-XL is an AI text-to-GIF model trained to work alongside Stable Diffusion XL

Updated 26.5K runs

fuyu-8b

Fuyu-8B is a multi-modal text and image transformer trained by Adept AI

Updated 2.3K runs

video-crafter

Open diffusion model for high-quality video generation

Updated 4K runs

sdxl-inpainting

SDXL Inpainting is a latent diffusion model developed by the HF Diffusers team. Keeps input aspect ratio

Updated 1.4K runs

qwen-vl-chat

A multimodal LLM-based AI assistant, which is trained with alignment techniques. Qwen-VL-Chat supports more flexible interaction, such as multi-round question answering, and creative capabilities.

Updated 9.7K runs

comfyui-sdxl-txt2img

Using a ComfyUI workflow to run SDXL text2img

Updated 280 runs

sadtalker

Stylized Audio-Driven Single Image Talking Face Animation

Updated 2.9K runs

sdxl-controlnet

SDXL ControlNet - Canny

Updated 261.7K runs

mistral-7b-v0.1

Mistral-7B-v0.1 is a pretrained generative text model that outperforms Llama 2 13B on all benchmarks

Updated 690 runs

animate-diff

Animate Your Personalized Text-to-Image Diffusion Models

Updated 113.6K runs

illusion-diffusion-hq

Monster Labs QrCode ControlNet on top of SD Realistic Vision v5.1

Updated 204.3K runs

remove-bg

Remove background from an image

Updated 33.1K runs

realvisxl-v1.0

Implementation of SDXL RealVisXL_V1.0

Updated 31.4K runs

sdxl-controlnet-depth

SDXL ControlNet - Depth

Updated 27.2K runs

phi-1.5

microsoft/phi-1.5 was trained using the same data sources as phi-1, augmented with a new data source that consists of various NLP synthetic texts

Updated 730 runs

clip-interrogator

CLIP Interrogator (for faster inference)

Updated 77.8K runs

sdxl-panoramic

360 Panorama SDXL image with inpainted wrapping seam

Updated 4.6K runs

codeformer

Robust face restoration algorithm for old photos/AI-generated faces - (A40 GPU)

Updated 43.2K runs

blueprint

An SDXL fine-tune based on blueprints

Updated 140 runs

ms-img2vid

Turn any image into a video

Updated 321K runs

wizardcoder-python-34b-v1.0

Empowering Code Large Language Models with Evol-Instruct

Updated 263 runs

idefics-80b

IDEFICS 80b Quantized

Updated 220 runs

idefics-9b

IDEFICS 9b Quantized

Updated 1.9K runs

realistic-vision-v5-openpose

Realistic Vision V5 with OpenPose

Updated 4.6K runs

spider-gwen-style

SDXL fine tune on Spider-Gwen style

Updated 156 runs

realistic-vision-v5

Realistic Vision v5.0 with VAE

Updated 5.3K runs

sdxl-controlnet-openpose

SDXL ControlNet - OpenPose

Updated 20.7K runs

realistic-vision-v5-inpainting

Realistic Vision v5.0 Inpainting

Updated 14.6K runs

realistic-vision-v5-img2img

Realistic Vision v5.0 Image 2 Image

Updated 80.4K runs

realistic-vision-v5.1

Implementation of Realistic Vision v5.1 with VAE

Updated 148.7K runs

sdxl-clip-interrogator

CLIP Interrogator for SDXL optimizes text prompts to match a given image

Updated 669.4K runs

upstage-llama-2-70b-instruct-v2

Upstage/Llama-2-70B-instruct-v2 - GPTQ

Updated 1.2K runs

glaive-function-calling-v1

2.7B param open source chat model trained on Glaive’s synthetic data generation platform

Updated 113 runs

gfpgan

Practical face restoration algorithm for *old photos* or *AI-generated faces* (for larger images)

Updated 55.6K runs

freewilly2

Stability AI's FreeWilly2

Updated 302 runs

llama-2-70b-chat

Meta's Llama 2 70b Chat - GPTQ

Updated 582 runs

llama-2-13b-chat

Meta's Llama 2 13b Chat - GPTQ

Updated 17.5K runs

llama-2-7b-chat

Meta's Llama 2 7b Chat - GPTQ

Updated 19.8K runs

speaker-diarization

Segments an audio recording based on who is speaking (on A100)

Updated 4.7K runs

rivers-stable-diffusion-upscaler

RiversHaveWings Stable Diffusion Upscaler

Updated 363 runs

real-esrgan

Real-ESRGAN with optional face correction and adjustable upscale (for larger images)

Updated 11.2K runs

wsrglow

A working wsrglow model

Updated 233 runs

stable-diffusion-image-variation

Image Variations with Stable Diffusion

Updated 464 runs

realistic-vision-v4.0

Realistic Vision V4.0

Updated 38.6K runs

realistic-vision-v3.0

Realistic Vision V3.0 with VAE

Updated 4.5K runs

instruct-glaive

sahil2801/replit-code-instruct-glaive

Updated 356 runs

replit-code-v1-3b

replit/replit-code-v1-3b

Updated 124 runs

xgen-7b-8k-base

Salesforce/xgen-7b-8k-base

Updated 101 runs

vicuna-33b-v1.3

lmsys/vicuna-33b-v1.3

Updated 26.1K runs

vicuna-13b-v1.3

lmsys/vicuna-13b-v1.3

Updated 505 runs

vicuna-7b-v1.3

lmsys/vicuna-7b-v1.3

Updated 187 runs

codegen2-1b

Salesforce/codegen2-1B

Updated 573 runs

mpt-30b-chat

mosaicml/mpt-30b-chat in 8bit

Updated 2.8K runs

tiny-starcoder-py

bigcode/tiny_starcoder_py

Updated 67 runs

wizardcoder-15b-v1.0

WizardLM/WizardCoder-15B-V1.0

Updated 2.8K runs

wizardcoder-15b-v1

WizardLM/WizardCoder-15B-V1.0 in 4bit

Updated 459 runs

shiba-diffusion

Shiba stable diffusion model

Updated 676 runs