idefics

Open-access reproduction of large visual language model Flamingo

Updated 12 hours ago 79 runs

wuerstchen

Efficient Pretraining of Text-to-Image Models

Updated 6 days, 10 hours ago 751 runs

seamless_communication

SeamlessM4T—Massively Multilingual & Multimodal Machine Translation

Updated 1 week, 1 day ago 214 runs

i2vgen-xl

Generating high-definition videos based on input images and videos.

Updated 3 weeks, 3 days ago 1.8K runs

unival

Unified Model for Image, Video, Audio and Language Tasks

Updated 1 month, 1 week ago 210 runs

lorahub

Efficient Cross-Task Generalization via Dynamic LoRA Composition

Updated 1 month, 2 weeks ago 59 runs

resshift

Efficient Diffusion Model for Image Super-resolution by Residual Shifting

Updated 1 month, 2 weeks ago 620 runs

ledits

Real Image Editing with DDPM Inversion and Semantic Guidance

Updated 1 month, 4 weeks ago 635 runs

kandinsky-2-2-controlnet-depth

Kandinsky Image Generation with ControlNet Conditioning

Updated 2 months ago 2.3K runs

styledrop

Text-to-Image Generation in Any Style

Updated 2 months, 2 weeks ago 970 runs

demucs

Demucs Music Source Separation

Updated 2 months, 2 weeks ago 2.9K runs

diffedit-stable-diffusion

Diffusion-based semantic image editing with mask guidance

Updated 3 months, 1 week ago 262 runs

textdiffuser

Diffusion Models as Text Painters

Updated 3 months, 2 weeks ago 1.3K runs

prompt-free-diffusion

Prompt-free Diffusion

Updated 3 months, 2 weeks ago 625 runs

controlvideo

Training-free Controllable Text-to-Video Generation

Updated 3 months, 3 weeks ago 1.2K runs

shap-e

Generating Conditional 3D Implicit Functions

Updated 4 months ago 7.4K runs

fastcomposer

Tuning-Free Multi-Subject Image Generation with Localized Attention

Updated 4 months ago 2.1K runs

sadtalker

Stylized Audio-Driven Single Image Talking Face Animation

Updated 4 months, 1 week ago 21.1K runs

semantic-segment-anything

Adding semantic labels for segment anything

Updated 5 months, 1 week ago 8.1K runs

videocrafter

Text-to-Video Generation and Editing

Updated 5 months, 1 week ago 1.2K runs

text2video-zero

Text-to-Image Diffusion Models are Zero-Shot Video Generators

Updated 5 months, 2 weeks ago 35.3K runs

pix2struct

Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding

Updated 5 months, 3 weeks ago 5.2K runs

dolly

Fine-tuned GPT-J 6B model on the Alpaca dataset

Updated 5 months, 3 weeks ago 943 runs

stable-diffusion-2-1-unclip

Stable Diffusion v2-1-unclip Model

Updated 5 months, 3 weeks ago 1.6K runs

damo-text-to-video

Multi-stage text-to-video generation

Updated 5 months, 4 weeks ago 80.6K runs

unidiffuser

One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale

Updated 6 months ago 789 runs

dreamshaper

Dream Shaper stable diffusion

Updated 6 months, 1 week ago 868.8K runs

zoedepth

ZoeDepth: Combining relative and metric depth

Updated 6 months, 2 weeks ago 1.5M runs

hasdx

mixed stable diffusion model

Updated 6 months, 2 weeks ago 28.6K runs

supermarionation

Finetuned Stable-diffusion from Gerry Anderson Supermarionation

Updated 6 months, 2 weeks ago 1.6K runs

pastel-mix

high-quality highly detailed anime stylized latent diffusion model

Updated 7 months ago 27.7K runs

sd-x2-latent-upscaler

Stable Diffusion x2 latent upscaler

Updated 7 months ago 2K runs

real-esrgan

Real-ESRGAN: Real-World Blind Super-Resolution

Updated 7 months ago 720.2K runs

t2i-adapter

Learning Adapters towards Controllable for Text-to-Image Diffusion Models

Updated 7 months ago 2.4K runs

midas

Robust Monocular Depth Estimation

Updated 7 months, 1 week ago 9.4K runs

hard-prompts-made-easy

Gradient-Based Discrete Optimization for Prompt Tuning and Discovery

Updated 7 months, 1 week ago 567 runs

pix2pix-zero

Zero-shot Image-to-Image Translation

Updated 7 months, 1 week ago 3.6K runs

dreambooth-avatar

Dreambooth finetuning of Stable Diffusion (v1.5.1) on Avatar art style by Lambda Labs

Updated 7 months, 2 weeks ago 518 runs

gta5_artwork_diffusion

GTA5 Artwork Diffusion via Dreambooth

Updated 7 months, 2 weeks ago 4.5K runs

magifactory-t-shirt-diffusion

Generate t-shirt logos with stable-dfffusion

Updated 7 months, 2 weeks ago 169.3K runs

distilgpt2-stable-diffusion-v2

Descriptive stable diffusion prompts generation using GPT2

Updated 7 months, 3 weeks ago 392 runs

portraitplus

Portraits with stable-diffusion

Updated 7 months, 4 weeks ago 21K runs

anything-v4.0

high-quality, highly detailed anime-style Stable Diffusion models

Updated 7 months, 4 weeks ago 1.7M runs

point-e

Point-E: A System for Generating 3D Point Clouds from Complex Prompts

Updated 8 months ago 6.6K runs

anything-v3-better-vae

high-quality, highly detailed anime style stable-diffusion with better VAE

Updated 8 months ago 3M runs

future-diffusion

Finte-tuned Stable Diffusion on high quality 3D images with a futuristic Sci-Fi theme

Updated 8 months, 2 weeks ago 4.9K runs

karlo

Text-conditional image generation model based on OpenAI's unCLIP

Updated 8 months, 2 weeks ago 838 runs

analog-diffusion

a dreambooth model trained on a diverse set of analog photographs

Updated 8 months, 3 weeks ago 221.7K runs

taiyi-stable-diffusion-1b-chinese-v0.1

Chinese Stable diffusion model

Updated 8 months, 3 weeks ago 775 runs

eimis_anime_diffusion

stable-diffusion models for high quality and detailed anime images

Updated 8 months, 3 weeks ago 11.3K runs

anything-v3.0

high-quality, highly detailed anime style stable-diffusion

Updated 9 months ago 330K runs

whisper

with large-v2 checkpoint

Updated 9 months ago 3.9K runs

stable-diffusion-img2img-v2.1

Updated 9 months ago 12.2K runs

wavyfusion

dreambooth trained on a very diverse dataset ranging from photographs to paintings

Updated 9 months, 1 week ago 3.5K runs

altdiffusion-m9

Multilingual Stable Diffusion

Updated 9 months, 2 weeks ago 591 runs

stable-diffusion-v2

sd-v2 with diffusers, test version!

Updated 9 months, 2 weeks ago 258.3K runs

stable-diffusion-v2-inpainting

stable-diffusion-v2-inpainting

Updated 9 months, 2 weeks ago 13.3K runs

rembg

remove images background

Updated 9 months, 3 weeks ago 1.5M runs

stable-diffusion

stable-diffusion with negative prompts, more scheduler

Updated 9 months, 4 weeks ago 65K runs

app_icons_generator

App Icons Generator V1 (DreamBooth Model)

Updated 10 months ago 1.8K runs

aesthetic-predictor

A linear estimator on top of clip to predict the aesthetic quality of pictures

Updated 10 months ago 7.5K runs

backgroundmatting

Real-Time High-Resolution Background Matting

Updated 10 months ago 2.1K runs

sd_pixelart_spritesheet_generator

generate pixel art sprite sheets from four different angles with Stable-diffusion

Updated 10 months, 1 week ago 3.9K runs

disco-diffusion-style

Disco Diffusion style on Stable Diffusion via Dreambooth

Updated 10 months, 1 week ago 3K runs

tron-legacy-diffusion

Tron Legacy Diffusion on Stable Diffusion via Dreambooth

Updated 10 months, 1 week ago 1.5K runs

dreambooth-pikachu

Pikachu on Stable Diffusion via Dreambooth

Updated 10 months, 1 week ago 474 runs

herge-style

herge_style on Stable Diffusion via Dreambooth

Updated 10 months, 2 weeks ago 1.9K runs

van-gogh-diffusion

Van Gough on Stable Diffusion via Dreambooth

Updated 10 months, 2 weeks ago 5.1K runs

elden-ring-diffusion

fine-tuned Stable Diffusion model trained on the game art from Elden Ring

Updated 10 months, 2 weeks ago 6.7K runs

prompt-to-prompt

Prompt-to-prompt image editing with cross-attention control

Updated 10 months, 3 weeks ago 1.5K runs

stable-diffusion-v1-5

stable-diffusion with v1-5 checkpoint

Updated 10 months, 3 weeks ago 34K runs

stable-diffusion-aesthetic-gradients

Stable Diffusion with Aesthetic Gradients

Updated 10 months, 3 weeks ago 338 runs

waifu-diffusion

Stable Diffusion on Danbooru images

Updated 11 months, 1 week ago 1.1M runs

whisper-downloadable-subtitles

Added downloadable subtitles for openai/whisper

Updated 11 months, 2 weeks ago 1.5K runs

rudalle-sr

Real-ESRGAN super-resolution model from ruDALL-E

Updated 11 months, 3 weeks ago 348.6K runs

stable-diffusion-high-resolution

Detailed, higher-resolution images from Stable Diffusion

Updated 11 months, 3 weeks ago 65.2K runs

clip-vit-large-patch14

openai/clip-vit-large-patch14 with Transformers

Updated 1 year ago 2.7M runs

sd-textual-inversion-ugly-sonic

stable-diffusion-textual-inversion fine-tuned with ugly sonic

Updated 1 year ago 1.9K runs

sd-textual-inversion-spyro-dragon

stable-diffusion-textual-inversion fine-tuned with spyro of the dragon STYLE

Updated 1 year ago 434 runs

sd-textual-inversion

Stable Diffusion Textual Inversion

Updated 1 year ago 468 runs

docentr

End-to-End Document Image Enhancement Transformer

Updated 1 year ago 1.3K runs

style-your-hair

Pose-Invariant Hairstyle Transfer

Updated 1 year, 1 month ago 6K runs

repaint

Inpainting using Denoising Diffusion Probabilistic Models

Updated 1 year, 1 month ago 2.3K runs

night-enhancement

Unsupervised Night Image Enhancement

Updated 1 year, 1 month ago 18.2K runs

latent-diffusion-text2img

text-to-image with latent diffusion

Updated 1 year, 1 month ago 3.8K runs

openpsg

Panoptic Scene Graph Generation

Updated 1 year, 1 month ago 672 runs

mindall-e

text-to-image generation

Updated 1 year, 1 month ago 1.6K runs

vq-diffusion

VQ-Diffusion for Text-to-Image Synthesis

Updated 1 year, 1 month ago 20.6K runs

compositional-vsual-generation-with-composable-diffusion-models-pytorch

Composable Diffusion

Updated 1 year, 1 month ago 762 runs

micromotion-stylegan

Decoding Micromotion in Low-dimensional Latent Spaces from StyleGAN

Updated 1 year, 1 month ago 6.1K runs

clip-gen

Language-Free Training of a Text-to-Image Generator with CLIP

Updated 1 year, 1 month ago 802 runs

bigcolor

Colorization using a Generative Color Prior for Natural Images

Updated 1 year, 1 month ago 236.3K runs

global_tracking_transformers

Global Tracking Transformers

Updated 1 year, 1 month ago 117 runs

vqfr

Blind Face Restoration with Vector-Quantized Dictionary and Parallel Decoder

Updated 1 year, 1 month ago 91.7K runs

diffae

Image Manipulatinon with Diffusion Autoencoders

Updated 1 year, 1 month ago 11.3K runs

face-align-cog

face alignment using stylegan-encoding

Updated 1 year, 3 months ago 2.6K runs

clip-guided-diffusion

Clip-Guided Diffusion Model for Image Generation

Updated 1 year, 6 months ago 4.5K runs

clip-guided-diffusion-pokemon

Generates pokemon sprites from prompt

Updated 1 year, 6 months ago 4.7K runs

multilingual-stable-diffusion

No versions pushed 0 runs

maskgit

Masked Generative Image Transformer

No versions pushed 1 run

oneformer

One Transformer to Rule Universal Image Segmentation

No versions pushed 0 runs

ddnm

Zero Shot Image Restoration Using Denoising Diffusion Null-Space Model

No versions pushed 0 runs

chatglm-6b

bilingual language model based on General Language Model (GLM) framework

No versions pushed 0 runs

pix2seq

Turning RGB pixels into semantically meaningful sequences

No versions pushed 0 runs