chenxwh / omnigen

OmniGen: Unified Image Generation

2.4K runs
Public

chenxwh / meissonic

Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

31 runs
Public

chenxwh / depth-any-video

Depth Any Video with Scalable Synthetic Data

122 runs
Public

chenxwh / hart

Efficient Visual Generation with Hybrid Autoregressive Transformer

110 runs
Public

chenxwh / cogview3

Finer and Faster Text-to-Image Generation via Relay Diffusion

32 runs
Public

chenxwh / ml-depth-pro

Sharp Monocular Metric Depth in Less Than a Second

163 runs
Public

chenxwh / lotus

Diffusion-based Visual Foundation Model for High-quality Dense Prediction

99 runs
Public

chenxwh / depthcrafter

Generating Consistent Long Depth Sequences for Open-world Videos

94 runs
Public

chenxwh / cogvlm2-video

CogVLM2: Visual Language Models for Image and Video Understanding

343.2K runs
Public

chenxwh / cogvlm2

CogVLM2: Visual Language Models for Image and Video Understanding

375 runs
Public

chenxwh / diffsynth-exvideo

Extended video synthesis model that generates 128 frames

197 runs
Public

chenxwh / omost

Convert LLM's coding to image generation

1.8K runs
Public

cjwbw / sadtalker

Stylized Audio-Driven Single Image Talking Face Animation

112.5K runs
Public

chenxwh / sdxl-flash

Fast sdxl with higher quality

572.8K runs
Public

chenxwh / hunyuandit

A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

322 runs
Public

chenxwh / openvoice

Updated to OpenVoice v2: Versatile Instant Voice Cloning

36.4K runs
Public

cjwbw / hyper-sdxl-1step-t2i

Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis

1.4K runs
Public

cjwbw / voicecraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

9.3K runs
Public

cjwbw / parler-tts

lightweight text-to-speech (TTS) model, trained on 10.5K hours of audio data

1.3K runs
Public

cjwbw / pixart-sigma

Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

6K runs
Public

cjwbw / aniportrait-audio2vid

Audio-Driven Synthesis of Photorealistic Portrait Animations

8.1K runs
Public

cjwbw / animagine-xl-3.1

Anime-themed text-to-image stable diffusion model

849.5K runs
Public

cjwbw / starcoder2-15b

Language Models for Code

238 runs
Public

cjwbw / tcs-sdxl-lora

Trajectory Consistency Distillation

565 runs
Public

cjwbw / melotts

High-quality multilingual text-to-speech library

1.2K runs
Public

cjwbw / opencodeinterpreter-ds-6.7b

OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement

101 runs
Public

cjwbw / supir-v0f

Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild. This is the SUPIR-v0F model and does NOT use LLaVA-13b.

10.3K runs
Public

cjwbw / supir-v0q

Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild. This is the SUPIR-v0Q model and does NOT use LLaVA-13b.

7.2K runs
Public

cjwbw / supir

Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild. This version uses LLaVA-13b for captioning.

157K runs
Public

cjwbw / uform-gen2-qwen-500m

Pocket-Sized Multimodal AI For Content Understanding and Generation

390 runs
Public

cjwbw / lambda-eclipse

λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space

171 runs
Public

cjwbw / blipdiffusion

Pre-trained Subject Representation for Controllable Text-to-Image Generation and Editing

338 runs
Public

cjwbw / blipdiffusion-controlnet

Pre-trained Subject Representation for Controllable Text-to-Image Generation and Editing with ControlNet

163 runs
Public

cjwbw / rmgb

Background removal model developed by BRIA.AI, trained on a carefully selected dataset and is available as an open-source model for non-commercial use.

48.3K runs
Public

cjwbw / cogagent-chat

A Visual Language Model for GUI Agents

2.2K runs
Public

cjwbw / videocrafter

VideoCrafter2: Text-to-Video and Image-to-Video Generation and Editing

33.7K runs
Public

cjwbw / depth-anything

Highly practical solution for robust monocular depth estimation by training on a combination of 1.5M labeled images and 62M+ unlabeled images

4.6K runs
Public

cjwbw / tokenflow

Consistent Diffusion Features for Consistent Video Editing

2K runs
Public

chenxwh / video-retalking

Audio-based Lip Synchronization for Talking Head Video

25.9K runs
Public

cjwbw / diffmorpher

Diffusion Models for Image Morphing

992 runs
Public

cjwbw / dreamtalk

RESEARCH/NON-COMMERCIAL USE ONLY: diffusion-based audio-driven expressive talking head generation

984 runs
Public

cjwbw / faster-diffusion

Rethinking the Role of UNet Encoder in Diffusion Models

132 runs
Public

cjwbw / magicoder

LLMs with open-source code snippets for generating low-bias and high-quality instruction data for code.

349 runs
Public

cjwbw / segmind-vega

Open-source Distilled Stable Diffusion 100% speedup

1.7K runs
Public

cjwbw / segmind-vegart

Fast Segmind-Vega with 2-8 inference steps.

762 runs
Public

cjwbw / cogvlm

powerful open-source visual language model

593.7K runs
Public

cjwbw / kandinskyvideo

text-to-video generation model

1.2K runs
Public

cjwbw / lavie

High-Quality Video Generation with Cascaded Latent Diffusion Models

13.1K runs
Public

cjwbw / gorilla

Gorilla: Large Language Model Connected with Massive APIs

88 runs
Public

cjwbw / distil-whisper

Distilled version of Whisper

272 runs
Public

cjwbw / cutie

Video Object Segmentation, combined with SAM and ProPainter

246 runs
Public

cjwbw / audiosep

Separate Anything You Describe

3.3K runs
Public

cjwbw / scalecrafter

Tuning-free Higher-Resolution Visual Generation with Diffusion Models

1.1K runs
Public

cjwbw / show-1

Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

967 runs
Public

cjwbw / daclip-uir

Controlling Vision-Language Models for Universal Image Restoration

2.1K runs
Public

cjwbw / instructcv

Instruction tuned text-to-image diffusion models as vision generalists

356 runs
Public

cjwbw / internlm-xcomposer

Advanced text-image comprehension and composition based on InternLM

164.2K runs
Public

cjwbw / wuerstchen

Efficient Pretraining of Text-to-Image Models

4.2K runs
Public

cjwbw / seamless_​communication

SeamlessM4T—Massively Multilingual & Multimodal Machine Translation

78.9K runs
Public

cjwbw / unival

Unified Model for Image, Video, Audio and Language Tasks

930 runs
Public

cjwbw / lorahub

Efficient Cross-Task Generalization via Dynamic LoRA Composition

80 runs
Public

cjwbw / resshift

Efficient Diffusion Model for Image Super-resolution by Residual Shifting

2.5K runs
Public

cjwbw / ledits

Real Image Editing with DDPM Inversion and Semantic Guidance

927 runs
Public

cjwbw / kandinsky-2-2-controlnet-depth

Kandinsky Image Generation with ControlNet Conditioning

3.7K runs
Public

cjwbw / demucs

Demucs Music Source Separation

142.9K runs
Public

cjwbw / diffedit-stable-diffusion

Diffusion-based semantic image editing with mask guidance

401 runs
Public

cjwbw / textdiffuser

Diffusion Models as Text Painters

1.7K runs
Public

cjwbw / prompt-free-diffusion

Prompt-free Diffusion

738 runs
Public

cjwbw / controlvideo

Training-free Controllable Text-to-Video Generation

2.1K runs
Public

cjwbw / shap-e

Generating Conditional 3D Implicit Functions

14.7K runs
Public

cjwbw / semantic-segment-anything

Adding semantic labels for segment anything

24.2K runs
Public

cjwbw / text2video-zero

Text-to-Image Diffusion Models are Zero-Shot Video Generators

41.3K runs
Public

cjwbw / pix2struct

Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding

6K runs
Public

cjwbw / dolly

Fine-tuned GPT-J 6B model on the Alpaca dataset

976 runs
Public

cjwbw / stable-diffusion-2-1-unclip

Stable Diffusion v2-1-unclip Model

2.3K runs
Public

cjwbw / damo-text-to-video

Multi-stage text-to-video generation

140.6K runs
Public

cjwbw / unidiffuser

One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale

1.1K runs
Public

cjwbw / dreamshaper

Dream Shaper stable diffusion

1.3M runs
Public

cjwbw / zoedepth

ZoeDepth: Combining relative and metric depth

4.1M runs
Public

cjwbw / hasdx

mixed stable diffusion model

30K runs
Public

cjwbw / supermarionation

Finetuned Stable-diffusion from Gerry Anderson Supermarionation

1.9K runs
Public

cjwbw / pastel-mix

high-quality highly detailed anime stylized latent diffusion model

30.9K runs
Public

cjwbw / real-esrgan

Real-ESRGAN: Real-World Blind Super-Resolution

1.8M runs
Public

cjwbw / t2i-adapter

Learning Adapters towards Controllable for Text-to-Image Diffusion Models

3.9K runs
Public

cjwbw / midas

Robust Monocular Depth Estimation

249.1K runs
Public

cjwbw / hard-prompts-made-easy

Gradient-Based Discrete Optimization for Prompt Tuning and Discovery

640 runs
Public

cjwbw / pix2pix-zero

Zero-shot Image-to-Image Translation

6.1K runs
Public

cjwbw / dreambooth-avatar

Dreambooth finetuning of Stable Diffusion (v1.5.1) on Avatar art style by Lambda Labs

576 runs
Public

cjwbw / gta5_​artwork_​diffusion

GTA5 Artwork Diffusion via Dreambooth

4.9K runs
Public

cjwbw / magifactory-t-shirt-diffusion

Generate t-shirt logos with stable-dfffusion

182.1K runs
Public

cjwbw / distilgpt2-stable-diffusion-v2

Descriptive stable diffusion prompts generation using GPT2

603 runs
Public

cjwbw / portraitplus

Portraits with stable-diffusion

24K runs
Public

cjwbw / anything-v4.0

high-quality, highly detailed anime-style Stable Diffusion models

3.3M runs
Public

cjwbw / point-e

Point-E: A System for Generating 3D Point Clouds from Complex Prompts

8.7K runs
Public

cjwbw / anything-v3-better-vae

high-quality, highly detailed anime style stable-diffusion with better VAE

3.4M runs
Public

cjwbw / future-diffusion

Finte-tuned Stable Diffusion on high quality 3D images with a futuristic Sci-Fi theme

5.4K runs
Public

cjwbw / karlo

Text-conditional image generation model based on OpenAI's unCLIP

1.1K runs
Public

cjwbw / analog-diffusion

a dreambooth model trained on a diverse set of analog photographs

234.3K runs
Public

cjwbw / taiyi-stable-diffusion-1b-chinese-v0.1

Chinese Stable diffusion model

961 runs
Public

cjwbw / eimis_​anime_​diffusion

stable-diffusion models for high quality and detailed anime images

13.1K runs
Public

cjwbw / anything-v3.0

high-quality, highly detailed anime style stable-diffusion

353.7K runs
Public

cjwbw / whisper

with large-v2 checkpoint

53.5K runs
Public

cjwbw / stable-diffusion-img2img-v2.1

13.4K runs
Public

cjwbw / wavyfusion

dreambooth trained on a very diverse dataset ranging from photographs to paintings

3.7K runs
Public

cjwbw / altdiffusion-m9

Multilingual Stable Diffusion

618 runs
Public

cjwbw / stable-diffusion-v2

sd-v2 with diffusers, test version!

279.4K runs
Public

cjwbw / stable-diffusion-v2-inpainting

stable-diffusion-v2-inpainting

81.7K runs
Public

cjwbw / rembg

Remove images background

7.2M runs
Public

cjwbw / app_​icons_​generator

App Icons Generator V1 (DreamBooth Model)

2.2K runs
Public

cjwbw / aesthetic-predictor

A linear estimator on top of clip to predict the aesthetic quality of pictures

8.1K runs
Public

cjwbw / backgroundmatting

Real-Time High-Resolution Background Matting

2.7K runs
Public

cjwbw / sd_​pixelart_​spritesheet_​generator

generate pixel art sprite sheets from four different angles with Stable-diffusion

4.8K runs
Public

cjwbw / disco-diffusion-style

Disco Diffusion style on Stable Diffusion via Dreambooth

3.5K runs
Public

cjwbw / dreambooth-pikachu

Pikachu on Stable Diffusion via Dreambooth

519 runs
Public

cjwbw / herge-style

herge_style on Stable Diffusion via Dreambooth

2.2K runs
Public

cjwbw / van-gogh-diffusion

Van Gough on Stable Diffusion via Dreambooth

5.5K runs
Public

cjwbw / elden-ring-diffusion

fine-tuned Stable Diffusion model trained on the game art from Elden Ring

6.9K runs
Public

cjwbw / prompt-to-prompt

Prompt-to-prompt image editing with cross-attention control

1.8K runs
Public

cjwbw / stable-diffusion-v1-5

stable-diffusion with v1-5 checkpoint

35.1K runs
Public

cjwbw / stable-diffusion-aesthetic-gradients

Stable Diffusion with Aesthetic Gradients

355 runs
Public

cjwbw / waifu-diffusion

Stable Diffusion on Danbooru images

1.1M runs
Public

cjwbw / stable-diffusion

stable-diffusion with negative prompts, more scheduler

65.3K runs
Public

cjwbw / whisper-downloadable-subtitles

Added downloadable subtitles for openai/whisper

2.1K runs
Public

cjwbw / rudalle-sr

Real-ESRGAN super-resolution model from ruDALL-E

481.8K runs
Public

cjwbw / stable-diffusion-high-resolution

Detailed, higher-resolution images from Stable Diffusion

72.9K runs
Public

cjwbw / clip-vit-large-patch14

openai/clip-vit-large-patch14 with Transformers

6.1M runs
Public

cjwbw / sd-textual-inversion-ugly-sonic

stable-diffusion-textual-inversion fine-tuned with ugly sonic

2K runs
Public

cjwbw / sd-textual-inversion-spyro-dragon

stable-diffusion-textual-inversion fine-tuned with spyro of the dragon STYLE

477 runs
Public

cjwbw / docentr

End-to-End Document Image Enhancement Transformer

3.7K runs
Public

cjwbw / style-your-hair

Pose-Invariant Hairstyle Transfer

9.3K runs
Public

cjwbw / repaint

Inpainting using Denoising Diffusion Probabilistic Models

3.9K runs
Public

cjwbw / night-enhancement

Unsupervised Night Image Enhancement

41.4K runs
Public

cjwbw / latent-diffusion-text2img

text-to-image with latent diffusion

4.1K runs
Public

cjwbw / openpsg

Panoptic Scene Graph Generation

1.3K runs
Public

cjwbw / mindall-e

text-to-image generation

1.7K runs
Public

cjwbw / vq-diffusion

VQ-Diffusion for Text-to-Image Synthesis

20.7K runs
Public

cjwbw / compositional-vsual-generation-with-composable-diffusion-models-pytorch

Composable Diffusion

845 runs
Public

cjwbw / micromotion-stylegan

Decoding Micromotion in Low-dimensional Latent Spaces from StyleGAN

8.2K runs
Public

cjwbw / clip-gen

Language-Free Training of a Text-to-Image Generator with CLIP

955 runs
Public

cjwbw / bigcolor

Colorization using a Generative Color Prior for Natural Images

516.9K runs
Public

cjwbw / global_​tracking_​transformers

Global Tracking Transformers

143 runs
Public

cjwbw / vqfr

Blind Face Restoration with Vector-Quantized Dictionary and Parallel Decoder

139.6K runs
Public

cjwbw / diffae

Image Manipulatinon with Diffusion Autoencoders

16.9K runs
Public

cjwbw / face-align-cog

face alignment using stylegan-encoding

4.3K runs
Public

cjwbw / clip-guided-diffusion

Clip-Guided Diffusion Model for Image Generation

4.5K runs
Public

cjwbw / clip-guided-diffusion-pokemon

Generates pokemon sprites from prompt

4.9K runs
Public

cjwbw / chatglm-6b

bilingual language model based on General Language Model (GLM) framework

0 runs
Public

cjwbw / pixart-dmd

0 runs
Public

cjwbw / ddnm

Zero Shot Image Restoration Using Denoising Diffusion Null-Space Model

0 runs
Public

cjwbw / multilingual-stable-diffusion

0 runs
Public

cjwbw / maskgit

Masked Generative Image Transformer

1 run
Public

chenxwh / depth-anything-v2

Depth estimation with faster inference speed, fewer parameters, and higher depth accuracy.

191.9K runs
Public

cjwbw / sd-textual-inversion

Stable Diffusion Textual Inversion

484 runs
Public

cjwbw / fastcomposer

Tuning-Free Multi-Subject Image Generation with Localized Attention

34.1K runs
Public

cjwbw / oneformer

One Transformer to Rule Universal Image Segmentation

0 runs
Public

cjwbw / videocrafter2

0 runs
Public

cjwbw / styledrop

Text-to-Image Generation in Any Style

1.2K runs
Public

cjwbw / tron-legacy-diffusion

Tron Legacy Diffusion on Stable Diffusion via Dreambooth

1.5K runs
Public

cjwbw / sd-x2-latent-upscaler

Stable Diffusion x2 latent upscaler

2K runs
Public

cjwbw / chronos

0 runs
Public

cjwbw / rpg-diffusionmaster

0 runs
Public

cjwbw / pix2seq

Turning RGB pixels into semantically meaningful sequences

0 runs
Public

cjwbw / minigpt-5

0 runs
Public

cjwbw / starcoder2

0 runs
Public

cjwbw / c4ai-command-r-v01

CohereForAI c4ai-command-r-v01, Quantized model through bitsandbytes, 8-bit precision

54 runs
Public

cjwbw / idefics

Open-access reproduction of large visual language model Flamingo

855 runs
Public

cjwbw / transfer-anything

0 runs
Public

cjwbw / canary-1b

Nvidia Automatic speech-to-text recognition (ASR) in 4 languages (English, German, French, Spanish)

277 runs
Public