cjwbw/voicecraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

804 runs
Public

cjwbw/parler-tts

lightweight text-to-speech (TTS) model, trained on 10.5K hours of audio data

161 runs
Public

cjwbw/pixart-sigma

Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

409 runs
Public

cjwbw/aniportrait-audio2vid

Audio-Driven Synthesis of Photorealistic Portrait Animations

608 runs
Public

cjwbw/animagine-xl-3.1

Anime-themed text-to-image stable diffusion model

9.2K runs
Public

cjwbw/starcoder2-15b

Language Models for Code

75 runs
Public

cjwbw/tcs-sdxl-lora

Trajectory Consistency Distillation

414 runs
Public

cjwbw/melotts

High-quality multilingual text-to-speech library

265 runs
Public

cjwbw/opencodeinterpreter-ds-6.7b

OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement

89 runs
Public

cjwbw/supir-v0f

Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild. This is the SUPIR-v0F model and does NOT use LLaVA-13b.

5K runs
Public

cjwbw/supir-v0q

Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild. This is the SUPIR-v0Q model and does NOT use LLaVA-13b.

3.5K runs
Public

cjwbw/supir

Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild. This version uses LLaVA-13b for captioning.

75.2K runs
Public

cjwbw/uform-gen2-qwen-500m

Pocket-Sized Multimodal AI For Content Understanding and Generation

349 runs
Public

cjwbw/canary-1b

Nvidia Automatic speech-to-text recognition (ASR) in 4 languages (English, German, French, Spanish)

145 runs
Public

cjwbw/lambda-eclipse

λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space

149 runs
Public

cjwbw/blipdiffusion

Pre-trained Subject Representation for Controllable Text-to-Image Generation and Editing

318 runs
Public

cjwbw/blipdiffusion-controlnet

Pre-trained Subject Representation for Controllable Text-to-Image Generation and Editing with ControlNet

155 runs
Public

cjwbw/rmgb

Background removal model developed by BRIA.AI, trained on a carefully selected dataset and is available as an open-source model for non-commercial use.

1.3K runs
Public

cjwbw/cogagent-chat

A Visual Language Model for GUI Agents

1.8K runs
Public

cjwbw/videocrafter

VideoCrafter2: Text-to-Video and Image-to-Video Generation and Editing

12K runs
Public

cjwbw/depth-anything

Highly practical solution for robust monocular depth estimation by training on a combination of 1.5M labeled images and 62M+ unlabeled images

3.3K runs
Public

cjwbw/tokenflow

Consistent Diffusion Features for Consistent Video Editing

1.9K runs
Public

cjwbw/video-retalking

Audio-based Lip Synchronization for Talking Head Video

16.3K runs
Public

cjwbw/diffmorpher

Diffusion Models for Image Morphing

677 runs
Public

cjwbw/dreamtalk

RESEARCH/NON-COMMERCIAL USE ONLY: diffusion-based audio-driven expressive talking head generation

613 runs
Public

cjwbw/openvoice

NON-COMMERCIAL USE ONLY: Versatile Instant Voice Cloning

1.4K runs
Public

cjwbw/faster-diffusion

Rethinking the Role of UNet Encoder in Diffusion Models

130 runs
Public

cjwbw/magicoder

LLMs with open-source code snippets for generating low-bias and high-quality instruction data for code.

333 runs
Public

cjwbw/segmind-vega

Open-source Distilled Stable Diffusion 100% speedup

1.6K runs
Public

cjwbw/segmind-vegart

Fast Segmind-Vega with 2-8 inference steps.

754 runs
Public

cjwbw/cogvlm

powerful open-source visual language model

533.7K runs
Public

cjwbw/kandinskyvideo

text-to-video generation model

1.1K runs
Public

cjwbw/lavie

High-Quality Video Generation with Cascaded Latent Diffusion Models

12.6K runs
Public

cjwbw/gorilla

Gorilla: Large Language Model Connected with Massive APIs

84 runs
Public

cjwbw/distil-whisper

Distilled version of Whisper

243 runs
Public

cjwbw/cutie

Video Object Segmentation, combined with SAM and ProPainter

188 runs
Public

cjwbw/audiosep

Separate Anything You Describe

2.1K runs
Public

cjwbw/scalecrafter

Tuning-free Higher-Resolution Visual Generation with Diffusion Models

1K runs
Public

cjwbw/show-1

Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

897 runs
Public

cjwbw/daclip-uir

Controlling Vision-Language Models for Universal Image Restoration

1.7K runs
Public

cjwbw/instructcv

Instruction tuned text-to-image diffusion models as vision generalists

305 runs
Public

cjwbw/internlm-xcomposer

Advanced text-image comprehension and composition based on InternLM

163.8K runs
Public

cjwbw/wuerstchen

Efficient Pretraining of Text-to-Image Models

3.8K runs
Public

cjwbw/seamless_​communication

SeamlessM4T—Massively Multilingual & Multimodal Machine Translation

32.5K runs
Public

cjwbw/unival

Unified Model for Image, Video, Audio and Language Tasks

770 runs
Public

cjwbw/lorahub

Efficient Cross-Task Generalization via Dynamic LoRA Composition

77 runs
Public

cjwbw/resshift

Efficient Diffusion Model for Image Super-resolution by Residual Shifting

1.8K runs
Public

cjwbw/ledits

Real Image Editing with DDPM Inversion and Semantic Guidance

909 runs
Public

cjwbw/kandinsky-2-2-controlnet-depth

Kandinsky Image Generation with ControlNet Conditioning

3.7K runs
Public

cjwbw/demucs

Demucs Music Source Separation

97.2K runs
Public

cjwbw/diffedit-stable-diffusion

Diffusion-based semantic image editing with mask guidance

378 runs
Public

cjwbw/textdiffuser

Diffusion Models as Text Painters

1.7K runs
Public

cjwbw/prompt-free-diffusion

Prompt-free Diffusion

709 runs
Public

cjwbw/controlvideo

Training-free Controllable Text-to-Video Generation

1.9K runs
Public

cjwbw/shap-e

Generating Conditional 3D Implicit Functions

13.1K runs
Public

cjwbw/fastcomposer

Tuning-Free Multi-Subject Image Generation with Localized Attention

33.8K runs
Public

cjwbw/sadtalker

Stylized Audio-Driven Single Image Talking Face Animation

70.4K runs
Public

cjwbw/semantic-segment-anything

Adding semantic labels for segment anything

18.5K runs
Public

cjwbw/text2video-zero

Text-to-Image Diffusion Models are Zero-Shot Video Generators

40.1K runs
Public

cjwbw/pix2struct

Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding

5.9K runs
Public

cjwbw/dolly

Fine-tuned GPT-J 6B model on the Alpaca dataset

962 runs
Public

cjwbw/stable-diffusion-2-1-unclip

Stable Diffusion v2-1-unclip Model

2.1K runs
Public

cjwbw/damo-text-to-video

Multi-stage text-to-video generation

123.5K runs
Public

cjwbw/unidiffuser

One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale

1.1K runs
Public

cjwbw/dreamshaper

Dream Shaper stable diffusion

1.2M runs
Public

cjwbw/zoedepth

ZoeDepth: Combining relative and metric depth

3.3M runs
Public

cjwbw/hasdx

mixed stable diffusion model

29.5K runs
Public

cjwbw/supermarionation

Finetuned Stable-diffusion from Gerry Anderson Supermarionation

1.8K runs
Public

cjwbw/pastel-mix

high-quality highly detailed anime stylized latent diffusion model

30.7K runs
Public

cjwbw/real-esrgan

Real-ESRGAN: Real-World Blind Super-Resolution

1.4M runs
Public

cjwbw/t2i-adapter

Learning Adapters towards Controllable for Text-to-Image Diffusion Models

3.8K runs
Public

cjwbw/midas

Robust Monocular Depth Estimation

70.9K runs
Public

cjwbw/hard-prompts-made-easy

Gradient-Based Discrete Optimization for Prompt Tuning and Discovery

618 runs
Public

cjwbw/pix2pix-zero

Zero-shot Image-to-Image Translation

5.7K runs
Public

cjwbw/dreambooth-avatar

Dreambooth finetuning of Stable Diffusion (v1.5.1) on Avatar art style by Lambda Labs

566 runs
Public

cjwbw/gta5_​artwork_​diffusion

GTA5 Artwork Diffusion via Dreambooth

4.8K runs
Public

cjwbw/magifactory-t-shirt-diffusion

Generate t-shirt logos with stable-dfffusion

181.6K runs
Public

cjwbw/distilgpt2-stable-diffusion-v2

Descriptive stable diffusion prompts generation using GPT2

567 runs
Public

cjwbw/portraitplus

Portraits with stable-diffusion

23.4K runs
Public

cjwbw/anything-v4.0

high-quality, highly detailed anime-style Stable Diffusion models

3M runs
Public

cjwbw/point-e

Point-E: A System for Generating 3D Point Clouds from Complex Prompts

8.5K runs
Public

cjwbw/anything-v3-better-vae

high-quality, highly detailed anime style stable-diffusion with better VAE

3.4M runs
Public

cjwbw/future-diffusion

Finte-tuned Stable Diffusion on high quality 3D images with a futuristic Sci-Fi theme

5.3K runs
Public

cjwbw/karlo

Text-conditional image generation model based on OpenAI's unCLIP

887 runs
Public

cjwbw/analog-diffusion

a dreambooth model trained on a diverse set of analog photographs

233.9K runs
Public

cjwbw/taiyi-stable-diffusion-1b-chinese-v0.1

Chinese Stable diffusion model

941 runs
Public

cjwbw/eimis_​anime_​diffusion

stable-diffusion models for high quality and detailed anime images

12.2K runs
Public

cjwbw/anything-v3.0

high-quality, highly detailed anime style stable-diffusion

352.7K runs
Public

cjwbw/whisper

with large-v2 checkpoint

48.9K runs
Public

cjwbw/stable-diffusion-img2img-v2.1

13.2K runs
Public

cjwbw/wavyfusion

dreambooth trained on a very diverse dataset ranging from photographs to paintings

3.7K runs
Public

cjwbw/altdiffusion-m9

Multilingual Stable Diffusion

602 runs
Public

cjwbw/stable-diffusion-v2

sd-v2 with diffusers, test version!

272.8K runs
Public

cjwbw/stable-diffusion-v2-inpainting

stable-diffusion-v2-inpainting

39.9K runs
Public

cjwbw/rembg

Remove images background

5.2M runs
Public

cjwbw/app_​icons_​generator

App Icons Generator V1 (DreamBooth Model)

2.1K runs
Public

cjwbw/aesthetic-predictor

A linear estimator on top of clip to predict the aesthetic quality of pictures

8.1K runs
Public

cjwbw/backgroundmatting

Real-Time High-Resolution Background Matting

2.6K runs
Public

cjwbw/sd_​pixelart_​spritesheet_​generator

generate pixel art sprite sheets from four different angles with Stable-diffusion

4.6K runs
Public

cjwbw/disco-diffusion-style

Disco Diffusion style on Stable Diffusion via Dreambooth

3.3K runs
Public

cjwbw/dreambooth-pikachu

Pikachu on Stable Diffusion via Dreambooth

513 runs
Public

cjwbw/herge-style

herge_style on Stable Diffusion via Dreambooth

2.2K runs
Public

cjwbw/van-gogh-diffusion

Van Gough on Stable Diffusion via Dreambooth

5.4K runs
Public

cjwbw/elden-ring-diffusion

fine-tuned Stable Diffusion model trained on the game art from Elden Ring

6.9K runs
Public

cjwbw/prompt-to-prompt

Prompt-to-prompt image editing with cross-attention control

1.7K runs
Public

cjwbw/stable-diffusion-v1-5

stable-diffusion with v1-5 checkpoint

34.5K runs
Public

cjwbw/stable-diffusion-aesthetic-gradients

Stable Diffusion with Aesthetic Gradients

353 runs
Public

cjwbw/waifu-diffusion

Stable Diffusion on Danbooru images

1.1M runs
Public

cjwbw/stable-diffusion

stable-diffusion with negative prompts, more scheduler

65.3K runs
Public

cjwbw/whisper-downloadable-subtitles

Added downloadable subtitles for openai/whisper

2K runs
Public

cjwbw/rudalle-sr

Real-ESRGAN super-resolution model from ruDALL-E

462K runs
Public

cjwbw/stable-diffusion-high-resolution

Detailed, higher-resolution images from Stable Diffusion

71.9K runs
Public

cjwbw/clip-vit-large-patch14

openai/clip-vit-large-patch14 with Transformers

4.4M runs
Public

cjwbw/sd-textual-inversion-ugly-sonic

stable-diffusion-textual-inversion fine-tuned with ugly sonic

2K runs
Public

cjwbw/sd-textual-inversion-spyro-dragon

stable-diffusion-textual-inversion fine-tuned with spyro of the dragon STYLE

474 runs
Public

cjwbw/sd-textual-inversion

Stable Diffusion Textual Inversion

479 runs
Public

cjwbw/docentr

End-to-End Document Image Enhancement Transformer

2K runs
Public

cjwbw/style-your-hair

Pose-Invariant Hairstyle Transfer

8.5K runs
Public

cjwbw/repaint

Inpainting using Denoising Diffusion Probabilistic Models

3.6K runs
Public

cjwbw/night-enhancement

Unsupervised Night Image Enhancement

39.6K runs
Public

cjwbw/latent-diffusion-text2img

text-to-image with latent diffusion

4K runs
Public

cjwbw/openpsg

Panoptic Scene Graph Generation

1.1K runs
Public

cjwbw/mindall-e

text-to-image generation

1.7K runs
Public

cjwbw/vq-diffusion

VQ-Diffusion for Text-to-Image Synthesis

20.7K runs
Public

cjwbw/compositional-vsual-generation-with-composable-diffusion-models-pytorch

Composable Diffusion

844 runs
Public

cjwbw/micromotion-stylegan

Decoding Micromotion in Low-dimensional Latent Spaces from StyleGAN

7.9K runs
Public

cjwbw/clip-gen

Language-Free Training of a Text-to-Image Generator with CLIP

949 runs
Public

cjwbw/bigcolor

Colorization using a Generative Color Prior for Natural Images

412.7K runs
Public

cjwbw/global_​tracking_​transformers

Global Tracking Transformers

139 runs
Public

cjwbw/vqfr

Blind Face Restoration with Vector-Quantized Dictionary and Parallel Decoder

134.9K runs
Public

cjwbw/diffae

Image Manipulatinon with Diffusion Autoencoders

15.4K runs
Public

cjwbw/face-align-cog

face alignment using stylegan-encoding

3.7K runs
Public

cjwbw/clip-guided-diffusion

Clip-Guided Diffusion Model for Image Generation

4.5K runs
Public

cjwbw/clip-guided-diffusion-pokemon

Generates pokemon sprites from prompt

4.9K runs
Public

cjwbw/pix2seq

Turning RGB pixels into semantically meaningful sequences

0 runs
Public

cjwbw/videocrafter2

0 runs
Public

cjwbw/chronos

0 runs
Public

cjwbw/c4ai-command-r-v01

CohereForAI c4ai-command-r-v01, Quantized model through bitsandbytes, 8-bit precision

54 runs
Public

cjwbw/minigpt-5

0 runs
Public

cjwbw/starcoder2

0 runs
Public

cjwbw/tron-legacy-diffusion

Tron Legacy Diffusion on Stable Diffusion via Dreambooth

1.5K runs
Public

cjwbw/multilingual-stable-diffusion

0 runs
Public

cjwbw/sd-x2-latent-upscaler

Stable Diffusion x2 latent upscaler

2K runs
Public

cjwbw/oneformer

One Transformer to Rule Universal Image Segmentation

0 runs
Public

cjwbw/ddnm

Zero Shot Image Restoration Using Denoising Diffusion Null-Space Model

0 runs
Public

cjwbw/chatglm-6b

bilingual language model based on General Language Model (GLM) framework

0 runs
Public

cjwbw/idefics

Open-access reproduction of large visual language model Flamingo

855 runs
Public

cjwbw/transfer-anything

0 runs
Public

cjwbw/pixart-dmd

0 runs
Public

cjwbw/maskgit

Masked Generative Image Transformer

1 run
Public

cjwbw/rpg-diffusionmaster

0 runs
Public

cjwbw/styledrop

Text-to-Image Generation in Any Style

1.2K runs
Public