chenxwh | Replicate

chenxwh / ominicontrol-spatial

Minimal and Universal Control for Diffusion Transformer - demo for Spatially aligned control

113 runs

Public

chenxwh / deepseek-vl2

Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

1.2K runs

Public

chenxwh / cosyvoice2-0.5b

Scalable Streaming Speech Synthesis with Large Language Models

8K runs

Public

chenxwh / onediffusion

One Diffusion to Generate Them All

175 runs

Public

chenxwh / nova-t2i

Autoregressive Image Generation without Vector Quantization

21 runs

Public

chenxwh / nova-t2v

Autoregressive Video Generation without Vector Quantization

40 runs

Public

chenxwh / nitrofusion

High-Fidelity Single-Step Diffusion through Dynamic Adversarial Training

181 runs

Public

chenxwh / ominicontrol-subject

Minimal and Universal Control for Diffusion Transformer - demo for Subject-driven generation

2.1K runs

Public

chenxwh / ltx-video

DiT-based video generation model for generating high-quality videos in real-time

3.8K runs

Public

chenxwh / depth-any-video

Depth Any Video with Scalable Synthetic Data

359 runs

Public

chenxwh / meissonic

Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

60 runs

Public

chenxwh / hart

Efficient Visual Generation with Hybrid Autoregressive Transformer

179 runs

Public

chenxwh / ml-depth-pro

Sharp Monocular Metric Depth in Less Than a Second

16.1K runs

Public

chenxwh / lotus

Diffusion-based Visual Foundation Model for High-quality Dense Prediction

1.2K runs

Public

chenxwh / cogview3

Finer and Faster Text-to-Image Generation via Relay Diffusion

50 runs

Public

chenxwh / depthcrafter

Generating Consistent Long Depth Sequences for Open-world Videos

604 runs

Public

chenxwh / cogvlm2-video

CogVLM2: Visual Language Models for Image and Video Understanding

673.3K runs

Public

chenxwh / cogvlm2

CogVLM2: Visual Language Models for Image and Video Understanding

8.6K runs

Public

chenxwh / diffsynth-exvideo

Extended video synthesis model that generates 128 frames

205 runs

Public

chenxwh / depth-anything-v2

Depth estimation with faster inference speed, fewer parameters, and higher depth accuracy.

2.7M runs

Public

chenxwh / omost

Convert LLM's coding to image generation

1.9K runs

Public

chenxwh / sdxl-flash

Fast sdxl with higher quality

1M runs

Public

chenxwh / hunyuandit

A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

565 runs

Public

cjwbw / hyper-sdxl-1step-t2i

Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis

1.4K runs

Public

cjwbw / parler-tts

lightweight text-to-speech (TTS) model, trained on 10.5K hours of audio data

3K runs

Public

cjwbw / pixart-dmd

0 runs

Public

cjwbw / pixart-sigma

Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

7.3K runs

Public

cjwbw / voicecraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

11.1K runs

Public

cjwbw / aniportrait-audio2vid

Audio-Driven Synthesis of Photorealistic Portrait Animations

15K runs

Public

cjwbw / chronos

0 runs

Public

cjwbw / animagine-xl-3.1

Anime-themed text-to-image stable diffusion model

10.2M runs

Public

cjwbw / starcoder2-15b

Language Models for Code

278 runs

Public

cjwbw / starcoder2

0 runs

Public

cjwbw / c4ai-command-r-v01

CohereForAI c4ai-command-r-v01, Quantized model through bitsandbytes, 8-bit precision

54 runs

Public

cjwbw / tcs-sdxl-lora

Trajectory Consistency Distillation

578 runs

Public

cjwbw / transfer-anything

0 runs

Public

cjwbw / melotts

High-quality multilingual text-to-speech library

1.9K runs

Public

cjwbw / opencodeinterpreter-ds-6.7b

OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement

120 runs

Public

cjwbw / supir-v0f

Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild. This is the SUPIR-v0F model and does NOT use LLaVA-13b.

21.3K runs

Public

cjwbw / supir-v0q

Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild. This is the SUPIR-v0Q model and does NOT use LLaVA-13b.

121.7K runs

Public

cjwbw / supir

Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild. This version uses LLaVA-13b for captioning.

191.4K runs

Public

cjwbw / uform-gen2-qwen-500m

Pocket-Sized Multimodal AI For Content Understanding and Generation

413 runs

Public

cjwbw / canary-1b

Nvidia Automatic speech-to-text recognition (ASR) in 4 languages (English, German, French, Spanish)

277 runs

Public

cjwbw / lambda-eclipse

λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space

178 runs

Public

cjwbw / blipdiffusion-controlnet

Pre-trained Subject Representation for Controllable Text-to-Image Generation and Editing with ControlNet

194 runs

Public

cjwbw / blipdiffusion

Pre-trained Subject Representation for Controllable Text-to-Image Generation and Editing

356 runs

Public

cjwbw / rmgb

Background removal model developed by BRIA.AI, trained on a carefully selected dataset and is available as an open-source model for non-commercial use.

76.2K runs

Public

cjwbw / cogagent-chat

A Visual Language Model for GUI Agents

2.4K runs

Public

cjwbw / videocrafter2

0 runs

Public

cjwbw / rpg-diffusionmaster

0 runs

Public

cjwbw / depth-anything

Highly practical solution for robust monocular depth estimation by training on a combination of 1.5M labeled images and 62M+ unlabeled images

17.1K runs

Public

cjwbw / tokenflow

Consistent Diffusion Features for Consistent Video Editing

2.1K runs

Public

cjwbw / diffmorpher

Diffusion Models for Image Morphing

1.4K runs

Public

cjwbw / dreamtalk

RESEARCH/NON-COMMERCIAL USE ONLY: diffusion-based audio-driven expressive talking head generation

1.2K runs

Public

chenxwh / openvoice

Updated to OpenVoice v2: Versatile Instant Voice Cloning

91.8K runs

Public

cjwbw / faster-diffusion

Rethinking the Role of UNet Encoder in Diffusion Models

138 runs

Public

cjwbw / magicoder

LLMs with open-source code snippets for generating low-bias and high-quality instruction data for code.

365 runs

Public

cjwbw / segmind-vegart

Fast Segmind-Vega with 2-8 inference steps.

771 runs

Public

cjwbw / segmind-vega

Open-source Distilled Stable Diffusion 100% speedup

1.7K runs

Public

cjwbw / kandinskyvideo

text-to-video generation model

1.3K runs

Public

cjwbw / lavie

High-Quality Video Generation with Cascaded Latent Diffusion Models

13.9K runs

Public

cjwbw / cogvlm

powerful open-source visual language model

1.5M runs

Public

cjwbw / gorilla

Gorilla: Large Language Model Connected with Massive APIs

93 runs

Public

cjwbw / distil-whisper

Distilled version of Whisper

280 runs

Public

chenxwh / video-retalking

Audio-based Lip Synchronization for Talking Head Video

33.4K runs

Public

cjwbw / cutie

Video Object Segmentation, combined with SAM and ProPainter

410 runs

Public

cjwbw / audiosep

Separate Anything You Describe

8.6K runs

Public

cjwbw / scalecrafter

Tuning-free Higher-Resolution Visual Generation with Diffusion Models

1.1K runs

Public

cjwbw / show-1

Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

1K runs

Public

cjwbw / minigpt-5

0 runs

Public

cjwbw / daclip-uir

Controlling Vision-Language Models for Universal Image Restoration

2.2K runs

Public

cjwbw / instructcv

Instruction tuned text-to-image diffusion models as vision generalists

359 runs

Public

cjwbw / internlm-xcomposer

Advanced text-image comprehension and composition based on InternLM

164.4K runs

Public

cjwbw / idefics

Open-access reproduction of large visual language model Flamingo

855 runs

Public

cjwbw / seamless_communication

SeamlessM4T—Massively Multilingual & Multimodal Machine Translation

119.4K runs

Public

cjwbw / unival

Unified Model for Image, Video, Audio and Language Tasks

1K runs

Public

cjwbw / lorahub

Efficient Cross-Task Generalization via Dynamic LoRA Composition

87 runs

Public

cjwbw / resshift

Efficient Diffusion Model for Image Super-resolution by Residual Shifting

3.6K runs

Public

cjwbw / ledits

Real Image Editing with DDPM Inversion and Semantic Guidance

938 runs

Public

cjwbw / kandinsky-2-2-controlnet-depth

Kandinsky Image Generation with ControlNet Conditioning

3.8K runs

Public

cjwbw / styledrop

Text-to-Image Generation in Any Style

1.2K runs

Public

cjwbw / demucs

Demucs Music Source Separation

1.4M runs

Public

cjwbw / diffedit-stable-diffusion

Diffusion-based semantic image editing with mask guidance

417 runs

Public

cjwbw / wuerstchen

Efficient Pretraining of Text-to-Image Models

4.2K runs

Public

cjwbw / textdiffuser

Diffusion Models as Text Painters

2K runs

Public

cjwbw / prompt-free-diffusion

Prompt-free Diffusion

749 runs

Public

cjwbw / controlvideo

Training-free Controllable Text-to-Video Generation

2.4K runs

Public

cjwbw / fastcomposer

Tuning-Free Multi-Subject Image Generation with Localized Attention

34.1K runs

Public

cjwbw / shap-e

Generating Conditional 3D Implicit Functions

16K runs

Public

cjwbw / sadtalker

Stylized Audio-Driven Single Image Talking Face Animation

179.8K runs

Public

cjwbw / semantic-segment-anything

Adding semantic labels for segment anything

37.8K runs

Public

cjwbw / videocrafter

VideoCrafter2: Text-to-Video and Image-to-Video Generation and Editing

177.9K runs

Public

cjwbw / text2video-zero

Text-to-Image Diffusion Models are Zero-Shot Video Generators

42.1K runs

Public

cjwbw / pix2struct

Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding

6.1K runs

Public

cjwbw / chatglm-6b

bilingual language model based on General Language Model (GLM) framework

0 runs

Public

cjwbw / dolly

Fine-tuned GPT-J 6B model on the Alpaca dataset

985 runs

Public

cjwbw / stable-diffusion-2-1-unclip

Stable Diffusion v2-1-unclip Model

2.5K runs

Public

cjwbw / damo-text-to-video

Multi-stage text-to-video generation

160.7K runs

Public

cjwbw / ddnm

Zero Shot Image Restoration Using Denoising Diffusion Null-Space Model

0 runs

Public

cjwbw / unidiffuser

One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale

1.2K runs

Public

cjwbw / dreamshaper

Dream Shaper stable diffusion

1.3M runs

Public

cjwbw / zoedepth

ZoeDepth: Combining relative and metric depth

4.7M runs

Public

cjwbw / hasdx

mixed stable diffusion model

30K runs

Public

cjwbw / supermarionation

Finetuned Stable-diffusion from Gerry Anderson Supermarionation

1.9K runs

Public

cjwbw / oneformer

One Transformer to Rule Universal Image Segmentation

0 runs

Public

cjwbw / pastel-mix

high-quality highly detailed anime stylized latent diffusion model

32.1K runs

Public

cjwbw / sd-x2-latent-upscaler

Stable Diffusion x2 latent upscaler

2K runs

Public

cjwbw / t2i-adapter

Learning Adapters towards Controllable for Text-to-Image Diffusion Models

3.9K runs

Public

cjwbw / midas

Robust Monocular Depth Estimation

836.1K runs

Public

cjwbw / hard-prompts-made-easy

Gradient-Based Discrete Optimization for Prompt Tuning and Discovery

678 runs

Public

cjwbw / real-esrgan

Real-ESRGAN: Real-World Blind Super-Resolution

3.3M runs

Public

cjwbw / pix2pix-zero

Zero-shot Image-to-Image Translation

6.3K runs

Public

cjwbw / dreambooth-avatar

Dreambooth finetuning of Stable Diffusion (v1.5.1) on Avatar art style by Lambda Labs

593 runs

Public

cjwbw / gta5_artwork_diffusion

GTA5 Artwork Diffusion via Dreambooth

4.9K runs

Public

cjwbw / magifactory-t-shirt-diffusion

Generate t-shirt logos with stable-dfffusion

182.5K runs

Public

cjwbw / distilgpt2-stable-diffusion-v2

Descriptive stable diffusion prompts generation using GPT2

609 runs

Public

cjwbw / portraitplus

Portraits with stable-diffusion

24.3K runs

Public

cjwbw / anything-v4.0

high-quality, highly detailed anime-style Stable Diffusion models

3.3M runs

Public

cjwbw / anything-v3-better-vae

high-quality, highly detailed anime style stable-diffusion with better VAE

3.5M runs

Public

cjwbw / future-diffusion

Finte-tuned Stable Diffusion on high quality 3D images with a futuristic Sci-Fi theme

5.5K runs

Public

cjwbw / karlo

Text-conditional image generation model based on OpenAI's unCLIP

1.6K runs

Public

cjwbw / analog-diffusion

a dreambooth model trained on a diverse set of analog photographs

234.9K runs

Public

cjwbw / taiyi-stable-diffusion-1b-chinese-v0.1

Chinese Stable diffusion model

975 runs

Public

cjwbw / eimis_anime_diffusion

stable-diffusion models for high quality and detailed anime images

13.3K runs

Public

cjwbw / point-e

Point-E: A System for Generating 3D Point Clouds from Complex Prompts

8.8K runs

Public

cjwbw / anything-v3.0

high-quality, highly detailed anime style stable-diffusion

354.7K runs

Public

cjwbw / whisper

with large-v2 checkpoint

55K runs

Public

cjwbw / stable-diffusion-img2img-v2.1

13.6K runs

Public

cjwbw / wavyfusion

dreambooth trained on a very diverse dataset ranging from photographs to paintings

3.7K runs

Public

cjwbw / altdiffusion-m9

Multilingual Stable Diffusion

631 runs

Public

cjwbw / stable-diffusion-v2-inpainting

stable-diffusion-v2-inpainting

204.6K runs

Public

cjwbw / stable-diffusion-v2

sd-v2 with diffusers, test version!

280.8K runs

Public

cjwbw / app_icons_generator

App Icons Generator V1 (DreamBooth Model)

2.5K runs

Public

cjwbw / rembg

Remove images background

11.5M runs

Public

cjwbw / aesthetic-predictor

A linear estimator on top of clip to predict the aesthetic quality of pictures

30.4K runs

Public

cjwbw / backgroundmatting

Real-Time High-Resolution Background Matting

2.7K runs

Public

cjwbw / multilingual-stable-diffusion

0 runs

Public

cjwbw / sd_pixelart_spritesheet_generator

generate pixel art sprite sheets from four different angles with Stable-diffusion

5.2K runs

Public

cjwbw / disco-diffusion-style

Disco Diffusion style on Stable Diffusion via Dreambooth

3.6K runs

Public

cjwbw / tron-legacy-diffusion

Tron Legacy Diffusion on Stable Diffusion via Dreambooth

1.5K runs

Public

cjwbw / dreambooth-pikachu

Pikachu on Stable Diffusion via Dreambooth

525 runs

Public

cjwbw / herge-style

herge_style on Stable Diffusion via Dreambooth

2.2K runs

Public

cjwbw / van-gogh-diffusion

Van Gough on Stable Diffusion via Dreambooth

5.5K runs

Public

cjwbw / elden-ring-diffusion

fine-tuned Stable Diffusion model trained on the game art from Elden Ring

6.9K runs

Public

cjwbw / prompt-to-prompt

Prompt-to-prompt image editing with cross-attention control

2.4K runs

Public

cjwbw / stable-diffusion-v1-5

stable-diffusion with v1-5 checkpoint

35.9K runs

Public

cjwbw / stable-diffusion-aesthetic-gradients

Stable Diffusion with Aesthetic Gradients

357 runs

Public

cjwbw / whisper-downloadable-subtitles

Added downloadable subtitles for openai/whisper

2.7K runs

Public

cjwbw / clip-vit-large-patch14

openai/clip-vit-large-patch14 with Transformers

16.3M runs

Public

cjwbw / sd-textual-inversion-ugly-sonic

stable-diffusion-textual-inversion fine-tuned with ugly sonic

2K runs

Public

cjwbw / sd-textual-inversion-spyro-dragon

stable-diffusion-textual-inversion fine-tuned with spyro of the dragon STYLE

478 runs

Public

cjwbw / waifu-diffusion

Stable Diffusion on Danbooru images

1.1M runs

Public

cjwbw / sd-textual-inversion

Stable Diffusion Textual Inversion

484 runs

Public

cjwbw / stable-diffusion-high-resolution

Detailed, higher-resolution images from Stable Diffusion

73K runs

Public

cjwbw / docentr

End-to-End Document Image Enhancement Transformer

5.1K runs

Public

cjwbw / style-your-hair

Pose-Invariant Hairstyle Transfer

10.4K runs

Public

cjwbw / stable-diffusion

stable-diffusion with negative prompts, more scheduler

65.4K runs

Public

cjwbw / repaint

Inpainting using Denoising Diffusion Probabilistic Models

4.1K runs

Public

cjwbw / night-enhancement

Unsupervised Night Image Enhancement

50.6K runs

Public

cjwbw / latent-diffusion-text2img

text-to-image with latent diffusion

4.1K runs

Public

cjwbw / openpsg

Panoptic Scene Graph Generation

1.5K runs

Public

cjwbw / mindall-e

text-to-image generation

1.8K runs

Public

cjwbw / vq-diffusion

VQ-Diffusion for Text-to-Image Synthesis

20.7K runs

Public

cjwbw / pix2seq

Turning RGB pixels into semantically meaningful sequences

0 runs

Public

cjwbw / micromotion-stylegan

Decoding Micromotion in Low-dimensional Latent Spaces from StyleGAN

8.2K runs

Public

cjwbw / compositional-vsual-generation-with-composable-diffusion-models-pytorch

Composable Diffusion

849 runs

Public

cjwbw / clip-gen

Language-Free Training of a Text-to-Image Generator with CLIP

961 runs

Public

cjwbw / maskgit

Masked Generative Image Transformer

1 run

Public

cjwbw / bigcolor

Colorization using a Generative Color Prior for Natural Images

647.6K runs

Public

cjwbw / global_tracking_transformers

Global Tracking Transformers

160 runs

Public

cjwbw / vqfr

Blind Face Restoration with Vector-Quantized Dictionary and Parallel Decoder

140.6K runs

Public

cjwbw / diffae

Image Manipulatinon with Diffusion Autoencoders

17.1K runs

Public

cjwbw / face-align-cog

face alignment using stylegan-encoding

9.4K runs

Public

cjwbw / rudalle-sr

Real-ESRGAN super-resolution model from ruDALL-E

487.8K runs

Public

cjwbw / clip-guided-diffusion-pokemon

Generates pokemon sprites from prompt

4.9K runs

Public

cjwbw / clip-guided-diffusion

Clip-Guided Diffusion Model for Image Generation

4.5K runs

Public

chenxwh / ominicontrol-spatial

chenxwh / deepseek-vl2

chenxwh / cosyvoice2-0.5b

chenxwh / onediffusion

chenxwh / nova-t2i

chenxwh / nova-t2v

chenxwh / nitrofusion

chenxwh / ominicontrol-subject

chenxwh / ltx-video

chenxwh / depth-any-video

chenxwh / meissonic

chenxwh / hart

chenxwh / ml-depth-pro

chenxwh / lotus

chenxwh / cogview3

chenxwh / depthcrafter

chenxwh / cogvlm2-video

chenxwh / cogvlm2

chenxwh / diffsynth-exvideo

chenxwh / depth-anything-v2

chenxwh / omost

chenxwh / sdxl-flash

chenxwh / hunyuandit

cjwbw / hyper-sdxl-1step-t2i

cjwbw / parler-tts

cjwbw / pixart-dmd

cjwbw / pixart-sigma

cjwbw / voicecraft

cjwbw / aniportrait-audio2vid

cjwbw / chronos

cjwbw / animagine-xl-3.1

cjwbw / starcoder2-15b

cjwbw / starcoder2

cjwbw / c4ai-command-r-v01

cjwbw / tcs-sdxl-lora

cjwbw / transfer-anything

cjwbw / melotts

cjwbw / opencodeinterpreter-ds-6.7b

cjwbw / supir-v0f

cjwbw / supir-v0q

cjwbw / supir

cjwbw / uform-gen2-qwen-500m

cjwbw / canary-1b

cjwbw / lambda-eclipse

cjwbw / blipdiffusion-controlnet

cjwbw / blipdiffusion

cjwbw / rmgb

cjwbw / cogagent-chat

cjwbw / videocrafter2

cjwbw / rpg-diffusionmaster

cjwbw / depth-anything

cjwbw / tokenflow

cjwbw / diffmorpher

cjwbw / dreamtalk

chenxwh / openvoice

cjwbw / faster-diffusion

cjwbw / magicoder

cjwbw / segmind-vegart

cjwbw / segmind-vega

cjwbw / kandinskyvideo

cjwbw / lavie

cjwbw / cogvlm

cjwbw / gorilla

cjwbw / distil-whisper

chenxwh / video-retalking

cjwbw / cutie

cjwbw / audiosep

cjwbw / scalecrafter

cjwbw / show-1

cjwbw / minigpt-5

cjwbw / daclip-uir

cjwbw / instructcv

cjwbw / internlm-xcomposer

cjwbw / idefics

cjwbw / seamless_​communication

cjwbw / unival

cjwbw / lorahub

cjwbw / resshift

cjwbw / ledits

cjwbw / kandinsky-2-2-controlnet-depth

cjwbw / seamless_communication

cjwbw / gta5_artwork_diffusion

cjwbw / eimis_anime_diffusion

cjwbw / app_icons_generator

cjwbw / sd_pixelart_spritesheet_generator