![](https://tjzk.replicate.delivery/models_models_cover_image/a7d00ce0-367f-48eb-a3a4-d46fef95a4f5/ezgif-4-735373e07f.gif)
chenxwh/diffsynth-exvideo
Extended video synthesis model that generates 128 frames
![](https://replicate.delivery/pbxt/nYA7g2mYWwrUNtoBGLAOQRL2xYif98iSxkeabvc2LBUQ2kDTA/color.png)
chenxwh/depth-anything-v2
Depth estimation with faster inference speed, fewer parameters, and higher depth accuracy.
![](https://tjzk.replicate.delivery/models_models_featured_image/6e1cf657-8b1e-4a24-82d9-6767e6172f8d/omost-cover.webp)
chenxwh/omost
Convert an LLM's coding capability to image generation
![](https://tjzk.replicate.delivery/models_models_cover_image/5b3819fa-a353-4786-ad0e-0948bcbc8ba1/out.gif)
cjwbw/sadtalker
Stylized Audio-Driven Single Image Talking Face Animation
![](https://tjzk.replicate.delivery/models_models_cover_image/e2c8c0bc-67e0-40a2-8f98-50cb1314f810/out.gif)
chenxwh/t2v-turbo
Fast and High-Quality Text-to-video Generation
![](https://tjzk.replicate.delivery/models_models_featured_image/6cda3391-63b2-4943-9ff4-6cf19618cd23/sdxl-flash.webp)
chenxwh/sdxl-flash
Fast SDXL with higher quality
![](https://replicate.delivery/pbxt/WE6QOERp6ObcIluFRWkUc5Pf1KfClmLH0h2GjM02aKvEMY3SA/out.png)
chenxwh/hunyuandit
A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
![](https://tjzk.replicate.delivery/models_models_featured_image/c9f2814f-4a3d-43c3-a4fd-28b349b67640/myshell-openvoice.png)
chenxwh/openvoice
Updated to OpenVoice v2: Versatile Instant Voice Cloning
![](https://replicate.delivery/pbxt/CFkDth5uslagGBQCeHCS72yUdsu1Ra38J4mzUPah6Zd70OXJA/out-0.webp)
cjwbw/hyper-sdxl-1step-t2i
Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis
![](https://tjzk.replicate.delivery/models_models_cover_image/4da29c01-3b96-4e68-92e5-f01e832fa5ca/teaser.png)
cjwbw/voicecraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
![](https://tjzk.replicate.delivery/models_models_cover_image/3ae06556-dc25-470d-bf0c-ebc113580264/image.png)
cjwbw/parler-tts
lightweight text-to-speech (TTS) model, trained on 10.5K hours of audio data
![](https://replicate.delivery/pbxt/zn2mGosXVdI5CJuEdsxSqvR6t2VDlxShAG9BG6OUz76yabqE/out.png)
cjwbw/pixart-sigma
Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
![](https://tjzk.replicate.delivery/models_models_cover_image/3dd0a518-0b2e-4f1d-bf8a-ee01cbb17a7f/out.gif)
cjwbw/aniportrait-audio2vid
Audio-Driven Synthesis of Photorealistic Portrait Animations
![](https://replicate.delivery/pbxt/eoW2VutuKlU6VCWExLyif2ETCw7eqbeg6c9U2ewf2Uq2cAioE/out.png)
cjwbw/animagine-xl-3.1
Anime-themed text-to-image stable diffusion model
![](https://tjzk.replicate.delivery/models_models_cover_image/c2327901-06f0-4a2e-a476-d16bcf9a6be5/110470554.png)
cjwbw/starcoder2-15b
Language Models for Code
![](https://replicate.delivery/pbxt/W6FaMCAN7lIAFBqSVBZP9GVxPhX3qV50Guri6J3ShYLIJLnE/out.png)
cjwbw/tcs-sdxl-lora
Trajectory Consistency Distillation
cjwbw/melotts
High-quality multilingual text-to-speech library
cjwbw/opencodeinterpreter-ds-6.7b
OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement
![](https://replicate.delivery/pbxt/if3rev1GNfAB6IMsqqW8CqQtVP75pXvU3dLQeV6CFkVutgmJB/out.png)
cjwbw/supir-v0f
Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild. This is the SUPIR-v0F model and does NOT use LLaVA-13b.
![](https://replicate.delivery/pbxt/gYLkKNiBcnZDD9dnPxlUR4iurpbr1QANec0VmA2kv3Ol6zMJA/out.png)
cjwbw/supir-v0q
Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild. This is the SUPIR-v0Q model and does NOT use LLaVA-13b.
![](https://replicate.delivery/pbxt/tsZTAUpHvDorAJcaZDd1BpYsBcijNfwhXfDUCAIvrqTsdnZSA/out.png)
cjwbw/supir
Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild. This version uses LLaVA-13b for captioning.
![](https://replicate.delivery/pbxt/KPrmoP0t3TNpwsHNV5TmwJjcK1xQb0Vhw2AAtu9P7x7Sca4F/cat.jpg)
cjwbw/uform-gen2-qwen-500m
Pocket-Sized Multimodal AI For Content Understanding and Generation
![](https://replicate.delivery/pbxt/hrAQiNWHRxZqAhezHKf5rSyN39iXFCgKmS711HqubX4eE3qkA/out.png)
cjwbw/lambda-eclipse
λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space
![](https://replicate.delivery/pbxt/0j242fPjvEWkHq5KMJGvLKNJECOf7sOAUHlWUrgW8V0mbUVSA/out.png)
cjwbw/blipdiffusion
Pre-trained Subject Representation for Controllable Text-to-Image Generation and Editing
![](https://replicate.delivery/pbxt/grofXNTfdclmq0Ake6s2tiWsmNUClsXPhlqUJ3h3uoAisoqkA/out.png)
cjwbw/blipdiffusion-controlnet
Pre-trained Subject Representation for Controllable Text-to-Image Generation and Editing with ControlNet
![](https://replicate.delivery/pbxt/GQHlPkwVeGVuKyaDLYfpQJCA5I0QOari8qmbcfykD9yUjOokA/out.png)
cjwbw/rmgb
Background removal model developed by BRIA.AI, trained on a carefully selected dataset and available as an open-source model for non-commercial use.
![](https://replicate.delivery/pbxt/KLeBpmZL2GjRa2c77grUodPILFbYdY8re3AzfuoBmQ3rEH29/Screenshot%202024-02-05%20at%2001.54.19.png)
cjwbw/cogagent-chat
A Visual Language Model for GUI Agents
![](https://tjzk.replicate.delivery/models_models_cover_image/87c6aa59-3e4d-4dcd-82e7-46c1fe959c39/output_1.gif)
cjwbw/videocrafter
VideoCrafter2: Text-to-Video and Image-to-Video Generation and Editing
![](https://replicate.delivery/pbxt/hvM31UrHXTahCl9JkHyT7nsfUELojjTDixAT8BJxpYOCswHJA/out.png)
cjwbw/depth-anything
Highly practical solution for robust monocular depth estimation by training on a combination of 1.5M labeled images and 62M+ unlabeled images
![](https://tjzk.replicate.delivery/models_models_featured_image/6380dbb5-532b-43be-8747-14f71efa636e/replicate-prediction-tjz3pwzbn.gif)
cjwbw/tokenflow
Consistent Diffusion Features for Consistent Video Editing
![](https://tjzk.replicate.delivery/models_models_cover_image/d72d4031-bba4-4179-903b-2aa8641cfd4e/ezgif-4-07be7bc9cd.gif)
chenxwh/video-retalking
Audio-based Lip Synchronization for Talking Head Video
![](https://replicate.delivery/pbxt/TuKGma0X5k4bFFXLit9pfdiI7ZPEErfmCxu6KvR9XAWqBkKSA/out.gif)
cjwbw/diffmorpher
Diffusion Models for Image Morphing
cjwbw/dreamtalk
RESEARCH/NON-COMMERCIAL USE ONLY: diffusion-based audio-driven expressive talking head generation
![](https://replicate.delivery/pbxt/zecbhv4ByEVwB6mCXZNbXSqzMNZBbfXpvnPEJeivnjWOMqLkA/out.png)
cjwbw/faster-diffusion
Rethinking the Role of UNet Encoder in Diffusion Models
cjwbw/magicoder
LLMs with open-source code snippets for generating low-bias and high-quality instruction data for code.
![](https://replicate.delivery/pbxt/SHKBu8AB2bY5LpxrsCgp2ne5BPpZ7gEqscegVfV2Qyw43gDkA/out.png)
cjwbw/segmind-vega
Open-source distilled Stable Diffusion with 100% speedup
![](https://replicate.delivery/pbxt/zLVEVzzBeBysWKrX6tUAomQAryHxd4wJpUApWGAmJO9CI4AJA/out.png)
cjwbw/segmind-vegart
Fast Segmind-Vega with 2-8 inference steps.
![](https://replicate.delivery/pbxt/JxpR9X9MatO10emxFW8GijURnrMAcQZ17fLJc5Xbu9zuQjwU/1.png)
cjwbw/cogvlm
powerful open-source visual language model
cjwbw/kandinskyvideo
text-to-video generation model
![](https://tjzk.replicate.delivery/models_models_cover_image/8e8cda41-b933-405d-8de0-3522457357ba/corgi.gif)
cjwbw/lavie
High-Quality Video Generation with Cascaded Latent Diffusion Models
![](https://tjzk.replicate.delivery/models_models_cover_image/45f2da1a-cb6c-4a1f-b17b-6ce2da9e4752/logo.png)
cjwbw/gorilla
Gorilla: Large Language Model Connected with Massive APIs
cjwbw/distil-whisper
Distilled version of Whisper
![](https://pbxt.replicate.delivery/mS8bqRWZPHLUFNXR37cDedB3OR0IqlqbHGaE8Ev8llZWnq5IA/sam_mask.png)
cjwbw/cutie
Video Object Segmentation, combined with SAM and ProPainter
![](https://tjzk.replicate.delivery/models_models_cover_image/3f5e47c0-06a8-4776-b79a-5e5c45d268e9/exp122_accordion_output.png)
cjwbw/audiosep
Separate Anything You Describe
![](https://replicate.delivery/pbxt/46puDNY7c9oKKVHo5haYa5zcNK31fOgmycBSGVCBnXxOGw3IA/out.png)
cjwbw/scalecrafter
Tuning-free Higher-Resolution Visual Generation with Diffusion Models
![](https://tjzk.replicate.delivery/models_models_cover_image/ca95e3b8-69b0-4211-84ed-d4d6e3d2e84b/converted_video.gif)
cjwbw/show-1
Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation
![](https://replicate.delivery/pbxt/QOvoyhpeeblElE6kkx1UPC0kF9Gyifr8e8WfnDkaeKi7anYbE/out.png)
cjwbw/daclip-uir
Controlling Vision-Language Models for Universal Image Restoration
![](https://replicate.delivery/pbxt/7B09JVo3qwrILNlXQolxuMmYzbGTRMfJmOenmIK6lvjjhdsRA/out.png)
cjwbw/instructcv
Instruction tuned text-to-image diffusion models as vision generalists
![](https://replicate.delivery/pbxt/JcqDxAZJWep7WsZdWM0gc6Ead2ie0YDEXyemc9HXogSdpsOM/out-0%20(1).png)
cjwbw/internlm-xcomposer
Advanced text-image comprehension and composition based on InternLM
![](https://replicate.delivery/pbxt/JYZEud12pT7FPFV2MtZaSTx6lEr2Z0XMpPb8JBUSYu0zeVyIA/out-1.png)
cjwbw/wuerstchen
Efficient Pretraining of Text-to-Image Models
cjwbw/seamless_communication
SeamlessM4T—Massively Multilingual & Multimodal Machine Translation
![](https://replicate.delivery/pbxt/Yd2zn7zhfM1kcSEPox8xdt9tejoaGc8nRypYBp6yJc49cGZRA/out.png)
cjwbw/unival
Unified Model for Image, Video, Audio and Language Tasks
cjwbw/lorahub
Efficient Cross-Task Generalization via Dynamic LoRA Composition
![](https://replicate.delivery/pbxt/X0SlhKOZssZfcKYch1J7IL1oEjvrui1lXRc78d2gYrFGt6qIA/out.png)
cjwbw/resshift
Efficient Diffusion Model for Image Super-resolution by Residual Shifting
![](https://replicate.delivery/pbxt/5pWfdlNkvRTrPy2RLbKCTnw24Ph3KTE2FCcDRd3hfWIdyfkiA/out.png)
cjwbw/ledits
Real Image Editing with DDPM Inversion and Semantic Guidance
![](https://replicate.delivery/pbxt/umMfazOmw1yofU5jm3fp84NDhzuz48D1UHRcAQtYdpONM3giA/out-0.png)
cjwbw/kandinsky-2-2-controlnet-depth
Kandinsky Image Generation with ControlNet Conditioning
![](https://tjzk.replicate.delivery/models_models_cover_image/38f66095-d94b-473c-b8cb-f613d722b27d/demucs.webp)
cjwbw/demucs
Demucs Music Source Separation
![](https://replicate.delivery/pbxt/uQL969AiMJ6GIdI3iO2sxFik9amSKgpC3e0Tef4O1ohvsdIiA/out.png)
cjwbw/diffedit-stable-diffusion
Diffusion-based semantic image editing with mask guidance
![](https://replicate.delivery/pbxt/H3HtX6WsVnK1MRe5SWtXTN8n64OjIRdRALEfnpVLjDTbnkCRA/out_0.png)
cjwbw/textdiffuser
Diffusion Models as Text Painters
![](https://replicate.delivery/pbxt/J3nCFqcUvg4dD92f7KoduC9hZ0qhzntY9QBTrIygypcvMfBRA/out.png)
cjwbw/prompt-free-diffusion
Prompt-free Diffusion
cjwbw/controlvideo
Training-free Controllable Text-to-Video Generation
![](https://replicate.delivery/pbxt/H0Canvu42iL0B1NCbZBeAjrruuy8PlNsgo1EDTlJpgjDZZdIA/out_%7Bi%7D.gif)
cjwbw/shap-e
Generating Conditional 3D Implicit Functions
![](https://replicate.delivery/pbxt/gGnUracxw96oBVc5faffeEv4LDsOqJFyY3vvq9S1tp3axR1DB/out_0.png)
cjwbw/fastcomposer
Tuning-Free Multi-Subject Image Generation with Localized Attention
![](https://replicate.delivery/pbxt/zttLZCfyXp1aDqS5Gd1OeDuiaPGnN722jFio5hNeXyUSeXGDB/seg_out.png)
cjwbw/semantic-segment-anything
Adding semantic labels for segment anything
cjwbw/text2video-zero
Text-to-Image Diffusion Models are Zero-Shot Video Generators
![](https://replicate.delivery/pbxt/IYTqGFhESyTxWX9AgFABgMdfStkVTkDZrNGCqhS6VCijFLgj/ai2d-demo.jpg)
cjwbw/pix2struct
Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding
cjwbw/dolly
GPT-J 6B model fine-tuned on the Alpaca dataset
![](https://replicate.delivery/pbxt/ixZq1VlmAtpcB5fGVIb8DV3rmw4RYvHO7zmfDx2QEeO0iVWhA/out-0.png)
cjwbw/stable-diffusion-2-1-unclip
Stable Diffusion v2-1-unclip Model
cjwbw/damo-text-to-video
Multi-stage text-to-video generation
![](https://replicate.delivery/pbxt/9V1JDXddn1osJp1nffLISS38MtF3BdVT46Xg5npK2cFHZAoQA/sample.png)
cjwbw/unidiffuser
One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale
![](https://replicate.delivery/pbxt/YxgdzKyEfdRiWCMfxfhAMvBYqbyP3zdufqlikRiuVcTzuwaCB/out-0.png)
cjwbw/dreamshaper
Dream Shaper stable diffusion
![](https://replicate.delivery/pbxt/Yiy3JvNLmMpkKZhuAamOPUjdFUYn5OIl0xPlu04aTfBpbMSIA/out.png)
cjwbw/zoedepth
ZoeDepth: Combining relative and metric depth
![](https://replicate.delivery/pbxt/LfGfMOJzqAlUx04dvXIfMkwCI2ofpQIJ8LzYUiiTzBf3v7iEC/out-0.png)
cjwbw/hasdx
mixed stable diffusion model
![](https://replicate.delivery/pbxt/ZeU0zaMIaEyDAqdo3eDTCFjtyo1jgBZJIiVan2ad33drNeHhA/out-0.png)
cjwbw/supermarionation
Finetuned Stable-diffusion from Gerry Anderson Supermarionation
![](https://replicate.delivery/pbxt/pk6T3cRJcv7DP10EfEnbPeEnVxkDZjC4BMBQX2h0LfuGpxAhA/out-0.png)
cjwbw/pastel-mix
high-quality, highly detailed anime-stylized latent diffusion model
![](https://replicate.delivery/pbxt/r7XfIsf3BuhfMoffJ7VVz1RWIk1PSugrJuHiycl99gfYI1fPIA/out.png)
cjwbw/real-esrgan
Real-ESRGAN: Real-World Blind Super-Resolution
![](https://replicate.delivery/pbxt/KlQ9UXTfyR3KPKw2497DmNwra9tr0FqTT7tfecr4gT6oie9BB/out_0.png)
cjwbw/t2i-adapter
Learning Adapters to Dig out More Controllable Ability for Text-to-Image Diffusion Models
![](https://replicate.delivery/pbxt/tweLmgVgBlRUOqF4JWvZI6wLlBiYaeNtDXKQGcthB8S92YegA/out.png)
cjwbw/midas
Robust Monocular Depth Estimation
![](https://replicate.delivery/pbxt/0RBhumGLhJJ4PZdiOhzYmMvF6fAza7PKVf0Eo7pKsunzOsdQA/out.png)
cjwbw/hard-prompts-made-easy
Gradient-Based Discrete Optimization for Prompt Tuning and Discovery
![](https://replicate.delivery/pbxt/NZMUartw9GapO9wfD83IDxbaQN1hyA4NspaGcBOj40O0irOIA/reconstruction.png)
cjwbw/pix2pix-zero
Zero-shot Image-to-Image Translation
![](https://replicate.delivery/pbxt/maedOFyCeapXeIGbiQmDKZxbRgRxGw1hxXkeuhUXoHnTNDwBB/out-0.png)
cjwbw/dreambooth-avatar
Dreambooth finetuning of Stable Diffusion (v1.5.1) on Avatar art style by Lambda Labs
![](https://replicate.delivery/pbxt/DCTqvDzhzX7CGlON3ClPNRgaRuPCDtwZes8OOVbDtBE3QAOIA/out-0.png)
cjwbw/gta5_artwork_diffusion
GTA5 Artwork Diffusion via Dreambooth
![](https://replicate.delivery/pbxt/URTxAtJyid4MF9TyU2fOp4w2K0oryOPNzobzu8RlZMnAejaQA/out-0.png)
cjwbw/magifactory-t-shirt-diffusion
Generate t-shirt logos with stable-diffusion
cjwbw/distilgpt2-stable-diffusion-v2
Descriptive stable diffusion prompts generation using GPT2
![](https://replicate.delivery/pbxt/nsv3z04pENLwCJFCNoKOCCpzPwmYBJX2YBFCB9eagHLNfdXQA/out-0.png)
cjwbw/portraitplus
Portraits with stable-diffusion
![](https://replicate.delivery/pbxt/qcrZfAJuW71mNyiN9MZh65WW8LkTKLQHAPtasZdE0RMXxZLIA/out-0.png)
cjwbw/anything-v4.0
high-quality, highly detailed anime-style Stable Diffusion models
![](https://replicate.delivery/pbxt/wvwb7KwCRtYJCtkMqinlgRKgfhJxLuHnBNzBlTXvezYSXITQA/out.gif)
cjwbw/point-e
Point-E: A System for Generating 3D Point Clouds from Complex Prompts
![](https://replicate.delivery/pbxt/s7CafGeN2Nle1p4Ij2cjnodU1qFVjzPYhqeYq7hD80G63vRBB/out-0.png)
cjwbw/anything-v3-better-vae
high-quality, highly detailed anime style stable-diffusion with better VAE
![](https://replicate.delivery/pbxt/brazk2O1f6xpCSqbEmc59fyLkSsB3cOaI0M2aaaHeo0gf1HBB/out-0.png)
cjwbw/future-diffusion
Fine-tuned Stable Diffusion on high quality 3D images with a futuristic Sci-Fi theme
![](https://replicate.delivery/pbxt/b8FzeU7UcQwjZitfHsNxa6WxtnEdiJfO8UdMSgZhIlyqaChgA/out-0.png)
cjwbw/karlo
Text-conditional image generation model based on OpenAI's unCLIP
![](https://replicate.delivery/pbxt/SscVjveDe9lQSkkfmoO3gXtO7K3YgU95O00unSi5GV0cPSfAB/out-0.png)
cjwbw/analog-diffusion
a dreambooth model trained on a diverse set of analog photographs
![](https://replicate.delivery/pbxt/k8hb6zP0A5IwBR0jwryeB1BEpY4m1IIvl7FppuDUh0Re3nPQA/out-0.png)
cjwbw/taiyi-stable-diffusion-1b-chinese-v0.1
Chinese Stable diffusion model
![](https://replicate.delivery/pbxt/xaFvdxJuYLomP5LZ3OOfHSnN1dMFe0D6edOEdWop4eLSRWeBC/out-0.png)
cjwbw/eimis_anime_diffusion
stable-diffusion models for high quality and detailed anime images
![](https://replicate.delivery/pbxt/QoTPURYKznofNysqYberIhfxZiCEcpRTuLBau9cRgnAIaaXgA/out-0.png)
cjwbw/anything-v3.0
high-quality, highly detailed anime style stable-diffusion
cjwbw/whisper
Whisper with large-v2 checkpoint
![](https://replicate.delivery/pbxt/Gum5FegZm90f20g6HCmQCXSU8o3sfIIsWNhjweIu4We0XiSBC/out-0.png)
cjwbw/stable-diffusion-img2img-v2.1
![](https://replicate.delivery/pbxt/X3QiWcxedlyrekButiNlJqVDdfRfcy9merjX6yfOSzAaqZACE/out-0.png)
cjwbw/wavyfusion
dreambooth trained on a very diverse dataset ranging from photographs to paintings
![](https://replicate.delivery/pbxt/po4e5mtsfZiGv0CF3z8BDgNBcBSIHZdJ7yPvuy8J0QSfKiOgA/out-0.png)
cjwbw/altdiffusion-m9
Multilingual Stable Diffusion
![](https://replicate.delivery/pbxt/Z1qXRKCfPlXuXKl2nhfb3eL3Y1TfqGylycuig9eClEOsi8aAC/out-0.png)
cjwbw/stable-diffusion-v2
sd-v2 with diffusers, test version!
![](https://replicate.delivery/pbxt/R0kqaA8PUMarFx2SoelD46bYGO8MwPUx1XiMUdq82YiUv3CIA/out-0.png)
cjwbw/stable-diffusion-v2-inpainting
stable-diffusion-v2-inpainting
![](https://replicate.delivery/pbxt/2hczaMwD9xrsIR8h3Cl8iYGbHaCdFhIOMZ0LfoYfHlKuuIBQA/out.png)
cjwbw/rembg
Remove image backgrounds
![](https://replicate.delivery/pbxt/rtuQT8WkpWZQFRKc9fWONz3ym06DjmqfKycoYC4G2BvBMcCQA/out-0.png)
cjwbw/app_icons_generator
App Icons Generator V1 (DreamBooth Model)
![](https://replicate.delivery/pbxt/HnxgWuBCi6XFLNmLkJZ4syITRhL5stYvNnAX5axgE8Uox68s/lovely-cat-as-domestic-animal-view-pictures-182393057.jpg)
cjwbw/aesthetic-predictor
A linear estimator on top of clip to predict the aesthetic quality of pictures
![](https://replicate.delivery/pbxt/cUzNjzc65YoqIhj3Kbq7qCHOgn0v3J8k8fsrFf87A7AYGDBQA/out.png)
cjwbw/backgroundmatting
Real-Time High-Resolution Background Matting
![](https://replicate.delivery/pbxt/XOtc8okRRWofCSuvakqgczhIffSdsfPvn2aHQZquDutcMmAAB/out-0.png)
cjwbw/sd_pixelart_spritesheet_generator
generate pixel art sprite sheets from four different angles with Stable-diffusion
![](https://replicate.delivery/pbxt/7hkEoMctCQr4JJltbx9fYzw2ZC1R31kcJ6yRRbuCPKSErAfPA/out-0.png)
cjwbw/disco-diffusion-style
Disco Diffusion style on Stable Diffusion via Dreambooth
![](https://replicate.delivery/pbxt/fgSWe4pUA9u6EEkHTmlY1il4eK1ZNjA9r3EQn9Y1Dk5JwB8fA/out-0.png)
cjwbw/dreambooth-pikachu
Pikachu on Stable Diffusion via Dreambooth
![](https://replicate.delivery/pbxt/t1BB2kXO4So4KlwiwkZw2e7JgZfibuxoeeFHCWNHD9K3RevfD/out-0.png)
cjwbw/herge-style
herge_style on Stable Diffusion via Dreambooth
![](https://replicate.delivery/pbxt/gVidGfayGfulGkTWWSGCCpQuA0sjkycleounGgfEoGQQr93fB/out-0.png)
cjwbw/van-gogh-diffusion
Van Gogh on Stable Diffusion via Dreambooth
![](https://replicate.delivery/pbxt/xXh8lSe8UOQMCSW4kpnOneWRGuBtXNfDFTtqfjsQ9rq87rwfB/out-0.png)
cjwbw/elden-ring-diffusion
fine-tuned Stable Diffusion model trained on the game art from Elden Ring
![](https://replicate.delivery/pbxt/eytrkNkytk1tBCf5yU8vYX9UnInSX4egAOYs76mTeegGC1WfD/out_p2p.png)
cjwbw/prompt-to-prompt
Prompt-to-prompt image editing with cross-attention control
![](https://replicate.delivery/pbxt/Dp8ewmDVzYTBRCTzWflZLPLpAxNOOh4ITj3WZYFPO217leyfA/out-0.png)
cjwbw/stable-diffusion-v1-5
stable-diffusion with v1-5 checkpoint
![](https://replicate.delivery/pbxt/gk2Y0pI9XzaeKiHzf88u86BKa9bkvaKMyTjeVlFb5LW8p6yfA/out-0.png)
cjwbw/stable-diffusion-aesthetic-gradients
Stable Diffusion with Aesthetic Gradients
![](https://tjzk.replicate.delivery/models_models_featured_image/8547be91-000a-4213-b067-2c2d0b2ae5d0/out-0-9.png)
cjwbw/waifu-diffusion
Stable Diffusion on Danbooru images
![](https://replicate.delivery/pbxt/HR9BqOj0nezdSif52eDFLFClW8AIemuiYQvxDMTp5SEkv6KAB/out-0.png)
cjwbw/stable-diffusion
stable-diffusion with negative prompts, more schedulers
cjwbw/whisper-downloadable-subtitles
Added downloadable subtitles for openai/whisper
![](https://replicate.delivery/mgxm/588170af-559f-454e-967d-8fb6c7f8304b/out.png)
cjwbw/rudalle-sr
Real-ESRGAN super-resolution model from ruDALL-E
![](https://tjzk.replicate.delivery/models_models_featured_image/bcbb62ba-8e42-4a33-af47-3d93d9bd9815/out-1.png)
cjwbw/stable-diffusion-high-resolution
Detailed, higher-resolution images from Stable Diffusion
![](https://replicate.delivery/mgxm/36b04aec-efe2-4dea-9c9d-a5faca68b2b2/000000039769.jpg)
cjwbw/clip-vit-large-patch14
openai/clip-vit-large-patch14 with Transformers
![](https://replicate.delivery/mgxm/d3c1458b-c2a7-446d-8799-9bd0bf2d6cbc/out-0.png)
cjwbw/sd-textual-inversion-ugly-sonic
stable-diffusion-textual-inversion fine-tuned with ugly sonic
![](https://replicate.delivery/mgxm/190cf37d-32d0-45fd-b5f4-3460f8c76c64/out-0.png)
cjwbw/sd-textual-inversion-spyro-dragon
stable-diffusion-textual-inversion fine-tuned with Spyro the Dragon style
![](https://replicate.delivery/mgxm/a2029c41-1402-428e-9f49-52aa92371655/out-3.png)
cjwbw/sd-textual-inversion
Stable Diffusion Textual Inversion
![](https://replicate.delivery/mgxm/218c6b06-a01f-41ea-8914-132c5c6479c6/output.png)
cjwbw/docentr
End-to-End Document Image Enhancement Transformer
![](https://replicate.delivery/mgxm/c9f54dd4-66fe-4379-b9e5-64e29fb5827d/output.png)
cjwbw/style-your-hair
Pose-Invariant Hairstyle Transfer
![](https://replicate.delivery/mgxm/136f4904-fb3b-420f-83a5-29510b02c6ce/masked_image.png)
cjwbw/repaint
Inpainting using Denoising Diffusion Probabilistic Models
![](https://replicate.delivery/mgxm/60c4c0d8-c82f-42e0-96ee-71392d32b6fe/output.png)
cjwbw/night-enhancement
Unsupervised Night Image Enhancement
![](https://replicate.delivery/mgxm/5aa05a1a-9c77-4759-a6a9-4f7a1237ff0a/output_7.png)
cjwbw/latent-diffusion-text2img
text-to-image with latent diffusion
![](https://replicate.delivery/mgxm/20987b5e-e1e0-4acd-b04d-79d08ecca3c4/output_4.png)
cjwbw/openpsg
Panoptic Scene Graph Generation
![](https://replicate.delivery/mgxm/d514f01d-7386-41d3-b330-af415d01025c/output_3.png)
cjwbw/mindall-e
text-to-image generation
![](https://replicate.delivery/mgxm/2eeb886b-a24e-470a-bdb7-1fe8b62fcfd6/output_3.png)
cjwbw/vq-diffusion
VQ-Diffusion for Text-to-Image Synthesis
![](https://replicate.delivery/mgxm/9de5cffc-da99-43d0-9391-7ef4e9d4be1e/output.png)
cjwbw/compositional-vsual-generation-with-composable-diffusion-models-pytorch
Composable Diffusion
![](https://tjzk.replicate.delivery/models_models_cover_image/2293178e-5443-422e-83e4-6d93f3f1c5cf/output.gif)
cjwbw/micromotion-stylegan
Decoding Micromotion in Low-dimensional Latent Spaces from StyleGAN
![](https://replicate.delivery/mgxm/50aee8d2-4bda-4bac-ab67-95ebc6f0a5a1/output.png)
cjwbw/clip-gen
Language-Free Training of a Text-to-Image Generator with CLIP
![](https://replicate.delivery/mgxm/7c716bfc-2b84-4192-9127-0d58e187011c/output.png)
cjwbw/bigcolor
Colorization using a Generative Color Prior for Natural Images
cjwbw/global_tracking_transformers
Global Tracking Transformers
![](https://replicate.delivery/mgxm/bc7a348f-60f5-40fb-89e4-ed3dd250cb31/output_0.png)
cjwbw/vqfr
Blind Face Restoration with Vector-Quantized Dictionary and Parallel Decoder
![](https://replicate.delivery/mgxm/a174e57b-1fdf-40b7-8132-3307a7a8d6c0/manipulated_img.png)
cjwbw/diffae
Image Manipulation with Diffusion Autoencoders
![](https://replicate.delivery/mgxm/27f20f9e-e16f-4b56-aa9c-a8599b44cd7b/output.png)
cjwbw/face-align-cog
face alignment using stylegan-encoding
![](https://replicate.delivery/mgxm/449ab8b9-a448-4e1c-8ff3-8d2f51fb3814/out.png)
cjwbw/clip-guided-diffusion
Clip-Guided Diffusion Model for Image Generation
![](https://tjzk.replicate.delivery/models_models_featured_image/2f58cde3-8345-42dc-bdd6-9a7d953c176b/Screen_Shot_2022-01-21_at_7.02.png)
cjwbw/clip-guided-diffusion-pokemon
Generates Pokémon sprites from a prompt
![](https://replicate.delivery/pbxt/u11aKdNrcaboMBROVb5bI5udrXrGZ5Fihr4I6ofr8Z0sXxmIA/out_0.png)
cjwbw/styledrop
Text-to-Image Generation in Any Style
![](https://replicate.delivery/pbxt/JZLrEi8rmF5aqLeg0iney67r2dPhCcGudYNedSLuHa0chpqk/image%20(1).png)
cjwbw/idefics
Open-access reproduction of large visual language model Flamingo
![](https://tjzk.replicate.delivery/models_models_cover_image/7359069d-973b-45b2-9a67-4a20f823c53b/PavisyFb_400x400.png)
cjwbw/c4ai-command-r-v01
CohereForAI c4ai-command-r-v01, Quantized model through bitsandbytes, 8-bit precision
![](https://replicate.delivery/pbxt/wekk2pzrvelqnEhlSt91NWUYVqmCt1maHmFgmipQyv37nYgQA/out.png)
cjwbw/sd-x2-latent-upscaler
Stable Diffusion x2 latent upscaler
cjwbw/canary-1b
Nvidia Automatic speech-to-text recognition (ASR) in 4 languages (English, German, French, Spanish)
cjwbw/videocrafter2
cjwbw/maskgit
Masked Generative Image Transformer
cjwbw/multilingual-stable-diffusion
![](https://replicate.delivery/pbxt/DCrB6rfbSCQoWaDASvzIxZe0jtfJWf19Nk4i2in5jxx0qE4fB/out-0.png)
cjwbw/tron-legacy-diffusion
Tron Legacy Diffusion on Stable Diffusion via Dreambooth
cjwbw/rpg-diffusionmaster
cjwbw/pix2seq
Turning RGB pixels into semantically meaningful sequences
cjwbw/oneformer
One Transformer to Rule Universal Image Segmentation
cjwbw/chronos
cjwbw/starcoder2
cjwbw/transfer-anything
cjwbw/pixart-dmd
cjwbw/chatglm-6b
bilingual language model based on General Language Model (GLM) framework
cjwbw/ddnm
Zero Shot Image Restoration Using Denoising Diffusion Null-Space Model
cjwbw/minigpt-5