lucataco/minicpm-v-2

OpenBMB MiniCPM-V 2.8B is a strong multimodal large language model for efficient end-side deployment

43 runs
Public

lucataco/nous-hermes-2-mixtral-8x7b-dpo

Nous Hermes 2 Mixtral 8x7B DPO is a Nous Research model trained over the Mixtral 8x7B MoE LLM

32 runs
Public

lucataco/sdxs-512-0.9

sdxs-512-0.9 can generate high-resolution images in real-time based on prompt texts, trained using score distillation and feature matching

506 runs
Public

lucataco/mvsep-mdx23-music-separation

Model for Sound demixing challenge 2023: Music Demixing Track - MDX'23

274 runs
Public

lucataco/moondream2

moondream2 is a small vision language model designed to run efficiently on edge devices

6.2K runs
Public

lucataco/deepseek-vl-7b-base

DeepSeek-VL: An open-source Vision-Language Model designed for real-world vision and language understanding applications

1.4K runs
Public

lucataco/rembg-video

Remove video background

96 runs
Public

lucataco/clip-vit-base-patch32

openai/clip-vit-base-patch32

70 runs
Public

lucataco/sdxl-inpainting

SDXL Inpainting developed by the HF Diffusers team

104.6K runs
Public

lucataco/whisperspeech-small

An Open Source text-to-speech system built by inverting Whisper

1.4K runs
Public

lucataco/zeta-editing

Zero-Shot Text-Based Audio Editing Using DDPM Inversion

605 runs
Public

lucataco/differential-diffusion

Modify an image with a prompt and a depth image

223 runs
Public

lucataco/juggernaut-xl-v9

Juggernaut XL v9

193.6K runs
Public

lucataco/sdxl-lightning-multi-controlnet

SDXL lightning multi-controlnet, img2img & inpainting

712 runs
Public

lucataco/dreamshaper-xl-lightning

dreamshaper-xl-lightning is a Stable Diffusion model that has been fine-tuned on SDXL

38.2K runs
Public

lucataco/proteus-v0.4

ProteusV0.4: The Style Update

20.4K runs
Public

lucataco/animate-diff-vid2vid

AnimateDiff video to video

295 runs
Public

lucataco/depth-anything-video-sbs

POC implementation of Depth-anything to produce a 3D SBS video

141 runs
Public

lucataco/proteus-v0.4-lightning

ProteusV0.4: The Style Update - enhances stylistic capabilities, similar to Midjourney's approach, rather than advancing prompt comprehension

35K runs
Public

lucataco/rgb2grayscale-cuda

POC CUDA implementation of an rgb2grayscale function

90 runs
Public

lucataco/deep3d

Deep3D: Real-Time end-to-end 2D-to-3D Video Conversion, based on deep learning

260 runs
Public

lucataco/proteus-v0.3

ProteusV0.3: The Anime Update

14.3K runs
Public

lucataco/glpn-nyu

Global-Local Path Networks (GLPN) model trained on NYUv2 for Monocular Depth Estimation

45 runs
Public

lucataco/nomic-embed-text-v1

nomic-embed-text-v1 is an 8192-context-length text encoder that surpasses OpenAI text-embedding-ada-002 and text-embedding-3-small performance on short and long context tasks

168 runs
Public

lucataco/depth-anything-video

Depth Anything on full video files

205 runs
Public

lucataco/phixtral-2x2_8

phixtral-2x2_8 is the first Mixture of Experts (MoE) made with two microsoft/phi-2 models, inspired by the mistralai/Mixtral-8x7B-v0.1 architecture

169 runs
Public

lucataco/bge-m3

BGE-M3, the first embedding model that supports multiple retrieval modes, multilingual retrieval, and multi-granularity retrieval.

130 runs
Public

lucataco/qwen1.5-72b

Qwen1.5 is the beta version of Qwen2, a transformer-based decoder-only language model pretrained on a large amount of data

3.7K runs
Public

lucataco/qwen1.5-14b

Qwen1.5 is the beta version of Qwen2, a transformer-based decoder-only language model pretrained on a large amount of data

213 runs
Public

lucataco/qwen1.5-7b

Qwen1.5 is the beta version of Qwen2, a transformer-based decoder-only language model pretrained on a large amount of data

55 runs
Public

lucataco/qwen1.5-4b

Qwen1.5 is the beta version of Qwen2, a transformer-based decoder-only language model pretrained on a large amount of data

12 runs
Public

lucataco/qwen1.5-1.8b

Qwen1.5 is the beta version of Qwen2, a transformer-based decoder-only language model pretrained on a large amount of data

67 runs
Public

lucataco/qwen1.5-0.5b

Qwen1.5 is the beta version of Qwen2, a transformer-based decoder-only language model pretrained on a large amount of data

34 runs
Public

lucataco/olmo-7b

OLMo is a series of Open Language Models designed to enable the science of language models

69 runs
Public

lucataco/rave

RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models

177 runs
Public

lucataco/diffusionlight

DiffusionLight: Light Probes by Painting a Chrome Ball

245 runs
Public

lucataco/phi-2

Phi-2 by Microsoft

2.2K runs
Public

lucataco/img-and-audio2video

Take an image and an audio file and create a video clip

575 runs
Public

lucataco/watermark_detector

amrul-hzz's fine-tuned version of vit-base-patch16-224-in21k for watermark image detection

131 runs
Public

lucataco/moondream1

(Research only) Moondream1 is a vision language model that performs on par with models twice its size

9.4K runs
Public

lucataco/proteus-v0.2

Proteus v0.2 shows subtle yet significant improvements over Version 0.1. It demonstrates enhanced prompt understanding that surpasses MJ6, while also approaching its stylistic capabilities.

400.1K runs
Public

lucataco/siglip

SigLIP proposes to replace the loss function used in CLIP by a simple pairwise sigmoid loss

107 runs
Public

lucataco/wizardcoder-33b-v1.1-gguf

WizardCoder: Empowering Code Large Language Models with Evol-Instruct

105 runs
Public

lucataco/magnet

MAGNeT: Masked Audio Generation using a Single Non-Autoregressive Transformer

995 runs
Public

lucataco/proteus-v0.1

ProteusV0.1 uses OpenDalleV1.1 as a base and further refines prompt adherence and stylistic capabilities to a measurable degree

6.6K runs
Public

lucataco/pheme

Pheme generates a variety of conversational voices in 16 kHz for phone-call applications

354 runs
Public

lucataco/pasd-magnify

(Academic and Non-commercial use only) Pixel-Aware Stable Diffusion for Realistic Image Super-resolution and Personalized Stylization

25.9K runs
Public

lucataco/sdxl-deepcache

SDXL using DeepCache

3K runs
Public

lucataco/tinyllama-1.1b-chat-v1.0

This is the chat model finetuned on top of TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T

249 runs
Public

lucataco/open-dalle-v1.1

A unique fusion that showcases exceptional prompt adherence and semantic understanding; it seems to be a step above base SDXL and a step closer to DALLE-3 in terms of prompt comprehension

83.3K runs
Public

lucataco/diffusion-motion-transfer

Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer

159 runs
Public

lucataco/singing_voice_conversion

Amphion Singing Voice Conversion: DiffWaveNetSVC

524 runs
Public

lucataco/ip-adapter-faceid

(Research only) IP-Adapter-FaceID can generate various style images conditioned on a face with only text prompts

23.7K runs
Public

lucataco/dreamshaper-xl-turbo

DreamShaper is a general-purpose SD model that aims at doing everything well: photos, art, anime, manga. It's designed to match Midjourney and DALL-E.

113.5K runs
Public

lucataco/dpo-sdxl

Direct Preference Optimization (DPO) is a method to align text-to-image diffusion models to human preferences by directly optimizing on human comparison data

2.2K runs
Public

lucataco/seamless_communication

FacebookResearch/SeamlessM4T v2 - Massively Multilingual & Multimodal Machine Translation

633 runs
Public

lucataco/stable-diffusion-x4-upscaler

Stable Diffusion x4 upscaler model

5.3K runs
Public

lucataco/resemble-enhance

AI-driven audio enhancement for your audio files, powered by Resemble AI

854 runs
Public

lucataco/segmind-vega

Segmind-Vega Model is a distilled version of SDXL, offering a 70% reduction in size and a 100% speedup

642 runs
Public

lucataco/style-aligned

GoogleAI: Style Aligned Image Generation via Shared Attention

1.1K runs
Public

lucataco/sdxl-img-blend

SDXL Image Blending

41.7K runs
Public

lucataco/demofusion-enhance

Image to Image enhancer using DemoFusion

8.6K runs
Public

lucataco/vid2openpose

Video to OpenPose

1.3K runs
Public

lucataco/magic-animate-openpose

MagicAnimate using an OpenPose input video

1.8K runs
Public

lucataco/playground-v2

Playground v2 is a diffusion-based text-to-image generative model trained from scratch. Try out all 3 models here

2.9K runs
Public

lucataco/vid2densepose

Convert your videos to DensePose and use it with MagicAnimate

3.1K runs
Public

lucataco/magic-animate

MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

28.7K runs
Public

lucataco/cross-image-attention

Given two images depicting a source structure and a target appearance, generate an image merging the structure of one image with the appearance of the other

356 runs
Public

lucataco/pixart-xl-2

PixArt-Alpha 1024px is a transformer-based text-to-image diffusion system trained on text embeddings from T5

41K runs
Public

lucataco/pixart-lcm-xl-2

PixArt-Alpha LCM is a transformer-based text-to-image diffusion system trained on text embeddings from T5

3K runs
Public

lucataco/demofusion

DemoFusion: Democratising High-Resolution Image Generation With No 💰

8.1K runs
Public

lucataco/interpany-clearer

InterpAny-Clearer: Clearer anytime frame interpolation & Manipulated interpolation

9.1K runs
Public

lucataco/xtts-v2

Coqui XTTS-v2: Multilingual Text To Speech Voice Cloning

107.6K runs
Public

lucataco/controlnet-tile

Controlnet v1.1 - Tile Version

3.5K runs
Public

lucataco/real-esrgan-video

Real-ESRGAN Video Upscaler

25.2K runs
Public

lucataco/seine

Image-to-video - SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction

3.7K runs
Public

lucataco/nsfw_image_detection

Falcons.ai Fine-Tuned Vision Transformer (ViT) for NSFW Image Classification

1.4M runs
Public

lucataco/animate-diff-sdxl-lcm

Animate Your Personalized Text-to-Image Diffusion Models with SDXL and LCM

289 runs
Public

lucataco/vseq2vseq

Text to video diffusion model with variable length frame conditioning for infinite length video

378 runs
Public

lucataco/dreamshaper7-img2img-lcm

Dreamshaper-7 img2img with LCM LoRA for faster inference

19.2K runs
Public

lucataco/realvisxl2-lcm

RealvisXL-v2.0 with LCM LoRA - requires fewer steps (4 to 8 instead of the original 40 to 50)

246.2K runs
Public

lucataco/modelscope-facefusion

Auto fuse a user's face onto the template image, with a similar appearance to the user

4.7K runs
Public

lucataco/ip_adapter-face-inpaint

A combination of ip_adapter SDv1.5 and mediapipe-face to inpaint a face

1.8K runs
Public

lucataco/sdxl-niji-se

SDXL_Niji_Special Edition

32.1K runs
Public

lucataco/sdxl-lcm-zeke

A fine-tuned SDXL-LCM LoRA based on the photos of Zeke

420 runs
Public

lucataco/sdxl-lcm

Latent Consistency Model (LCM): SDXL, distills the original model into a version that requires fewer steps (4 to 8 instead of the original 25 to 50)

363.9K runs
Public

lucataco/ip_adapter-sdxl-face

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate SDXL images with an image prompt

23.5K runs
Public

lucataco/sdxl-lcm-loras

POC of SDXL-LCM LoRA combined with Replicate LoRA for 4 second inference times

345 runs
Public

lucataco/lcm-ssd-1b

Latent Consistency Model (LCM): SSD-1B is an LCM-distilled version that reduces the number of inference steps needed to only 2 to 8

1.6K runs
Public

lucataco/ip_adapter-face

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate SDv1.5 images with an image prompt

1.3K runs
Public

lucataco/realvisxl-v2.0

Implementation of SDXL RealVisXL_V2.0

242.1K runs
Public

lucataco/realvisxl2-lora-inference

POC to run inference on Realvisxl2 LoRAs

2.3K runs
Public

lucataco/realvisxl2-lora-training

POC to train Realvisxl2 LoRAs

261 runs
Public

lucataco/ssd-1b

Segmind Stable Diffusion Model (SSD-1B) is a distilled 50% smaller version of SDXL, offering a 60% speedup while maintaining high-quality text-to-image generation capabilities

891.8K runs
Public

lucataco/ssd-lora-inference

POC to run inference on SSD-1B LoRAs

2.4K runs
Public

lucataco/ssd-lora-training

POC to train SSD-1B LoRAs for cheaper & faster training

218 runs
Public

lucataco/ssd-1b-txt2img_batch

Batch mode for Segmind Stable Diffusion Model (SSD-1B) txt2img

1.2K runs
Public

lucataco/realvisxl-v2-img2img

Implementation of SDXL RealVisXL_V2.0 img2img

6.7K runs
Public

lucataco/thinkdiffusionxl

ThinkDiffusionXL is a go-to model capable of amazing photorealism that's also versatile enough to generate high-quality images across a variety of styles and subjects without needing to be a prompting genius

12.5K runs
Public

lucataco/kosmos-2

Grounding Multimodal Large Language Models to the World

1.6K runs
Public

lucataco/ssd-1b-img2img

Segmind Stable Diffusion Model (SSD-1B) img2img

3K runs
Public

lucataco/sdxl

SDXL v1.0 - A text-to-image generative AI model that creates beautiful images

306.5K runs
Public

lucataco/realvisxl-v1-img2img

Implementation of SDXL RealVisXL_V1.0 img2img

3.2K runs
Public

lucataco/dolphin-2.2.1-mistral-7b

Mistral-7B-v0.1 fine-tuned for chat with the Dolphin dataset (an open-source implementation of Microsoft's Orca)

27K runs
Public

lucataco/dolphin-2.1-mistral-7b

Mistral-7B-v0.1 fine-tuned for chat with the Dolphin dataset (an open-source implementation of Microsoft's Orca)

12.1K runs
Public

lucataco/mistrallite

MistralLite is a fine-tuned Mistral-7B-v0.1 language model, with enhanced capabilities for processing long context (up to 32K tokens)

620 runs
Public

lucataco/bakllava

BakLLaVA-1 is a Mistral 7B base augmented with the LLaVA 1.5 architecture

38.4K runs
Public

lucataco/hotshot-xl

😊 Hotshot-XL is an AI text-to-GIF model trained to work alongside Stable Diffusion XL

43.7K runs
Public

lucataco/fuyu-8b

Fuyu-8B is a multi-modal text and image transformer trained by Adept AI

4.1K runs
Public

lucataco/video-crafter

Open diffusion model for high-quality video generation

8.9K runs
Public

lucataco/qwen-vl-chat

A multimodal LLM-based AI assistant, which is trained with alignment techniques. Qwen-VL-Chat supports more flexible interaction, such as multi-round question answering, and creative capabilities.

179.3K runs
Public

lucataco/comfyui-sdxl-txt2img

Using a ComfyUI workflow to run SDXL text2img

415 runs
Public

lucataco/sadtalker

Stylized Audio-Driven Single Image Talking Face Animation

10.3K runs
Public

lucataco/sdxl-controlnet

SDXL ControlNet - Canny

905.4K runs
Public

lucataco/mistral-7b-v0.1

Mistral-7B-v0.1 is a pretrained generative text model that outperforms Llama 2 13B on all benchmarks

2.1K runs
Public

lucataco/animate-diff

Animate Your Personalized Text-to-Image Diffusion Models

185K runs
Public

lucataco/illusion-diffusion-hq

Monster Labs QrCode ControlNet on top of SD Realistic Vision v5.1

295.5K runs
Public

lucataco/remove-bg

Remove background from an image

929.1K runs
Public

lucataco/realvisxl-v1.0

Implementation of SDXL RealVisXL_V1.0

43.4K runs
Public

lucataco/sdxl-controlnet-depth

SDXL ControlNet - Depth

29.5K runs
Public

lucataco/clip-interrogator

CLIP Interrogator (for faster inference)

101.3K runs
Public

lucataco/sdxl-panoramic

360 Panorama SDXL image with inpainted wrapping seam

5.7K runs
Public

lucataco/codeformer

Robust face restoration algorithm for old photos/AI-generated faces - (A40 GPU)

257.8K runs
Public

lucataco/blueprint

An SDXL fine-tune based on blueprints

196 runs
Public

lucataco/ms-img2vid

Turn any image into a video

1.2M runs
Public

lucataco/wizardcoder-python-34b-v1.0

Empowering Code Large Language Models with Evol-Instruct

851 runs
Public

lucataco/realistic-vision-v5-openpose

Realistic Vision V5 with OpenPose

5K runs
Public

lucataco/spider-gwen-style

SDXL fine tune on Spider-Gwen style

166 runs
Public

lucataco/realistic-vision-v5

Realistic Vision v5.0 with VAE

9.5K runs
Public

lucataco/sdxl-controlnet-openpose

SDXL ControlNet - OpenPose

21.1K runs
Public

lucataco/realistic-vision-v5-inpainting

Realistic Vision v5.0 Inpainting

24.9K runs
Public

lucataco/realistic-vision-v5-img2img

Realistic Vision v5.0 Image 2 Image

124K runs
Public

lucataco/realistic-vision-v5.1

Implementation of Realistic Vision v5.1 with VAE

346K runs
Public

lucataco/sdxl-clip-interrogator

CLIP Interrogator for SDXL optimizes text prompts to match a given image

821.7K runs
Public

lucataco/upstage-llama-2-70b-instruct-v2

Upstage/Llama-2-70B-instruct-v2 - GPTQ

1.6K runs
Public

lucataco/glaive-function-calling-v1

2.7B param open source chat model trained on Glaive’s synthetic data generation platform

184 runs
Public

lucataco/gfpgan

Practical face restoration algorithm for old photos or AI-generated faces (for larger images)

106.4K runs
Public

lucataco/freewilly2

Stability AI's FreeWilly2

309 runs
Public

lucataco/llama-2-70b-chat

Meta's Llama 2 70b Chat - GPTQ

652 runs
Public

lucataco/llama-2-13b-chat

Meta's Llama 2 13b Chat - GPTQ

17.9K runs
Public

lucataco/llama-2-7b-chat

Meta's Llama 2 7b Chat - GPTQ

20.1K runs
Public

lucataco/speaker-diarization

Segments an audio recording based on who is speaking (on A100)

8.1K runs
Public

lucataco/rivers-stable-diffusion-upscaler

RiversHaveWings Stable Diffusion Upscaler

405 runs
Public

lucataco/real-esrgan

Real-ESRGAN with optional face correction and adjustable upscale (for larger images)

22K runs
Public

lucataco/wsrglow

A working wsrglow model

248 runs
Public

lucataco/stable-diffusion-image-variation

Image Variations with Stable Diffusion

501 runs
Public

lucataco/realistic-vision-v4.0

Realistic Vision V4.0

52.4K runs
Public

lucataco/realistic-vision-v3.0

Realistic Vision V3.0 with VAE

4.7K runs
Public

lucataco/instruct-glaive

sahil2801/replit-code-instruct-glaive

359 runs
Public

lucataco/xgen-7b-8k-base

Salesforce/xgen-7b-8k-base

106 runs
Public

lucataco/vicuna-13b-v1.3

lmsys/vicuna-13b-v1.3

6.9K runs
Public

lucataco/vicuna-7b-v1.3

lmsys/vicuna-7b-v1.3

11.1K runs
Public

lucataco/codegen2-1b

Salesforce/codegen2-1B

580 runs
Public

lucataco/tiny-starcoder-py

bigcode/tiny_starcoder_py

71 runs
Public

lucataco/wizardcoder-15b-v1.0

WizardLM/WizardCoder-15B-V1.0

2.9K runs
Public

lucataco/shiba-diffusion

Shiba stable diffusion model

677 runs
Public

lucataco/idefics-80b

IDEFICS 80b Quantized

220 runs
Public

lucataco/wizardcoder-15b-v1

WizardLM/WizardCoder-15B-V1.0 in 4bit

459 runs
Public

lucataco/replit-code-v1-3b

replit/replit-code-v1-3b

124 runs
Public

lucataco/idefics-9b

IDEFICS 9b Quantized

2.1K runs
Public

lucataco/mpt-30b-chat

mosaicml/mpt-30b-chat in 8bit

2.9K runs
Public

lucataco/vicuna-33b-v1.3

lmsys/vicuna-33b-v1.3

109.7K runs
Public

lucataco/phi-1.5

microsoft/phi-1.5 was trained using the same data sources as phi-1, augmented with a new data source that consists of various NLP synthetic texts

730 runs
Public

lucataco/instant-id-lcm

InstantID with LCM

662 runs
Public