Explore

Collections

Image restoration

Models that improve or restore images by deblurring, colorization, and removing noise

tencentarc/gfpgan, jingyunliang/swinir, microsoft/bringing-old-photos-back-to-life, cjwbw/bigcolor, google-research/maxim...

Style transfer

Models that take a content image and a style reference to produce a new image

paper11667/clipstyler, huage001/adaattn, ptran1203/pytorch-animegan, sanzgiri/cartoonify_video, ariel415el/gpdm...

Super resolution

Upscaling models that create high-quality images from low-quality images

nightmareai/real-esrgan, jingyunliang/swinir, mv-lab/swin2sr, cjwbw/rudalle-sr, cjwbw/real-esrgan...

Latest models

Training-free Controllable Text-to-Video Generation

Updated 9 hours ago 21 runs

Updated 1 day, 10 hours ago 25 runs

This model can edit clothing found within an image, using a state of the art clothing segmentation algorithm.

Updated 2 days, 9 hours ago 519 runs

An instruction-tuned LLM that allows you to constrain syllable patterns

Updated 2 days, 13 hours ago 113 runs

Regression of musical arousal and valence values

Updated 2 days, 18 hours ago 3.9K runs

Classification of music approachability and engagement

Updated 2 days, 23 hours ago 1.6K runs

An EfficientNet for music style classification by 400 styles from the Discogs taxonomy

Updated 3 days ago 5.5K runs

My own personal try of Stable Diffusion

Updated 3 days, 4 hours ago 16 runs

Updated 4 days, 17 hours ago 356 runs

Transcribes any audio file (base64, url) with speaker diarization. *Please read instructions below*

Updated 4 days, 21 hours ago 878 runs

Updated 5 days, 4 hours ago 10 runs

Transformers implementation of the LLaMA language model

Updated 5 days, 10 hours ago 21.5K runs

A large language model that's been fine-tuned on ChatGPT interactions

Updated 5 days, 17 hours ago 34.7K runs

A multi-input ControlNet model. Pass in control images and set the weights.

Updated 6 days, 16 hours ago 52 runs

Generate subtitles (.srt and .vtt) from audio files using OpenAI's Whisper models.

Updated 1 week ago 63 runs

controlnet 1.1 lineart x realistic-vision-v2.0

Updated 1 week ago 10.2K runs

Generating Conditional 3D Implicit Functions

Updated 1 week, 1 day ago 3.6K runs

Tuning-Free Multi-Subject Image Generation with Localized Attention

Updated 1 week, 1 day ago 660 runs

An instruction-tuned multi-modal model based on BLIP-2 and Vicuna-13B

Updated 1 week, 1 day ago 12.4K runs

Updated 1 week, 1 day ago 164 runs

Image captioning via vision-language models with instruction tuning

Updated 1 week, 2 days ago 271 runs

Generate Pokémon from a text description

Updated 1 week, 2 days ago 6.7M runs

A model for text, audio, and image embeddings in one space

Updated 1 week, 3 days ago 125 runs

music label

Updated 1 week, 3 days ago 42 runs

image tagger

Updated 1 week, 3 days ago 211 runs

SDRV_2.0

Updated 1 week, 3 days ago 6.7K runs

A model which generates text in response to an input image and prompt.

Updated 1 week, 4 days ago 2.2K runs

ControlNet annotators - the initial image that is fed into a stable diffusion pipeline with ControlNet

Updated 1 week, 4 days ago 66 runs

Detects tents in satellite images

Updated 1 week, 4 days ago 20 runs

Generate a new image given any input text with RPG V4

Updated 1 week, 5 days ago 702 runs

Generate a new image from an input image with Edge Of Realism - EOR v2.0

Updated 1 week, 5 days ago 6.9K runs

Generate a new image from an input image with Deliberate v2

Updated 1 week, 6 days ago 1.5K runs

album cover generator

Updated 1 week, 6 days ago 582 runs

Generate a new image from an input image with Realistic Vision V2.0

Updated 1 week, 6 days ago 3.9K runs

Image Inpainting

Updated 2 weeks ago 3.1K runs

Updated 2 weeks ago 181 runs

Stylized Audio-Driven Single Image Talking Face Animation

Updated 2 weeks, 1 day ago 1.4K runs

Product advertising image generator

Updated 2 weeks, 1 day ago 26.8K runs

An instruction-tuned multimodal large language model that generates text based on user-provided prompts and images

Updated 2 weeks, 2 days ago 1.8K runs

Generate a new image given any input text with URPM v1.3

Updated 2 weeks, 3 days ago 958 runs

Generate a new image given any input text with Deliberate v2

Updated 2 weeks, 3 days ago 11.5K runs

Generate a new image given any input text with Realistic Vision V2.0

Updated 2 weeks, 3 days ago 14.6K runs

Generate a new image given any input text with Edge Of Realism - EOR v2.0

Updated 2 weeks, 3 days ago 14.8K runs

Generate a new image given any input text with Babes 2.0

Updated 2 weeks, 4 days ago 3.2K runs

A 7B parameter LLM fine-tuned to support contexts with more than 65K tokens

Updated 2 weeks, 4 days ago 4.3K runs

7B parameter base version of Stability AI's language model

Updated 2 weeks, 4 days ago 165 runs

Consistent view characters with ControlNet and Stable Diffusion fine-tuned on Ready Player Me characters based on OpenJourneyV4

Updated 2 weeks, 5 days ago 580 runs

Updated 2 weeks, 6 days ago 79 runs

3B parameter base version of Stability AI's language model

Updated 3 weeks, 4 days ago 39 runs

Object Detector Using Yolo

Updated 3 weeks, 4 days ago 204 runs