adirik

Alara Dirik

GitHub

mvdream

Generate 3D assets using text descriptions

Updated 779 runs

dreamgaussian

DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation

Updated 6.9K runs

wonder3d

Generates 3D assets from images

Updated 1.9K runs

bunny-phi-2-siglip

Lightweight multimodal model for visual question answering, reasoning and captioning

Updated 32 runs

multilingual-e5-small

Multilingual E5-small language embedding model

Updated 1 run

multilingual-e5-base

Multilingual E5-base language embedding model

Updated 6 runs

multilingual-e5-large

Multilingual E5-large language embedding model

Updated 2 runs

e5-mistral-7b-instruct

E5-mistral-7b-instruct language embedding model

Updated 24 runs

realvisxl-v4.0

Photorealism with RealVisXL V4.0

Updated 1.8K runs

mamba-2.8b-chat

Mamba 2.8B state space language model fine-tuned for chat

Updated 49 runs

prompt-to-prompt-realvisxl-3.0

Image editing with Prompt-to-Prompt for RealVisXL-v3.0

Updated 134 runs

sdxl-prompt-to-prompt

Image editing with Prompt-to-Prompt for SDXL

Updated 90 runs

mamba-2.8b

Base version of Mamba 2.8B, a 2.8 billion parameter state space language model

Updated 125 runs

mamba-130m

Base version of Mamba 130M, a 130 million parameter state space language model

Updated 33 runs

mamba-370m

Base version of Mamba 370M, a 370 million parameter state space language model

Updated 9 runs

mamba-790m

Base version of Mamba 790M, a 790 million parameter state space language model

Updated 13 runs

mamba-2.8b-slimpj

Base version of Mamba 2.8B SlimPajama, a 2.8 billion parameter state space language model

Updated 28 runs

mamba-1.4b

Base version of Mamba 1.4B, a 1.4 billion parameter state space language model

Updated 17 runs

styletts2

Generates speech from text

Updated 80.4K runs

syncdiffusion

Generate panoramic images with text prompts

Updated 91 runs

dwpose

Whole-body pose estimation

Updated 108 runs

realvisxl-v3.0-turbo

Photorealism with RealVisXL V3.0 Turbo based on SDXL

Updated 26.8K runs

imagedream

Image-Prompt Multi-view Diffusion for 3D Generation

Updated 551 runs

dat

Dual Aggregation Transformer for Image Super-Resolution

Updated 134 runs

marigold

Monocular depth estimation

Updated 6.9K runs

hierspeechpp

Zero-shot speech synthesizer for text-to-speech and voice conversion

Updated 1.6K runs

local-prompt-mixing

Generating object-level shape variations with Stable Diffusion

Updated 66 runs

masactrl-sdxl

Editable image generation with MasaCtrl-SDXL

Updated 2.7K runs

kosmos-g

Kosmos-G: Generating Images in Context with Multimodal Large Language Models

Updated 2K runs

masactrl-anything-v4-0

Edit real or generated images

Updated 1K runs

masactrl-stable-diffusion-v1-4

Edit real or generated images

Updated 751 runs

texture

Generate texture for your mesh with text prompts

Updated 849 runs

titanet-large

Performs speaker identity verification

Updated 59 runs

codet

Detects objects in an image

Updated 1K runs

t2i-adapter-sdxl-lineart

Modify images using line art

Updated 45.4K runs

deforum-kandinsky-2-2

Generate videos from text prompts with Kandinsky-2.2

Updated 6.4K runs

mvdream-multi-view

Multi-view image generation with MVDream

Updated 430 runs

t2i-adapter-sdxl-canny

Modify images using canny edges

Updated 11.3K runs

t2i-adapter-sdxl-sketch

Modify images using sketches

Updated 9.2K runs

t2i-adapter-sdxl-openpose

Modify images using human pose

Updated 2.9K runs

t2i-adapter-sdxl-depth-midas

Modify images using depth maps

Updated 54.8K runs

grounding-dino

Detect everything with language!

Updated 30.2K runs

owlvit-base-patch32

Zero-shot / open-vocabulary object detection

Updated 8K runs

inst-inpaint

Inst-Inpaint: Instructing to Remove Objects with Diffusion Models

Updated 247 runs

nougat

Nougat: Neural Optical Understanding for Academic Documents

Updated 1.9K runs

lightweight-openpose

PyTorch version of Lightweight OpenPose as introduced in "Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose"

Updated 1.4K runs