adirik/interior-design
Realistic interior design with text and image inputs
adirik/leditsplusplus
LEdits++ for image editing
adirik/text2tex
[Non-commercial] Generate texture for 3D assets using text descriptions
adirik/prompt-to-prompt-realvisxl-3.0
Image editing with Prompt-to-Prompt for RealVisXL-v3.0
adirik/sdxl-prompt-to-prompt
Image editing with Prompt-to-Prompt for SDXL
adirik/udop-large
Performs document image classification, document parsing and document visual question answering
adirik/vila-13b
[Non-commercial] A multi-image visual language model
adirik/vila-7b
[Non-commercial] A multi-image visual language model
adirik/vila-2.7b
[Non-commercial] A multi-image visual language model
adirik/seamless-expressive
Multilingual speech translation that preserves original vocal style and prosody
adirik/realvisxl-v4.0-lightning
Photorealism with RealVisXL V4.0 Lightning
adirik/realistic-vision-v6.0
Photorealism with Realistic Vision v6.0
adirik/stylemc
Text-guided image generation and editing
adirik/gaussiandreamer
Fast text-to-3D Gaussian generation by bridging 2D and 3D diffusion models
adirik/mvdream
Generate 3D assets using text descriptions
adirik/dreamgaussian
DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation
adirik/wonder3d
Generates 3D assets from images
adirik/bunny-phi-2-siglip
Lightweight multimodal model for visual question answering, reasoning and captioning
adirik/multilingual-e5-small
Multilingual E5-small language embedding model
adirik/multilingual-e5-base
Multilingual E5-base language embedding model
adirik/multilingual-e5-large
Multilingual E5-large language embedding model
adirik/e5-mistral-7b-instruct
E5-mistral-7b-instruct language embedding model
adirik/realvisxl-v4.0
Photorealism with RealVisXL V4.0
adirik/mamba-2.8b-chat
Mamba 2.8B state space language model fine-tuned for chat
adirik/mamba-2.8b
Base version of Mamba 2.8B, a 2.8 billion parameter state space language model
adirik/mamba-130m
Base version of Mamba 130M, a 130 million parameter state space language model
adirik/mamba-370m
Base version of Mamba 370M, a 370 million parameter state space language model
adirik/mamba-790m
Base version of Mamba 790M, a 790 million parameter state space language model
adirik/mamba-2.8b-slimpj
Base version of Mamba 2.8B SlimPajama, a 2.8 billion parameter state space language model
adirik/mamba-1.4b
Base version of Mamba 1.4B, a 1.4 billion parameter state space language model
adirik/styletts2
Generates speech from text
adirik/syncdiffusion
Generate panoramic images with text prompts
adirik/dwpose
Whole-body pose estimation
adirik/realvisxl-v3.0-turbo
Photorealism with RealVisXL V3.0 Turbo based on SDXL
adirik/imagedream
Image-Prompt Multi-view Diffusion for 3D Generation
adirik/marigold
Monocular depth estimation
adirik/hierspeechpp
Zero-shot speech synthesizer for text-to-speech and voice conversion
adirik/local-prompt-mixing
Generating object-level shape variations with Stable Diffusion
adirik/masactrl-sdxl
Editable image generation with MasaCtrl-SDXL
adirik/kosmos-g
Kosmos-G: Generating Images in Context with Multimodal Large Language Models
adirik/masactrl-anything-v4-0
Edit real or generated images
adirik/masactrl-stable-diffusion-v1-4
Edit real or generated images
adirik/texture
Generate texture for your mesh with text prompts
adirik/titanet-large
Performs speaker identity verification
adirik/codet
Detects objects in an image
alaradirik/t2i-adapter-sdxl-lineart
Modify images using line art
alaradirik/deforum-kandinsky-2-2
Generate videos from text prompts with Kandinsky-2.2
alaradirik/t2i-adapter-sdxl-canny
Modify images using canny edges
alaradirik/t2i-adapter-sdxl-sketch
Modify images using sketches
alaradirik/t2i-adapter-sdxl-openpose
Modify images using human pose
alaradirik/t2i-adapter-sdxl-depth-midas
Modify images using depth maps
adirik/grounding-dino
Detect everything with language!
alaradirik/owlvit-base-patch32
Zero-shot / open vocabulary object detection
alaradirik/inst-inpaint
Inst-Inpaint: Instructing to Remove Objects with Diffusion Models
alaradirik/nougat
Nougat: Neural Optical Understanding for Academic Documents
alaradirik/lightweight-openpose
PyTorch version of Lightweight OpenPose as introduced in "Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose"
adirik/stylemc-old
Text-Guided Image Generation and Manipulation
adirik/dat
Dual Aggregation Transformer for Image Super-Resolution
adirik/mvdream-multi-view
Multi-view image generation with MVDream