
adirik / flux-cinestill
Flux lora, use "CNSTLL" to trigger

adirik / flux-fantasy-architecture
Flux lora, use "in the style of FNTSYRCH" to trigger

adirik / interior-design
Realistic interior design with text and image inputs

adirik / leditsplusplus
LEdits++ for image editing

adirik / text2tex
[Non-commercial] Generate texture for 3D assets using text descriptions

adirik / prompt-to-prompt-realvisxl-3.0
Image editing with Prompt-to-Prompt for RealVisXL-v3.0

adirik / sdxl-prompt-to-prompt
Image editing with Prompt-to-Prompt for SDXL

adirik / vila-7b
[Non-commerical] A multi-image visual language model

adirik / vila-2.7b
[Non-commerical] A multi-image visual language model

adirik / realvisxl-v4.0-lightning
Photorealism with RealVisXL V4.0 Lightning

adirik / realistic-vision-v6.0
Photorealism with Realistic Vision v6.0

adirik / stylemc
Text-guided image generation and editing

adirik / gaussiandreamer
Fast text-to-3D Gaussian generation by bridging 2D and 3D diffusion models

adirik / wonder3d
Generates 3D assets from images

adirik / bunny-phi-2-siglip
Lightweight multimodal model for visual question answering, reasoning and captioning

adirik / multilingual-e5-small
Multilingual E5-small language embedding model

adirik / multilingual-e5-base
Multilingual E5-large language embedding model

adirik / multilingual-e5-large
Multilingual E5-large language embedding model

adirik / e5-mistral-7b-instruct
E5-mistral-7b-instruct language embedding model

adirik / realvisxl-v4.0
Photorealism with RealVisXL V4.0

adirik / mamba-2.8b-chat
Mamba 2.8B state space language model fine tuned for chat

adirik / mamba-2.8b
Base version of Mamba 2.8B, a 2.8 billion parameter state space language model

adirik / mamba-130m
Base version of Mamba 130M, a 130 million parameter state space language model

adirik / mamba-370m
Base version of Mamba 370M, a 370 million parameter state space language model

adirik / mamba-790m
Base version of Mamba 790M, a 790 million parameter state space language model

adirik / mamba-2.8b-slimpj
Base version of Mamba 2.8B Slim Pyjama, a 2.8 billion parameter state space language model

adirik / mamba-1.4b
Base version of Mamba 1.4B, a 1.4 billion parameter state space language model

adirik / styletts2
Generates speech from text

adirik / syncdiffusion
Generate panoramic images with text prompts

adirik / realvisxl-v3.0-turbo
Photorealism with RealVisXL V3.0 Turbo based on SDXL

adirik / imagedream
Image-Prompt Multi-view Diffusion for 3D Generation

adirik / marigold
Monocular depth estimation

adirik / hierspeechpp
Zero-shot speech synthesizer for text-to-speech and voice conversion

adirik / local-prompt-mixing
Generating object-level shape variations with Stable Diffusion

adirik / masactrl-sdxl
Editable image generation with MasaCtrl-SDXL

adirik / kosmos-g
Kosmos-G: Generating Images in Context with Multimodal Large Language Models

adirik / masactrl-anything-v4-0
Edit real or generated images

adirik / masactrl-stable-diffusion-v1-4
Edit real or generated images

adirik / texture
Generate texture for your mesh with text prompts

adirik / titanet-large
Performs speaker identity verification

adirik / mvdream
Generate 3D assets using text descriptions

adirik / codet
Detects objects in an image

adirik / t2i-adapter-sdxl-lineart
Modify images using line art

adirik / t2i-adapter-sdxl-canny
Modify images using canny edges

adirik / t2i-adapter-sdxl-sketch
Modify images using sketches

adirik / t2i-adapter-sdxl-openpose
Modify images using human pose

adirik / t2i-adapter-sdxl-depth-midas
Modify images using depth maps

adirik / grounding-dino
Detect everything with language!

adirik / deforum-kandinsky-2-2
Generate videos from text prompts with Kandinsky-2.2

adirik / owlvit-base-patch32
Zero-shot / open vocabulary object detection

adirik / inst-inpaint
Inst-Inpaint: Instructing to Remove Objects with Diffusion Models

adirik / lightweight-openpose
PyTorch version of Lightweight OpenPose as introduced in "Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose"

adirik / stylemc-old
Text-Guided Image Generation and Manipulation

adirik / nougat
Nougat: Neural Optical Understanding for Academic Documents

adirik / dat
Dual Aggregation Transformer for Image Super-Resolution

adirik / udop-large
Performs document image classification, document parsing and document visual question answering

adirik / dreamgaussian
DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation

adirik / vila-13b
[Non-commerical] A multi-image visual language model

adirik / seamless-expressive
Multilingual speech translation that preserves original vocal style and prosody

adirik / mvdream-multi-view
Multi-view image generation with MVDream

adirik / dwpose
Whole-body pose estimation