adirik / flux-cinestill
Flux LoRA, use "CNSTLL" to trigger
adirik / flux-fantasy-architecture
Flux LoRA, use "in the style of FNTSYRCH" to trigger
adirik / interior-design
Realistic interior design with text and image inputs
adirik / leditsplusplus
LEdits++ for image editing
adirik / text2tex
[Non-commercial] Generate texture for 3D assets using text descriptions
adirik / prompt-to-prompt-realvisxl-3.0
Image editing with Prompt-to-Prompt for RealVisXL-v3.0
adirik / sdxl-prompt-to-prompt
Image editing with Prompt-to-Prompt for SDXL
adirik / vila-7b
[Non-commercial] A multi-image visual language model
adirik / vila-2.7b
[Non-commercial] A multi-image visual language model
adirik / realvisxl-v4.0-lightning
Photorealism with RealVisXL V4.0 Lightning
adirik / realistic-vision-v6.0
Photorealism with Realistic Vision v6.0
adirik / stylemc
Text-guided image generation and editing
adirik / gaussiandreamer
Fast text-to-3D Gaussian generation by bridging 2D and 3D diffusion models
adirik / wonder3d
Generates 3D assets from images
adirik / bunny-phi-2-siglip
Lightweight multimodal model for visual question answering, reasoning and captioning
adirik / multilingual-e5-small
Multilingual E5-small language embedding model
adirik / multilingual-e5-base
Multilingual E5-base language embedding model
adirik / multilingual-e5-large
Multilingual E5-large language embedding model
adirik / e5-mistral-7b-instruct
E5-mistral-7b-instruct language embedding model
adirik / realvisxl-v4.0
Photorealism with RealVisXL V4.0
adirik / mamba-2.8b-chat
Mamba 2.8B state space language model fine-tuned for chat
adirik / mamba-2.8b
Base version of Mamba 2.8B, a 2.8 billion parameter state space language model
adirik / mamba-130m
Base version of Mamba 130M, a 130 million parameter state space language model
adirik / mamba-370m
Base version of Mamba 370M, a 370 million parameter state space language model
adirik / mamba-790m
Base version of Mamba 790M, a 790 million parameter state space language model
adirik / mamba-2.8b-slimpj
Base version of Mamba 2.8B SlimPajama, a 2.8 billion parameter state space language model
adirik / mamba-1.4b
Base version of Mamba 1.4B, a 1.4 billion parameter state space language model
adirik / styletts2
Generates speech from text
adirik / syncdiffusion
Generate panoramic images with text prompts
adirik / realvisxl-v3.0-turbo
Photorealism with RealVisXL V3.0 Turbo based on SDXL
adirik / imagedream
Image-Prompt Multi-view Diffusion for 3D Generation
adirik / marigold
Monocular depth estimation
adirik / hierspeechpp
Zero-shot speech synthesizer for text-to-speech and voice conversion
adirik / local-prompt-mixing
Generating object-level shape variations with Stable Diffusion
adirik / masactrl-sdxl
Editable image generation with MasaCtrl-SDXL
adirik / kosmos-g
Kosmos-G: Generating Images in Context with Multimodal Large Language Models
adirik / masactrl-anything-v4-0
Edit real or generated images
adirik / masactrl-stable-diffusion-v1-4
Edit real or generated images
adirik / texture
Generate texture for your mesh with text prompts
adirik / titanet-large
Performs speaker identity verification
adirik / mvdream
Generate 3D assets using text descriptions
adirik / codet
Detects objects in an image
adirik / t2i-adapter-sdxl-lineart
Modify images using line art
adirik / t2i-adapter-sdxl-canny
Modify images using Canny edges
adirik / t2i-adapter-sdxl-sketch
Modify images using sketches
adirik / t2i-adapter-sdxl-openpose
Modify images using human pose
adirik / t2i-adapter-sdxl-depth-midas
Modify images using depth maps
adirik / grounding-dino
Detect everything with language!
adirik / deforum-kandinsky-2-2
Generate videos from text prompts with Kandinsky-2.2
adirik / owlvit-base-patch32
Zero-shot / open vocabulary object detection
adirik / inst-inpaint
Inst-Inpaint: Instructing to Remove Objects with Diffusion Models
adirik / lightweight-openpose
PyTorch version of Lightweight OpenPose as introduced in "Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose"
adirik / stylemc-old
Text-Guided Image Generation and Manipulation
adirik / seamless-expressive
Multilingual speech translation that preserves original vocal style and prosody
adirik / udop-large
Performs document image classification, document parsing and document visual question answering
adirik / dreamgaussian
DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation
adirik / nougat
Nougat: Neural Optical Understanding for Academic Documents
adirik / mvdream-multi-view
Multi-view image generation with MVDream
adirik / dat
Dual Aggregation Transformer for Image Super-Resolution
adirik / vila-13b
[Non-commercial] A multi-image visual language model
adirik / dwpose
Whole-body pose estimation