lucataco/interactiveomni-8b
A unified omni-modal model that can simultaneously receive inputs such as images, audio, text, and video and directly generate coherent text and speech
-
- Author
-
@lucataco
- Version
- cuda12.4-python3.11-X64
6d19412f
Latest