Versions – lucataco/interactiveomni-8b | Replicate

A unified omni-modal model that can simultaneously receive inputs such as images, audio, text, and video and directly generate coherent text and speech

Public

87 runs

License

GitHub

Weights

Paper

Playground API Examples README Versions

8 months, 2 weeks ago

Author

@lucataco

Version

cuda12.4-python3.11-X64

Commit

ad40a08da114637a031125d4546de17e34892f17

6d19412f
Latest