lucataco/qwen2.5-omni-7b | API reference

Qwen2.5-Omni is an end-to-end multimodal model that perceives text, images, audio, and video while generating text and natural speech responses in a streaming manner.

Updated 2 months, 1 week ago

Public
2.8K runs
L40S

Run with an API