Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding (Updated 5 months, 2 weeks ago)