Abstract: Multimodal Sentiment Analysis (MSA) aims to understand human emotions by integrating text, vision, and audio. In real-world scenarios, unaligned data exhibit dynamic and uneven modality ...