Robust Multimodal Representation under Uncertain Missing Modalities

Authors
Lan, Guilin [1]
Du, Yeqian [1]
Yang, Zhouwang [1]
Affiliations
[1] Univ Sci & Technol China, Hefei, Peoples R China
Funding
National Key Research and Development Program of China
Keywords
Multimodal representation; Missing modalities; Multimodal sentiment analysis; Multimedia
DOI
10.1145/3702003
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Multimodal representation learning has gained significant attention across various fields, yet it faces challenges when modalities are missing in real-world applications. Existing solutions are confined to specific scenarios, such as a single missing modality or modalities missing only at test time, which restricts their applicability. To address the more general scenario of uncertain missing modalities in both training and testing, the proposed framework, Robust Multimodal Representation under Uncertain Missing Modalities (RMRU), projects each modality's representation into a shared subspace, enabling the reconstruction of any missing modality within a unified model. We propose an interaction refinement module that uses cross-modal attention to enhance these reconstructions, which is particularly beneficial when little complete-modality data is available. Furthermore, we introduce an iterative training strategy that alternately trains different modules to make effective use of both complete and incomplete modality data. Experimental results on four benchmark datasets demonstrate the superiority of RMRU over existing baselines, particularly at high missing-modality rates. Notably, RMRU can be applied to diverse scenarios regardless of the types and number of modalities.
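The shared-subspace idea in the abstract can be illustrated with a small sketch. This is not the authors' RMRU implementation: the linear projections, the dimensions, the mean-based query, and the helper names (make_projection, project, reconstruct_missing) are all assumptions made for illustration, and the attention step is a plain scaled dot-product stand-in for the paper's interaction refinement module. Each available modality is projected into a common space, and a missing modality is filled with an attention-weighted combination of the available ones.

```python
import math
import random


def make_projection(in_dim, out_dim, rng):
    """Hypothetical random linear map (out_dim x in_dim) into the shared subspace."""
    scale = 1.0 / math.sqrt(in_dim)
    return [[rng.uniform(-1.0, 1.0) * scale for _ in range(in_dim)]
            for _ in range(out_dim)]


def project(weights, x):
    """Apply the linear map to one feature vector."""
    return [sum(w * v for w, v in zip(row, x)) for row in weights]


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def reconstruct_missing(shared, missing_names):
    """Fill missing modalities from the available shared-space vectors.

    Assumption: the query is the mean of the available vectors, and the
    reconstruction is their scaled dot-product attention-weighted sum.
    """
    available = list(shared.values())
    if not available:
        raise ValueError("at least one modality must be present")
    dim = len(available[0])
    query = [sum(v[i] for v in available) / len(available) for i in range(dim)]
    scores = [sum(q * k for q, k in zip(query, v)) / math.sqrt(dim)
              for v in available]
    attn = softmax(scores)
    filled = [sum(a * v[i] for a, v in zip(attn, available)) for i in range(dim)]
    completed = dict(shared)
    for name in missing_names:
        completed[name] = list(filled)
    return completed


rng = random.Random(0)
SHARED_DIM = 4
input_dims = {"text": 6, "audio": 3, "vision": 5}  # illustrative sizes only
projections = {name: make_projection(dim, SHARED_DIM, rng)
               for name, dim in input_dims.items()}

# One sample in which the audio modality is missing.
sample = {
    "text": [0.2, -0.1, 0.4, 0.0, 0.3, -0.5],
    "vision": [0.1, 0.7, -0.2, 0.05, 0.6],
}
shared = {name: project(projections[name], x) for name, x in sample.items()}
completed = reconstruct_missing(shared, ["audio"])
print(sorted(completed))
```

In this sketch the reconstructed vector is a convex combination of the available modalities, so it always stays inside their range in the shared space. A trained model would instead learn the projections and the attention parameters, for example with the alternating training schedule the abstract mentions.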
Pages: 23
Related papers
  • [1] Robust Multimodal Sentiment Analysis via Tag Encoding of Uncertain Missing Modalities
    Zeng, Jiandian
    Zhou, Jiantao
    Liu, Tianyi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 6301 - 6314
  • [2] Tag-assisted Multimodal Sentiment Analysis under Uncertain Missing Modalities
    Zeng, Jiandian
    Liu, Tianyi
    Zhou, Jiantao
    PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022 : 1545 - 1554
  • [3] Towards Robust Multimodal Sentiment Analysis Under Uncertain Signal Missing
    Li, Mingcheng
    Yang, Dingkang
    Zhang, Lihua
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 1497 - 1501
  • [4] Chameleon: A Multimodal Learning Framework Robust to Missing Modalities
    Muhammad Irzam Liaqat
    Shah Nawaz
    Muhammad Zaigham Zaheer
    Muhammad Saad Saeed
    Hassan Sajjad
    Tom De Schepper
    Karthik Nandakumar
    Muhammad Haris Khan
    Ignazio Gallo
    Markus Schedl
    International Journal of Multimedia Information Retrieval, 2025, 14 (2)
  • [5] Similar modality completion-based multimodal sentiment analysis under uncertain missing modalities
    Sun, Yuhang
    Liu, Zhizhong
    Sheng, Quan Z.
    Chu, Dianhui
    Yu, Jian
    Sun, Hongxiang
    INFORMATION FUSION, 2024, 110
  • [6] UniMF: A Unified Multimodal Framework for Multimodal Sentiment Analysis in Missing Modalities and Unaligned Multimodal Sequences
    Huan, Ruohong
    Zhong, Guowei
    Chen, Peng
    Liang, Ronghua
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 5753 - 5768
  • [7] Modality translation-based multimodal sentiment analysis under uncertain modalities
    Liu, Zhizhong
    Zhou, Bin
    Chu, Dianhui
    Sun, Yuhang
    Meng, Lingqiang
    INFORMATION FUSION, 2024, 101
  • [8] SSLMM: Semi-Supervised Learning with Missing Modalities for Multimodal Sentiment Analysis
    Wang, Yiyu
    Jian, Haifang
    Zhuang, Jian
    Guo, Huimin
    Leng, Yan
    INFORMATION FUSION, 2025, 120
  • [9] Robust multimodal federated learning for incomplete modalities
    Yu, Songcan
    Wang, Junbo
    Hussein, Walid
    Hung, Patrick C. K.
    COMPUTER COMMUNICATIONS, 2024, 214 : 234 - 243
  • [10] MMMViT: Multiscale multimodal vision transformer for brain tumor segmentation with missing modalities
    Qiu, Chengjian
    Song, Yuqing
    Liu, Yi
    Zhu, Yan
    Han, Kai
    Sheng, Victor S.
    Liu, Zhe
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 90