Multimodal Consistency-Based Teacher for Semi-Supervised Multimodal Sentiment Analysis

被引:0
作者
Yuan, Ziqi [1 ]
Fang, Jingliang [1 ,2 ]
Xu, Hua [1 ,2 ]
Gao, Kai [3 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, State Key Lab Intelligent Technol & Syst, Beijing 100084, Peoples R China
[2] Samton Jiangxi Technol Dev Co Ltd, Nanchang 330036, Peoples R China
[3] Hebei Univ Sci & Technol, Sch Informat Sci & Engn, Shijiazhuang 050018, Peoples R China
基金
中国国家自然科学基金;
关键词
Task analysis; Sentiment analysis; Visualization; Training; Speech processing; Semisupervised learning; Image classification; Consistency-based semi-supervised learning; multimodal sentiment analysis; pseudo-label filtering;
D O I
10.1109/TASLP.2024.3430543
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Multimodal sentiment analysis holds significant importance within the realm of human-computer interaction. Due to the ease of collecting unlabeled online resources compared to the high costs associated with annotation, it becomes imperative for researchers to develop semi-supervised methods that leverage unlabeled data to enhance model performance. Existing semi-supervised approaches, particularly those applied to trivial image classification tasks, are not suitable for multimodal regression tasks due to their reliance on task-specific augmentation and thresholds designed for classification tasks. To address this limitation, we propose the Multimodal Consistency-based Teacher (MC-Teacher), which incorporates consistency-based pseudo-label technique into semi-supervised multimodal sentiment analysis. In our approach, we first propose synergistic consistency assumption which focus on the consistency among bimodal representation. Building upon this assumption, we develop a learnable filter network that autonomously learns how to identify misleading instances instead of threshold-based methods. This is achieved by leveraging both the implicit discriminant consistency on unlabeled instances and the explicit guidance on constructed training data with labeled instances. Additionally, we design the self-adaptive exponential moving average strategy to decouple the student and teacher networks, utilizing a heuristic momentum coefficient. Through both quantitative and qualitative experiments on two benchmark datasets, we demonstrate the outstanding performances of the proposed MC-Teacher approach. Furthermore, detailed analysis experiments and case studies are provided for each crucial component to intuitively elucidate the inner mechanism and further validate their effectiveness.
引用
收藏
页码:3669 / 3683
页数:15
相关论文
共 50 条
  • [41] Semi-supervised distributed representations of documents for sentiment analysis
    Park, Saerom
    Lee, Jaewook
    Kim, Kyoungok
    NEURAL NETWORKS, 2019, 119 : 139 - 150
  • [42] FMixAugment for Semi-supervised Learning with Consistency Regularization
    Lin, Huibin
    Wang, Shiping
    Liu, Zhanghui
    Xiao, Shunxin
    Du, Shide
    Guo, Wenzhong
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2021, PT II, 2021, 13020 : 127 - 139
  • [43] Multimodal Sentiment Analysis With Two-Phase Multi-Task Learning
    Yang, Bo
    Wu, Lijun
    Zhu, Jinhua
    Shao, Bo
    Lin, Xiaola
    Liu, Tie-Yan
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 2015 - 2024
  • [44] Diversity and Balance: Multimodal Sentiment Analysis Using Multimodal-Prefixed and Cross-Modal Attention
    Li, Meng
    Zhu, Zhenfang
    Li, Kefeng
    Pei, Hongli
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2025, 16 (01) : 250 - 263
  • [45] MaxMatch: Semi-Supervised Learning With Worst-Case Consistency
    Jiang, Yangbangyan
    Li, Xiaodan
    Chen, Yuefeng
    He, Yuan
    Xu, Qianqian
    Yang, Zhiyong
    Cao, Xiaochun
    Huang, Qingming
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (05) : 5970 - 5987
  • [46] Trustworthy Multimodal Fusion for Sentiment Analysis in Ordinal Sentiment Space
    Xie, Zhuyang
    Yang, Yan
    Wang, Jie
    Liu, Xiaorong
    Li, Xiaofan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) : 7657 - 7670
  • [47] Semi-Supervised Neuron Segmentation via Reinforced Consistency Learning
    Huang, Wei
    Chen, Chang
    Xiong, Zhiwei
    Zhang, Yueyi
    Chen, Xuejin
    Sun, Xiaoyan
    Wu, Feng
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2022, 41 (11) : 3016 - 3028
  • [48] Fcdnet: Fuzzy Cognition-Based Dynamic Fusion Network for Multimodal Sentiment Analysis
    Liu, Shuai
    Luo, Zhe
    Fu, Weina
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2025, 33 (01) : 3 - 14
  • [49] Dual-Perspective Fusion Network for Aspect-Based Multimodal Sentiment Analysis
    Wang, Di
    Tian, Changning
    Liang, Xiao
    Zhao, Lin
    He, Lihuo
    Wang, Quan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 (4028-4038) : 4028 - 4038
  • [50] On Consistency of Graph-based Semi-supervised Learning
    Du, Chengan
    Zhao, Yunpeng
    Wang, Feng
    2019 39TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2019), 2019, : 483 - 491