A text guided multi-task learning network for multimodal sentiment analysis

被引:10
作者
Luo, Yuanyi [1 ]
Wu, Rui [1 ]
Liu, Jiafeng [1 ]
Tang, Xianglong [1 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
关键词
Multimodal sentiment analysis; Representation learning; Multi-task learning; FUSION;
D O I
10.1016/j.neucom.2023.126836
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal Sentiment Analysis (MSA) is an active area of research that leverages multimodal signals for affective understanding of user-generated videos. Existing research tends to develop sophisticated fusion techniques to fuse unimodal representations into multimodal representation and treat MSA as a single prediction task. However, we find that the text modality with the pre-trained model (BERT) learn more semantic information and dominates the training in multimodal models, inhibiting the learning of other modalities. Besides, the classification ability of each modality is also suppressed by single-task learning. In this paper, We propose a text guided multi-task learning network to enhance the semantic information of non-text modalities and improve the learning ability of unimodal networks. We conducted experiments on multimodal sentiment analysis datasets, CMU-MOSI, CMU-MOSEI, and CH-SIMS. The results show that our method outperforms the current SOTA method.
引用
收藏
页数:8
相关论文
共 50 条
[21]   Sentiment Analysis and Sarcasm Detection using Deep Multi-Task Learning [J].
Tan, Yik Yang ;
Chow, Chee-Onn ;
Kanesan, Jeevan ;
Chuah, Joon Huang ;
Lim, YongLiang .
WIRELESS PERSONAL COMMUNICATIONS, 2023, 129 (03) :2213-2237
[22]   Multi-Task Learning for Sentiment Analysis with Hard-Sharing and Task Recognition Mechanisms [J].
Zhang, Jian ;
Yan, Ke ;
Mo, Yuchang .
INFORMATION, 2021, 12 (05)
[23]   Shared and Private Information Learning in Multimodal Sentiment Analysis with Deep Modal Alignment and Self-supervised Multi-Task Learning [J].
Lai, Songning ;
Li, Jiakang ;
Guo, Guinan ;
Hu, Xifeng ;
Li, Yulong ;
Tan, Yuan ;
Song, Zichen ;
Liu, Yutong ;
Ren, Zhaoxia ;
Wang, Chun ;
Miao, Danmin ;
Liu, Zhi .
2024 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN 2024, 2024,
[24]   Multi-modal Sentiment and Emotion Joint Analysis with a Deep Attentive Multi-task Learning Model [J].
Zhang, Yazhou ;
Rong, Lu ;
Li, Xiang ;
Chen, Rui .
ADVANCES IN INFORMATION RETRIEVAL, PT I, 2022, 13185 :518-532
[25]   Generative Multi-Task Learning for Text Classification [J].
Zhao, Wei ;
Gao, Hui ;
Chen, Shuhui ;
Wang, Nan .
IEEE ACCESS, 2020, 8 :86380-86387
[26]   Image-text Similarity Guided Fusion Network for Multimodal Aspect Sentiment Analysis [J].
Wei, Jiabing ;
Cao, Han .
2024 12TH INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS AND COMPUTING TECHNOLOGY, ISCTECH, 2024,
[27]   Metric-Guided Multi-task Learning [J].
Ren, Jinfu ;
Liu, Yang ;
Liu, Jiming .
FOUNDATIONS OF INTELLIGENT SYSTEMS (ISMIS 2020), 2020, 12117 :21-31
[28]   TASK AWARE MULTI-TASK LEARNING FOR SPEECH TO TEXT TASKS [J].
Indurthi, Sathish ;
Zaidi, Mohd Abbas ;
Lakumarapu, Nikhil Kumar ;
Lee, Beomseok ;
Han, Hyojung ;
Ahn, Seokchan ;
Kim, Sangha ;
Kim, Chanwoo ;
Hwang, Inchul .
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, :7723-7727
[29]   TGGS network: A multi-task learning network for gradient-guided knowledge sharing [J].
Huang, Yongjie ;
Han, Xiao ;
Chen, Man ;
Pan, Zhisong .
KNOWLEDGE-BASED SYSTEMS, 2024, 301
[30]   Network Clustering for Multi-task Learning [J].
Mu, Zhiying ;
Gao, Dehong ;
Guo, Sensen .
NEURAL PROCESSING LETTERS, 2025, 57 (01)