A text guided multi-task learning network for multimodal sentiment analysis

被引:10
作者
Luo, Yuanyi [1 ]
Wu, Rui [1 ]
Liu, Jiafeng [1 ]
Tang, Xianglong [1 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
关键词
Multimodal sentiment analysis; Representation learning; Multi-task learning; FUSION;
D O I
10.1016/j.neucom.2023.126836
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal Sentiment Analysis (MSA) is an active area of research that leverages multimodal signals for affective understanding of user-generated videos. Existing research tends to develop sophisticated fusion techniques to fuse unimodal representations into multimodal representation and treat MSA as a single prediction task. However, we find that the text modality with the pre-trained model (BERT) learn more semantic information and dominates the training in multimodal models, inhibiting the learning of other modalities. Besides, the classification ability of each modality is also suppressed by single-task learning. In this paper, We propose a text guided multi-task learning network to enhance the semantic information of non-text modalities and improve the learning ability of unimodal networks. We conducted experiments on multimodal sentiment analysis datasets, CMU-MOSI, CMU-MOSEI, and CH-SIMS. The results show that our method outperforms the current SOTA method.
引用
收藏
页数:8
相关论文
共 50 条
[41]   Multi-Task Supervised Alignment Pre-Training for Few-Shot Multimodal Sentiment Analysis [J].
Yang, Junyang ;
Cao, Jiuxin ;
Duan, Chengge .
APPLIED SCIENCES-BASEL, 2025, 15 (04)
[42]   Collaborative community-specific microblog sentiment analysis via multi-task learning [J].
Zou Xiaomei ;
Yang Jing ;
Zhang Wei ;
Han Hongyu .
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 169
[43]   Multi-Task Metric Learning on Network Data [J].
Fang, Chen ;
Rockmore, Daniel N. .
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PART I, 2015, 9077 :317-329
[44]   Multi-task network embedding [J].
Xu, Linchuan ;
Wei, Xiaokai ;
Cao, Jiannong ;
Yu, Philip S. .
INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2019, 8 (02) :183-198
[45]   Multi-Task Learning Based Network Embedding [J].
Wang, Shanfeng ;
Wang, Qixiang ;
Gong, Maoguo .
FRONTIERS IN NEUROSCIENCE, 2020, 13
[46]   A Multi-modal Sentiment Recognition Method Based on Multi-task Learning [J].
Lin Z. ;
Long Y. ;
Du J. ;
Xu R. .
Beijing Daxue Xuebao (Ziran Kexue Ban)/Acta Scientiarum Naturalium Universitatis Pekinensis, 2021, 57 (01) :7-15
[47]   Modeling More Globally: A Hierarchical Attention Network via Multi-Task Learning for Aspect-Based Sentiment Analysis [J].
Ran, Xiangying ;
Pan, Yuanyuan ;
Sun, Wei ;
Wang, Chongjun .
DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2019, 11448 :505-509
[48]   Multi-Task Multi-Head Attention Memory Network for Fine-Grained Sentiment Analysis [J].
Dai, Zehui ;
Dai, Wei ;
Liu, Zhenhua ;
Rao, Fengyun ;
Chen, Huajie ;
Zhang, Guangpeng ;
Ding, Yadong ;
Liu, Jiyang .
NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING (NLPCC 2019), PT I, 2019, 11838 :609-620
[49]   Text-guided Multi-Task Image Aesthetic Quality Assessment [J].
Yang, Hongtao ;
Wang, Guolong ;
Liu, Yehui ;
Shi, Ping ;
Zhou, Xinghui ;
Jin, Xin .
PROCEEDINGS OF THE 2ND INTERNATIONAL WORKSHOP ON MULTIMEDIA CONTENT GENERATION AND EVALUATION: NEW METHODS AND PRACTICE, MCGE 2024, 2024, :11-19
[50]   Multi-task gradient descent for multi-task learning [J].
Lu Bai ;
Yew-Soon Ong ;
Tiantian He ;
Abhishek Gupta .
Memetic Computing, 2020, 12 :355-369