A text guided multi-task learning network for multimodal sentiment analysis

被引:10
作者
Luo, Yuanyi [1 ]
Wu, Rui [1 ]
Liu, Jiafeng [1 ]
Tang, Xianglong [1 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
关键词
Multimodal sentiment analysis; Representation learning; Multi-task learning; FUSION;
D O I
10.1016/j.neucom.2023.126836
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal Sentiment Analysis (MSA) is an active area of research that leverages multimodal signals for affective understanding of user-generated videos. Existing research tends to develop sophisticated fusion techniques to fuse unimodal representations into multimodal representation and treat MSA as a single prediction task. However, we find that the text modality with the pre-trained model (BERT) learn more semantic information and dominates the training in multimodal models, inhibiting the learning of other modalities. Besides, the classification ability of each modality is also suppressed by single-task learning. In this paper, We propose a text guided multi-task learning network to enhance the semantic information of non-text modalities and improve the learning ability of unimodal networks. We conducted experiments on multimodal sentiment analysis datasets, CMU-MOSI, CMU-MOSEI, and CH-SIMS. The results show that our method outperforms the current SOTA method.
引用
收藏
页数:8
相关论文
共 50 条
[31]   Multi-task learning for abstractive text summarization with key information guide network [J].
Weiran Xu ;
Chenliang Li ;
Minghao Lee ;
Chi Zhang .
EURASIP Journal on Advances in Signal Processing, 2020
[32]   A Multi-Task Learning Approach to Hate Speech Detection Leveraging Sentiment Analysis [J].
Plaza-Del-Arco, Flor Miriam ;
Molina-Gonzalez, M. Dolores ;
Urena-Lopez, L. Alfonso ;
Martin-Valdivia, Maria Teresa .
IEEE ACCESS, 2021, 9 :112478-112489
[33]   Multi-task learning for abstractive text summarization with key information guide network [J].
Xu, Weiran ;
Li, Chenliang ;
Lee, Minghao ;
Zhang, Chi .
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2020, 2020 (01)
[34]   MTLFormer: Multi-Task Learning Guided Transformer Network for Business Process Prediction [J].
Wang, Jiaojiao ;
Huang, Jiawei ;
Ma, Xiaoyu ;
Li, Zhongjin ;
Wang, Yaqi ;
Yu, Dingguo .
IEEE ACCESS, 2023, 11 :76722-76738
[35]   Multi-Task Learning and Multimodal Fusion for Road Segmentation [J].
Cheng, Bowen ;
Tian, Miaomiao ;
Jiang, Shuai ;
Liu, Weiwei ;
Pang, Yalong .
IEEE ACCESS, 2023, 11 :18947-18959
[36]   Multi-task Learning for Brain Network Analysis in the ABCD Study [J].
Kan, Xuan ;
Cui, Hejie ;
Han, Keqi ;
Guo, Ying ;
Yang, Carl .
2024 IEEE EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL AND HEALTH INFORMATICS, BHI, 2024,
[37]   Multi-task learning framework using tri-encoder with caption prompt for multimodal aspect-based sentiment analysis [J].
Cai, Yuanyuan ;
Tong, Fei ;
Zhang, Qingchuan ;
Xiong, Haitao .
JOURNAL OF SUPERCOMPUTING, 2025, 81 (06)
[38]   Adaptive multi-task learning for speech to text translation [J].
Feng, Xin ;
Zhao, Yue ;
Zong, Wei ;
Xu, Xiaona .
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2024, 2024 (01)
[39]   Aspect-invariant Sentiment Features Learning: Adversarial Multi-task Learning for Aspect-based Sentiment Analysis [J].
Liang, Bin ;
Yin, Rongdi ;
Gui, Lin ;
Du, Jiachen ;
He, Yulan ;
Xu, Ruifeng .
CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, :825-834
[40]   Multi-task network embedding [J].
Linchuan Xu ;
Xiaokai Wei ;
Jiannong Cao ;
Philip S. Yu .
International Journal of Data Science and Analytics, 2019, 8 :183-198