A text guided multi-task learning network for multimodal sentiment analysis

被引:6
|
作者
Luo, Yuanyi [1 ]
Wu, Rui [1 ]
Liu, Jiafeng [1 ]
Tang, Xianglong [1 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
关键词
Multimodal sentiment analysis; Representation learning; Multi-task learning; FUSION;
D O I
10.1016/j.neucom.2023.126836
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal Sentiment Analysis (MSA) is an active area of research that leverages multimodal signals for affective understanding of user-generated videos. Existing research tends to develop sophisticated fusion techniques to fuse unimodal representations into multimodal representation and treat MSA as a single prediction task. However, we find that the text modality with the pre-trained model (BERT) learn more semantic information and dominates the training in multimodal models, inhibiting the learning of other modalities. Besides, the classification ability of each modality is also suppressed by single-task learning. In this paper, We propose a text guided multi-task learning network to enhance the semantic information of non-text modalities and improve the learning ability of unimodal networks. We conducted experiments on multimodal sentiment analysis datasets, CMU-MOSI, CMU-MOSEI, and CH-SIMS. The results show that our method outperforms the current SOTA method.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] A Neural Network Based on WXLNet and Multi-Task Lable Embedding for Sentiment Analysis
    Xie, Chenxi
    Meng, Zhongvi
    Song, Bo
    Jiang, Guoping
    Song, Yurong
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 2359 - 2366
  • [32] TASK AWARE MULTI-TASK LEARNING FOR SPEECH TO TEXT TASKS
    Indurthi, Sathish
    Zaidi, Mohd Abbas
    Lakumarapu, Nikhil Kumar
    Lee, Beomseok
    Han, Hyojung
    Ahn, Seokchan
    Kim, Sangha
    Kim, Chanwoo
    Hwang, Inchul
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7723 - 7727
  • [33] TGGS network: A multi-task learning network for gradient-guided knowledge sharing
    Huang, Yongjie
    Han, Xiao
    Chen, Man
    Pan, Zhisong
    KNOWLEDGE-BASED SYSTEMS, 2024, 301
  • [34] Multi-Task Network Representation Learning
    Xie, Yu
    Jin, Peixuan
    Gong, Maoguo
    Zhang, Chen
    Yu, Bin
    FRONTIERS IN NEUROSCIENCE, 2020, 14
  • [35] Network Clustering for Multi-task Learning
    Mu, Zhiying
    Gao, Dehong
    Guo, Sensen
    NEURAL PROCESSING LETTERS, 2025, 57 (01)
  • [36] A deep multimodal network for multi-task trajectory prediction
    Lei, Da
    Xu, Min
    Wang, Shuaian
    INFORMATION FUSION, 2025, 113
  • [37] Multi-task learning for abstractive text summarization with key information guide network
    Weiran Xu
    Chenliang Li
    Minghao Lee
    Chi Zhang
    EURASIP Journal on Advances in Signal Processing, 2020
  • [38] Text Emotion Distribution Learning via Multi-Task Convolutional Neural Network
    Zhang, Yuxiang
    Fu, Jiamei
    She, Dongyu
    Zhang, Ying
    Wang, Senzhang
    Yang, Jufeng
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 4595 - 4601
  • [39] A Multi-Task Learning Approach to Hate Speech Detection Leveraging Sentiment Analysis
    Plaza-Del-Arco, Flor Miriam
    Molina-Gonzalez, M. Dolores
    Urena-Lopez, L. Alfonso
    Martin-Valdivia, Maria Teresa
    IEEE ACCESS, 2021, 9 : 112478 - 112489
  • [40] Multi-task learning for abstractive text summarization with key information guide network
    Xu, Weiran
    Li, Chenliang
    Lee, Minghao
    Zhang, Chi
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2020, 2020 (01)