Crossmodal Translation Based Meta Weight Adaption for Robust Image-Text Sentiment Analysis

被引:0
作者
Zhang, Baozheng [1 ,2 ,3 ]
Yuan, Ziqi [2 ]
Xu, Hua [2 ,4 ]
Gao, Kai [3 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, State Key Lab Intelligent Technol & Syst, Beijing 100084, Peoples R China
[2] Samton Jiangxi Technol Dev Co Ltd, Nanchang 330036, Peoples R China
[3] Hebei Univ Sci & Technol, Sch Informat Sci & Engn, Shijiazhuang 050018, Peoples R China
[4] Tsinghua Univ, Dept Comp Sci & Technol, State Key Lab Intelligent Technol & Syst, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Robustness; Task analysis; Sentiment analysis; Semantics; Metalearning; Representation learning; Social networking (online); Crossmodal translation; image-text sentiment analysis; meta learning; robustness and reliability; CLASSIFICATION; NETWORK;
D O I
10.1109/TMM.2024.3405662
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Image-Text Sentiment Analysis task has garnered increased attention in recent years due to the surge in user-generated content on social media platforms. Previous research efforts have made noteworthy progress by leveraging the affective concepts shared between vision and text modalities. However, emotional cues may reside exclusively within one of the prevailing modalities, owing to modality independent nature and the potential absence of certain modalities. In this study, we aim to emphasize the significance of modality-independent emotional behaviors, in addition to the modality-invariant behaviors. To achieve this, we propose a novel approach called Crossmodal Translation-Based Meta Weight Adaption (CTMWA). Specifically, our approach involves the construction of the crossmodal translation network, which serves as the encoder. This architecture captures the shared concepts between vision content and text, empowering the model to effectively handle scenarios where either the vision or textual modality is missing. Building upon the translation-based framework, we introduce the strategy of unimodal weight adaption. Leveraging the meta-learning paradigm, our proposed strategy gradually learns to acquire unimodal weights for individual instances from a few hand-crafted meta instances with unimodal annotations. This enables us to modulate the gradients of each modality encoder based on the discrepancy between modalities during model training. Extensive experiments are conducted on three benchmark image-text sentiment analysis datasets, namely MVSA-Single, MVSA-Multiple, and TumEmo. The empirical results demonstrate that our proposed approach achieves the highest performance across all conventional image-text databases. Furthermore, experiments under modality missing settings and case study for reliable sentiment prediction are also conducted further exhibiting superior robustness as well as reliability of the propose approach.
引用
收藏
页码:9949 / 9961
页数:13
相关论文
共 50 条
  • [41] A Study of the Application of Weight Distributing Method Combining Sentiment Dictionary and TF-IDF for Text Sentiment Analysis
    Liu, Hao
    Chen, Xi
    Liu, Xiaoxiao
    IEEE ACCESS, 2022, 10 : 32280 - 32289
  • [42] Detection of citrus diseases in complex backgrounds based on image-text multimodal fusion and knowledge assistance
    Qiu, Xia
    Chen, Hongwen
    Huang, Ping
    Zhong, Dan
    Guo, Tao
    Pu, Changbin
    Li, Zongnan
    Liu, Yongling
    Chen, Jin
    Wang, Si
    FRONTIERS IN PLANT SCIENCE, 2023, 14
  • [43] An Adaptive Masked Attention Mechanism to Act on the Local Text in a Global Context for Aspect-Based Sentiment Analysis
    Lin, Te
    Joe, Inwhee
    IEEE ACCESS, 2023, 11 : 43055 - 43066
  • [44] Modal Contrastive Learning Based End-to-End Text Image Machine Translation
    Ma, Cong
    Han, Xu
    Wu, Linghui
    Zhang, Yaping
    Zhao, Yang
    Zhou, Yu
    Zong, Chengqing
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 2153 - 2165
  • [45] Visual sentiment topic model based microblog image sentiment analysis
    Cao, Donglin
    Ji, Rongrong
    Lin, Dazhen
    Li, Shaozi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (15) : 8955 - 8968
  • [46] Sentiment analysis of Chinese micro-blog text based on extended sentiment dictionary
    Zhang, Shunxiang
    Wei, Zhongliang
    Wang, Yin
    Liao, Tao
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 81 : 395 - 403
  • [47] Anomaly analysis based on meta-subspace approach for sentiment classification
    Sudha, K.
    Suguna, N.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 36 (04) : 3403 - 3412
  • [48] Weibo Text Sentiment Analysis Based on BERT and Deep Learning
    Li, Hongchan
    Ma, Yu
    Ma, Zishuai
    Zhu, Haodong
    APPLIED SCIENCES-BASEL, 2021, 11 (22):
  • [49] Research on text sentiment analysis method based on knowledge enhancement
    Ren, Yiping
    Zhao, Yahui
    Jin, Guozhe
    Jiang, Kexin
    Li, De
    Cui, Rongyi
    2023 3RD ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS TECHNOLOGY AND COMPUTER SCIENCE, ACCTCS, 2023, : 276 - 280
  • [50] English text sentiment analysis based on generative adversarial network
    Xuanyan Gong
    Evolutionary Intelligence, 2023, 16 : 1599 - 1607