Crossmodal Translation Based Meta Weight Adaption for Robust Image-Text Sentiment Analysis

被引:0
作者
Zhang, Baozheng [1 ,2 ,3 ]
Yuan, Ziqi [2 ]
Xu, Hua [2 ,4 ]
Gao, Kai [3 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, State Key Lab Intelligent Technol & Syst, Beijing 100084, Peoples R China
[2] Samton Jiangxi Technol Dev Co Ltd, Nanchang 330036, Peoples R China
[3] Hebei Univ Sci & Technol, Sch Informat Sci & Engn, Shijiazhuang 050018, Peoples R China
[4] Tsinghua Univ, Dept Comp Sci & Technol, State Key Lab Intelligent Technol & Syst, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Robustness; Task analysis; Sentiment analysis; Semantics; Metalearning; Representation learning; Social networking (online); Crossmodal translation; image-text sentiment analysis; meta learning; robustness and reliability; CLASSIFICATION; NETWORK;
D O I
10.1109/TMM.2024.3405662
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Image-Text Sentiment Analysis task has garnered increased attention in recent years due to the surge in user-generated content on social media platforms. Previous research efforts have made noteworthy progress by leveraging the affective concepts shared between vision and text modalities. However, emotional cues may reside exclusively within one of the prevailing modalities, owing to modality independent nature and the potential absence of certain modalities. In this study, we aim to emphasize the significance of modality-independent emotional behaviors, in addition to the modality-invariant behaviors. To achieve this, we propose a novel approach called Crossmodal Translation-Based Meta Weight Adaption (CTMWA). Specifically, our approach involves the construction of the crossmodal translation network, which serves as the encoder. This architecture captures the shared concepts between vision content and text, empowering the model to effectively handle scenarios where either the vision or textual modality is missing. Building upon the translation-based framework, we introduce the strategy of unimodal weight adaption. Leveraging the meta-learning paradigm, our proposed strategy gradually learns to acquire unimodal weights for individual instances from a few hand-crafted meta instances with unimodal annotations. This enables us to modulate the gradients of each modality encoder based on the discrepancy between modalities during model training. Extensive experiments are conducted on three benchmark image-text sentiment analysis datasets, namely MVSA-Single, MVSA-Multiple, and TumEmo. The empirical results demonstrate that our proposed approach achieves the highest performance across all conventional image-text databases. Furthermore, experiments under modality missing settings and case study for reliable sentiment prediction are also conducted further exhibiting superior robustness as well as reliability of the propose approach.
引用
收藏
页码:9949 / 9961
页数:13
相关论文
共 50 条
  • [21] Beyond Sentiment Analysis: A Review of Recent Trends in Text Based Sentiment Analysis and Emotion Detection
    Hung, Lai Po
    Alias, Suraya
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2023, 27 (01) : 84 - 95
  • [22] Research on image of enterprise after-sales service based on text sentiment analysis
    Dai, Yonghui
    Wang, Ying
    Xu, Bo
    Wu, Yingyi
    Xian, Jin
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2020, 22 (2-3) : 346 - 354
  • [23] SentiReview: Sentiment Analysis based on Text and Emoticons
    Yadav, Payal
    Pandya, Dhatri
    2017 INTERNATIONAL CONFERENCE ON INNOVATIVE MECHANISMS FOR INDUSTRY APPLICATIONS (ICIMIA), 2017, : 467 - 472
  • [24] Text Sentiment Analysis Based on Transformer and Augmentation
    Gong, Xiaokang
    Ying, Wenhao
    Zhong, Shan
    Gong, Shengrong
    FRONTIERS IN PSYCHOLOGY, 2022, 13
  • [25] Text Sentiment Analysis Based on Binary Images
    Xu, Dawei
    Lv, Yue
    Wang, Min
    Huang, Fan
    Zhang, Jiaxin
    PROCEEDINGS OF 2024 3RD INTERNATIONAL CONFERENCE ON FRONTIERS OF ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING, FAIML 2024, 2024, : 296 - 299
  • [26] Improving Description-Based Person Re-Identification by Multi-Granularity Image-Text Alignments
    Niu, Kai
    Huang, Yan
    Ouyang, Wanli
    Wang, Liang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 5542 - 5556
  • [27] Multi-Modal Sentiment Analysis Based on Image and Text Fusion Based on Cross-Attention Mechanism
    Li, Hongchan
    Lu, Yantong
    Zhu, Haodong
    ELECTRONICS, 2024, 13 (11)
  • [28] Meta-Learning Based Domain Prior With Application to Optical-ISAR Image Translation
    Liao, Huaizhang
    Xia, Jingyuan
    Yang, Zhixiong
    Pan, Fulin
    Liu, Zhen
    Liu, Yongxiang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) : 7041 - 7056
  • [29] Text Sentiment Analysis Based on Similarity and Cloud Model
    Ma, Changlin
    2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 4350 - 4354
  • [30] Evaluation of Product Reviews Based on Text Sentiment Analysis
    Jiang, Yuhao
    Wang, Haiguang
    Yi, Tianlun
    PROCEEDINGS OF 2021 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INFORMATION SYSTEMS (ICAIIS '21), 2021,