Weakly Correlated Multimodal Sentiment Analysis: New Dataset and Topic-Oriented Model

被引:0
|
作者
Liu, Wuchao [1 ]
Li, Wengen [1 ]
Ruan, Yu-Ping [2 ]
Shu, Yulou [1 ]
Chen, Juntao [1 ]
Li, Yina [1 ]
Yu, Caili [1 ]
Zhang, Yichao [1 ]
Guan, Jihong [1 ]
Zhou, Shuigeng [3 ]
机构
[1] Tongji Univ, Dept Comp Sci & Technol, Shanghai 200070, Peoples R China
[2] Zhejiang Lab, Hangzhou 311121, Peoples R China
[3] Fudan Univ, Sch Comp Sci, Shanghai 200433, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Sentiment analysis; Social networking (online); Reviews; Analytical models; Correlation; Visualization; Blogs; Image-text alignment; multimodal sentiment analysis; topic-oriented analysis; weak correlation; LANGUAGE;
D O I
10.1109/TAFFC.2024.3396144
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing multimodal sentiment analysis models focus more on fusing highly correlated image-text pairs, and thus achieves unsatisfactory performance on multimodal social media data which usually manifests weak correlations between different modalities. To address this issue, we first build a large multimodal social media sentiment analysis dataset RU-Senti which contains more than 100,000 image-text pairs with sentiment labels. Then, we proposed a topic-oriented model (TOM) which assumes that text is usually related to a certain portion of the image contents and significant variances exist in sentiment distribution across diverse topics. TOM learns the topic information from textual content and designs a topic-oriented feature alignment module to extract textual semantics correlated information from images, thus achieving the alignment between two modalities. Then, TOM utilizes a transformer encoder initialized with the parameters from a pre-trained vision-language model to fuse the multimodal features for sentiment prediction. According to the experiments over the public MVSA-Multiple dataset and our RU-Senti dataset, RU-Senti is of high suitability for studying weakly correlated multimodal sentiment analysis, and the proposed TOM model also largely outperforms the SOTA mulitimodal sentiment analysis methods and pre-trained vision-language models.
引用
收藏
页码:2070 / 2082
页数:13
相关论文
共 50 条
  • [1] Clustering and visualizing audiovisual dataset on mobile devices in a topic-oriented manner
    Wang, Lei
    Tjondrongoro, Dian
    Liu, Yuee
    ADVANCES IN VISUAL INFORMATION SYSTEMS, 2007, 4781 : 310 - 321
  • [2] Joint Sentiment Part Topic Regression Model for Multimodal Analysis
    Li, Mengyao
    Zhu, Yonghua
    Gao, Wenjing
    Cao, Meng
    Wang, Shaoxiu
    INFORMATION, 2020, 11 (10) : 1 - 16
  • [3] Unmasking deception: a topic-oriented multimodal approach to uncover false information on social media
    Riccardo Cantini
    Cristian Cosentino
    Irene Kilanioti
    Fabrizio Marozzo
    Domenico Talia
    Machine Learning, 2025, 114 (1)
  • [4] Topic-oriented search model based on multi-agent
    Shen Jie
    Sun Rong-Shuang
    Wei Liu-Hua
    Mang Hui
    Zhu Yan
    Chen Chen
    CIS: 2007 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY, PROCEEDINGS, 2007, : 276 - 280
  • [5] Target-oriented multimodal sentiment classification by using topic model and gating mechanism
    Song, Zhengxin
    Xue, Yun
    Gu, Donghong
    Zhang, Haolan
    Ding, Weiping
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (07) : 2289 - 2299
  • [6] Target-oriented multimodal sentiment classification by using topic model and gating mechanism
    Zhengxin Song
    Yun Xue
    Donghong Gu
    Haolan Zhang
    Weiping Ding
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 2289 - 2299
  • [7] Detecting Topic-oriented Overlapping Community Using Hybrid a Hypergraph Model
    Shen, G. L.
    Yang, X. P.
    Sun, J.
    INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2016, 11 (04) : 538 - 552
  • [8] A weakly-supervised graph-based joint sentiment topic model for multi-topic sentiment analysis
    Zhou, Tao
    Law, Kris
    Creighton, Douglas
    INFORMATION SCIENCES, 2022, 609 : 1030 - 1051
  • [9] Multimodal learning for topic sentiment analysis in microblogging
    Huang, Faliang
    Zhang, Shichao
    Zhang, Jilian
    Yu, Ge
    NEUROCOMPUTING, 2017, 253 : 144 - 153
  • [10] ICLB: Target-Oriented Multimodal Sentiment Classification by Using Image Caption and Topic Model
    Chen, Ziwei
    Wei, Fupeng
    Zheng, Qiusheng
    Liu, Xing
    Niu, Liyue
    COMPUTATIONAL AND EXPERIMENTAL SIMULATIONS IN ENGINEERING, ICCES 2024-VOL 2, 2025, 173 : 150 - 167