Content-aware sentiment understanding: cross-modal analysis with encoder-decoder architectures

被引:0
作者
Pakdaman, Zahra [1 ]
Koochari, Abbas [1 ]
Sharifi, Arash [1 ]
机构
[1] Islamic Azad Univ, Dept Comp Engn, Sci & Res Branch, Tehran, Iran
来源
JOURNAL OF COMPUTATIONAL SOCIAL SCIENCE | 2025年 / 8卷 / 02期
关键词
Sentiment analysis; Image captioning; Meme detection; Large language model; Transformer; CLASSIFICATION; NETWORK;
D O I
10.1007/s42001-025-00374-y
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
The analysis of sentiment from social media data has attracted significant attention due to the proliferation of user-generated opinions and comments on these platforms. Social media content is often multi-modal, frequently combining images and text within single posts. To effectively estimate user sentiment across multiple content types, this study proposes a multimodal content-aware approach. It distinguishes text-dominant images, memes, and regular images, extracting embedded text from memes or text-dominant images. Using the Swin Transformer-GPT-2 (encoder-decoder) architecture, captions are generated for image analysis. The user's sentiment is then estimated by analyzing embedded text, generated captions, and user-provided captions through a BiLSTM-LSTM (encoder-decoder) architecture and fully connected layers. The proposed method demonstrates superior performance, achieving 93% accuracy on the MVSA-Single dataset, 79% accuracy on the MVSA-Multiple dataset, and 90% accuracy on the TWITTER (Large) dataset surpassing current state-of-the-art methods.
引用
收藏
页数:24
相关论文
共 52 条
  • [1] Priban P., Smid J., Steinberger J., Mistera A., A comparative study of cross-lingual sentiment analysis, Expert Systems with Applications, (2024)
  • [2] Zhang H., Cheah Y., Alyasiri O., An J., Exploring aspect-based sentiment quadruple extraction with implicit aspects, opinions, and ChatGPT: A comprehensive survey, Artificial Intelligence Review, 57, (2024)
  • [3] Singh T., Rajput V., Sharma N., Kumar M., Sentiment analysis based distributed recommendation system, Multimedia Tools and Applications, (2024)
  • [4] Dey R., Das A., Neighbour adjusted dispersive flies optimization based deep hybrid sentiment analysis framework, Multimedia Tools and Applications, (2024)
  • [5] Danyal M., Khan S., Khan M., Ullah S., Mehmood F., Ali I., Proposing sentiment analysis model based on BERT and XLNet for movie reviews, Multimedia Tools and Applications, (2024)
  • [6] Strubytskyi R., Shakhovska N., Method and models for sentiment analysis and hidden propaganda finding, Computers in Human Behavior Reports, 12, (2023)
  • [7] Kumar P., Pathania K., Raman B., Zero-shot learning based cross-lingual sentiment analysis for sanskrit text with insufficient labeled data, Applied Intelligence, 53, 9, pp. 10096-10113, (2023)
  • [8] Liu Z., Liao H., Li M., Yang Q., &amp, (2023)
  • [9] Cano E., Albmore: A corpus of movie reviews for sentiment analysis in albanian. arXiv preprint arXiv, 2306, (2023)
  • [10] Saranya S., Usha G., A machine learning-based technique with IntelligentWordNet lemmatize for twitter sentiment analysis, Intelligent Automation & Soft Computing, 36, 1, (2023)