A New Contrastive Learning-Based Vision Transformer for Sentiment Analysis Using Scene Text Images

被引:0
作者
Palaiahnakote, Shivakumara [1 ,3 ]
Kapri, Dhruv [2 ]
Saleem, Muhammad Hammad [1 ,3 ]
Pal, Umapada [2 ]
机构
[1] School of Science, Engineering and Environment, University of Salford, Salford
[2] Computer Vision and Pattern Recognition Unit, Indian Statistical Institute Kolkata
[3] Data Science and Artficial Intelligence (DSAI) Hub, University of Salford, Manchester
关键词
contrastive learning; Scene text; sentiment analysis; text appearance; transformer; vision transformer;
D O I
10.1142/S0218001424520293
中图分类号
学科分类号
摘要
Sentiment analysis using scene text images is complex and challenging because it has an arbitrary background, and the method should rely on only visual features. Unlike most existing methods that use either text or images or both, this study uses only scene text images for sentiment analysis. The intuition to use only scene text images is that sometimes users express their feelings and emotions or convey their messages by writing text in different shapes with diverse background designs. It is noted that the existing methods ignore such vital cues for sentiment analysis. This work explores a vision transformer to extract visual features that represent contextual information about the appearance of the text image. Further, to strengthen the visual features, the proposed work introduces contrastive learning which maximizes the gap between inter-classes and minimizes the gap between intra-classes of positive, negative, and neutral. To demonstrate the effectiveness of the proposed method, it is tested on our own constructed dataset and benchmark dataset. A comparative study of our method with the existing method shows the proposed method is superior in the classification of positive, negative, and neutral scene text images. © 2024 World Scientific Publishing Company.
引用
收藏
相关论文
共 50 条
  • [1] Transformer-based adaptive contrastive learning for multimodal sentiment analysis
    Hu Y.
    Huang X.
    Wang X.
    Lin H.
    Zhang R.
    Multimedia Tools and Applications, 2025, 84 (3) : 1385 - 1402
  • [2] Satellite Images Analysis and Classification using Deep Learning-based Vision Transformer Model
    Adegun, Adekanmi Adeyinka
    Viriri, Serestina
    2023 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE, CSCI 2023, 2023, : 1275 - 1279
  • [3] Text Sentiment Analysis Based on Transformer and Augmentation
    Gong, Xiaokang
    Ying, Wenhao
    Zhong, Shan
    Gong, Shengrong
    FRONTIERS IN PSYCHOLOGY, 2022, 13
  • [4] Vision Transformer With Contrastive Learning for Remote Sensing Image Scene Classification
    Bi, Meiqiao
    Wang, Minghua
    Li, Zhi
    Hong, Danfeng
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 738 - 749
  • [5] Text-Centric Multimodal Contrastive Learning for Sentiment Analysis
    Peng, Heng
    Gu, Xue
    Li, Jian
    Wang, Zhaodan
    Xu, Hao
    ELECTRONICS, 2024, 13 (06)
  • [6] Text Sentiment Analysis Based on Binary Images
    Xu, Dawei
    Lv, Yue
    Wang, Min
    Huang, Fan
    Zhang, Jiaxin
    PROCEEDINGS OF 2024 3RD INTERNATIONAL CONFERENCE ON FRONTIERS OF ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING, FAIML 2024, 2024, : 296 - 299
  • [7] Contrastive Learning-Based Cross-Domain Data Augmentation for Aspect-Based Sentiment Analysis
    Xue, Xiaoling
    Xu, Bin
    Dong, Xiaodi
    Cai, Qihang
    Gao, Kening
    WEB INFORMATION SYSTEMS AND APPLICATIONS, WISA 2024, 2024, 14883 : 180 - 188
  • [8] Contrastive Transformer Learning With Proximity Data Generation for Text-Based Person Search
    Wu, Hefeng
    Chen, Weifeng
    Liu, Zhibin
    Chen, Tianshui
    Chen, Zhiguang
    Lin, Liang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) : 7005 - 7016
  • [9] Enhancing Aspect-Based Sentiment Analysis with Supervised Contrastive Learning
    Liang, Bin
    Luo, Wangda
    Li, Xiang
    Gui, Lin
    Yang, Min
    Yu, Xiaoqi
    Xu, Ruifeng
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 3242 - 3247
  • [10] Contrastive learning-based structure preserving projection for hyperspectral images
    Zhao, Siyu
    Zhang, Hongjie
    Gong, Bo
    Jing, Ling
    Chen, Yingyi
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2022, 43 (19-24) : 7002 - 7023