A New Contrastive Learning-Based Vision Transformer for Sentiment Analysis Using Scene Text Images

被引：0

作者：

Palaiahnakote, Shivakumara ^{[1
,3
]}

Kapri, Dhruv ^{[2
]}

Saleem, Muhammad Hammad ^{[1
,3
]}

Pal, Umapada ^{[2
]}

机构：

[1] School of Science, Engineering and Environment, University of Salford, Salford

[2] Computer Vision and Pattern Recognition Unit, Indian Statistical Institute Kolkata

[3] Data Science and Artficial Intelligence (DSAI) Hub, University of Salford, Manchester

来源：

International Journal of Pattern Recognition and Artificial Intelligence | 2024年 / 38卷 / 16期

关键词：

contrastive learning; Scene text; sentiment analysis; text appearance; transformer; vision transformer;

D O I：

10.1142/S0218001424520293

中图分类号：

学科分类号：

摘要：

Sentiment analysis using scene text images is complex and challenging because it has an arbitrary background, and the method should rely on only visual features. Unlike most existing methods that use either text or images or both, this study uses only scene text images for sentiment analysis. The intuition to use only scene text images is that sometimes users express their feelings and emotions or convey their messages by writing text in different shapes with diverse background designs. It is noted that the existing methods ignore such vital cues for sentiment analysis. This work explores a vision transformer to extract visual features that represent contextual information about the appearance of the text image. Further, to strengthen the visual features, the proposed work introduces contrastive learning which maximizes the gap between inter-classes and minimizes the gap between intra-classes of positive, negative, and neutral. To demonstrate the effectiveness of the proposed method, it is tested on our own constructed dataset and benchmark dataset. A comparative study of our method with the existing method shows the proposed method is superior in the classification of positive, negative, and neutral scene text images. © 2024 World Scientific Publishing Company.

引用

共 50 条

[1] Transformer-based adaptive contrastive learning for multimodal sentiment analysis
Hu Y.
Huang X.
Wang X.
Lin H.
Zhang R.
Multimedia Tools and Applications, 2025, 84 (3) : 1385 - 1402
[2] Satellite Images Analysis and Classification using Deep Learning-based Vision Transformer Model
Adegun, Adekanmi Adeyinka
Viriri, Serestina
2023 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE, CSCI 2023, 2023, : 1275 - 1279
[3] Text Sentiment Analysis Based on Transformer and Augmentation
Gong, Xiaokang
Ying, Wenhao
Zhong, Shan
Gong, Shengrong
FRONTIERS IN PSYCHOLOGY, 2022, 13
[4] Vision Transformer With Contrastive Learning for Remote Sensing Image Scene Classification
Bi, Meiqiao
Wang, Minghua
Li, Zhi
Hong, Danfeng
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 738 - 749
[5] Text-Centric Multimodal Contrastive Learning for Sentiment Analysis
Peng, Heng
Gu, Xue
Li, Jian
Wang, Zhaodan
Xu, Hao
ELECTRONICS, 2024, 13 (06)
[6] Text Sentiment Analysis Based on Binary Images
Xu, Dawei
Lv, Yue
Wang, Min
Huang, Fan
Zhang, Jiaxin
PROCEEDINGS OF 2024 3RD INTERNATIONAL CONFERENCE ON FRONTIERS OF ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING, FAIML 2024, 2024, : 296 - 299
[7] Contrastive Learning-Based Cross-Domain Data Augmentation for Aspect-Based Sentiment Analysis
Xue, Xiaoling
Xu, Bin
Dong, Xiaodi
Cai, Qihang
Gao, Kening
WEB INFORMATION SYSTEMS AND APPLICATIONS, WISA 2024, 2024, 14883 : 180 - 188
[8] Contrastive Transformer Learning With Proximity Data Generation for Text-Based Person Search
Wu, Hefeng
Chen, Weifeng
Liu, Zhibin
Chen, Tianshui
Chen, Zhiguang
Lin, Liang
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) : 7005 - 7016
[9] Enhancing Aspect-Based Sentiment Analysis with Supervised Contrastive Learning
Liang, Bin
Luo, Wangda
Li, Xiang
Gui, Lin
Yang, Min
Yu, Xiaoqi
Xu, Ruifeng
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 3242 - 3247
[10] Contrastive learning-based structure preserving projection for hyperspectral images
Zhao, Siyu
Zhang, Hongjie
Gong, Bo
Jing, Ling
Chen, Yingyi
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2022, 43 (19-24) : 7002 - 7023

← 1 2 3 4 5 →