Enhancing Emoji-Based Sentiment Classification in Urdu Tweets: Fusion Strategies With Multilingual BERT and Emoji Embeddings

Cited by: 1
Authors
Rani Narejo, Komal [1]
Zan, Hongying [1]
Oralbekova, Dina [2]
Parkash Dharmani, Kheem [3]
Mamyrbayev, Orken [2]
Mukhsina, Kuralai [2]
Affiliations
[1] Zhengzhou Univ, Sch Comp & Artificial Intelligence, Zhengzhou 450001, Peoples R China
[2] Inst Informat & Computat Technol Almaty, Alma Ata 050060, Kazakhstan
[3] Natl Univ Comp & Emerging Sci, Sch Comp, Islamabad 04403, Pakistan
Keywords
Emojis; Sentiment analysis; Social networking (online); Accuracy; Neural networks; Blogs; Analytical models; Natural language processing; Urdu tweets; fine-tuning; multilingual BERT; XLM-RoBERTa; emoji embeddings
DOI
10.1109/ACCESS.2024.3446897
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
X (formerly Twitter) is a popular social network with hundreds of millions of users. This study examines how emojis can improve the detection of user sentiment in Urdu-language tweets, a task made difficult by the language's intricate structure and diverse dialects. We combine emoji embeddings with the SentiUrdu-1M dataset, which consists of 1.14 million Urdu tweets and 1,194 emojis, using multilingual BERT (mBERT). The study has two aims: 1) to evaluate how well pre-trained emoji2vec embeddings and our proposed Urdu-Specific FastText emoji embeddings distinguish emojis by the expressions they convey; and 2) to explore techniques for integrating Urdu tweet and emoji embeddings, including concatenation, neural network fusion, and attention mechanism fusion. As baselines, we fine-tuned multilingual BERT and XLM-RoBERTa on text-only Urdu tweets, achieving accuracies of 64% and 65%, respectively. The study thereby addresses a gap in the literature: the use of emojis to enhance sentiment analysis of Urdu tweets has received limited attention. The proposed Urdu-Specific FastText emoji embeddings outperform the pre-trained emoji2vec embeddings and raise sentiment analysis accuracy to as much as 95% with the neural network fusion approach.
Pages: 126587-126600
Number of pages: 14
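
The three fusion strategies named in the abstract (concatenation, neural network fusion, and attention mechanism fusion) can be illustrated with a toy sketch. This is not the authors' implementation: the vectors below are random stand-ins for an mBERT tweet embedding and per-emoji embeddings, every function name is hypothetical, the dense-layer weights are untrained, and the attention variant assumes that text and emoji vectors share the same dimension.

```python
import math
import random


def mean_pool(vectors):
    """Average a list of equal-length vectors element-wise."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def concat_fusion(text_vec, emoji_vecs):
    """Concatenation: the text vector followed by the mean emoji vector."""
    return list(text_vec) + mean_pool(emoji_vecs)


def attention_weights(text_vec, emoji_vecs):
    """Scaled dot-product weight of each emoji, using the text vector as query.

    Assumes text and emoji vectors have the same dimension.
    """
    scale = math.sqrt(len(text_vec))
    scores = [
        sum(t * e for t, e in zip(text_vec, emoji_vec)) / scale
        for emoji_vec in emoji_vecs
    ]
    return softmax(scores)


def attention_fusion(text_vec, emoji_vecs):
    """Attention fusion: text vector plus an attention-weighted emoji summary."""
    weights = attention_weights(text_vec, emoji_vecs)
    dim = len(emoji_vecs[0])
    context = [
        sum(w * emoji_vec[i] for w, emoji_vec in zip(weights, emoji_vecs))
        for i in range(dim)
    ]
    return list(text_vec) + context


def dense_fusion(text_vec, emoji_vecs, weight_matrix, bias):
    """Neural network fusion: one ReLU dense layer over the concatenation."""
    x = concat_fusion(text_vec, emoji_vecs)
    return [
        max(0.0, sum(w * xi for w, xi in zip(row, x)) + b)
        for row, b in zip(weight_matrix, bias)
    ]


# Toy inputs (assumptions): 4-dimensional vectors, three emojis in one tweet.
rng = random.Random(0)
DIM, HIDDEN = 4, 3
text_vec = [rng.uniform(-1, 1) for _ in range(DIM)]
emoji_vecs = [[rng.uniform(-1, 1) for _ in range(DIM)] for _ in range(3)]
# Untrained, randomly initialised dense-layer parameters.
W = [[rng.uniform(-0.5, 0.5) for _ in range(2 * DIM)] for _ in range(HIDDEN)]
b = [0.0] * HIDDEN

print("concatenation:", len(concat_fusion(text_vec, emoji_vecs)), "features")
print("attention:    ", len(attention_fusion(text_vec, emoji_vecs)), "features")
print("dense layer:  ", len(dense_fusion(text_vec, emoji_vecs, W, b)), "features")
```

In a real pipeline the fused vector would feed a classification head trained end to end; the sketch only shows how the three ways of combining the two representations differ in shape and in how emoji information is weighted.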