Analysis of Arabic Tweet Sentiment About Trending Hashtags Using Transfer Learning and Machine Learning Models

被引:0
作者
Al-Qahtani, Waad Nasier [1 ]
Muniasamy, Anandhavalli [1 ]
机构
[1] King Khalid Univ, Coll Comp Sci, Abha, Saudi Arabia
来源
2024 IEEE 7TH INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES, SIGNAL AND IMAGE PROCESSING, ATSIP 2024 | 2024年
关键词
Machine Learning; Opinion Mining; Sentiment Analysis; Natural Language Processing; Twitter; Lexical Semantic; Arabic Language; SELECTION;
D O I
10.1109/ATSIP62566.2024.10638917
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This study used a multi-pronged approach to examine how Gulf region Twitter users felt about COVID-19 trending topics. The data collection phase of the study started with the selection of relevant hashtags and time frames for analysis. The Twitter API was then used to compile a representative dataset of Arabic tweets pertaining to the COVID-19 trending topics. Subsequently, the research proceeded to the data annotation stage, employing a hybrid annotation technique that fused the transfer learning model and lexicon-based approach to assign a sentiment label to every tweet. Analyzing the patterns of tweet distribution over time exposed interesting patterns and possible sentiment expression influencers. The research obtained good accuracy scores by using a sentiment analysis model that combined three popular machine learning algorithms (Multinomial Naive Bayes, CountVectorizer, and TfidfVectorizer) with three feature representations (Ngram, TfidfVectorizer, and CountVectorizer). The sentiment tendencies of Arabic-speaking Twitter users toward trending topics were revealed by these scores. With the Ngram(1,2) representation, the LinearSVC algorithm achieved an impressive accuracy score of 89.1%, making it stand out as the best performer among all feature representations.
引用
收藏
页码:239 / 244
页数:6
相关论文
共 35 条
[1]  
Abdulla NA, 2013, 2013 IEEE JORDAN CONFERENCE ON APPLIED ELECTRICAL ENGINEERING AND COMPUTING TECHNOLOGIES (AEECT)
[2]  
Abu Farha I, 2019, FOURTH ARABIC NATURAL LANGUAGE PROCESSING WORKSHOP (WANLP 2019), P192
[3]   Feature extraction and selection for Arabic tweets authorship authentication [J].
Al-Ayyoub, Mahmoud ;
Jararweh, Yaser ;
Rabab'ah, Abdullateef ;
Aldwairi, Monther .
JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2017, 8 (03) :383-393
[4]   Sentiment lexicon for sentiment analysis of Saudi dialect tweets [J].
Al-Thubaity, Abdulmohsen ;
Alqahtani, Qubayl ;
Aljandal, Abdulaziz .
ARABIC COMPUTATIONAL LINGUISTICS, 2018, 142 :301-307
[5]   A Sentiment Analysis Approach to Predict an Individual's Awareness of the Precautionary Procedures to Prevent COVID-19 Outbreaks in Saudi Arabia [J].
Aljameel, Sumayh S. ;
Alabbad, Dina A. ;
Alzahrani, Norah A. ;
Alqarni, Shouq M. ;
Alamoudi, Fatimah A. ;
Babili, Lana M. ;
Aljaafary, Somiah K. ;
Alshamrani, Fatima M. .
INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (01) :1-12
[6]  
Althagafi A, 2021, INT J ADV COMPUT SC, V12, P620
[7]   Challenges in Sentiment Analysis for Arabic Social Networks [J].
Alwakid, Ghadah ;
Osman, Taha ;
Hughes-Roberts, Thomas .
ARABIC COMPUTATIONAL LINGUISTICS (ACLING 2017), 2017, 117 :89-100
[8]  
[Anonymous], 2021, Twitter
[9]  
Awad M., 2015, Efficient Learning Machines, DOI [DOI 10.1007/978-1-4302-5990-9_3, 10.1007/978-1-4302-5990-9, DOI 10.1007/978-1-4302-5990-93, 10.1007/978-1-4302-5990-93]
[10]   Towards Arabic aspect-based sentiment analysis: a transfer learning-based approach [J].
Bensoltane, Rajae ;
Zaki, Taher .
SOCIAL NETWORK ANALYSIS AND MINING, 2022, 12 (01)