Multi-Class Sentiment Analysis of COVID-19 Tweets by Machine Learning and Deep Learning Approaches

被引:0
作者
Moustafa, Maaskri [1 ]
Mokhtar-Mostefaoui, Sid Ahmed [1 ]
Hadj-Meghazi, Madani [1 ]
Goismi, Mohamed [2 ]
机构
[1] Univ Tiaret, Comp Sci Dept, LRIAS Lab, Tiaret, Algeria
[2] Dr Tahar Moulay Univ, Comp Sci Dept, GeCoDe Lab, Saida, Algeria
来源
COMPUTACION Y SISTEMAS | 2024年 / 28卷 / 02期
关键词
Ensemble machine learning; deep learning; voting; bagging; stacking; BERT;
D O I
10.13053/CyS-28-2-4568
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
COVID-19 is a virus that has spread rapidly over the globe. The condition has repercussions beyond the realm of public health. Twitter is one platform where people post reactions to events during the outbreak. User -generated information, like tweets, presents unique challenges for sentiment analysis on Twitter data. With that in mind, this work employs four methods for analyzing Twitter data in terms of sentiment: the vector space model (TF-IDF) with three different ensemble machine learning models (voting, bagging, and stacking) and BERT (Bidirectional Encoder Representations from Transformers). Experiments showed that BERT outperformed the other three techniques, with an F1 -score of 74%, a precision of 74%, and a recall of 74% for categorizing five sentiment classes on data from a Kaggle competition (Coronavirus tweets NLP-Text Classification).
引用
收藏
页码:507 / 516
页数:10
相关论文
共 31 条
[1]   Top Concerns of Tweeters During the COVID-19 Pandemic: Infoveillance Study [J].
Abd-Alrazaq, Alaa ;
Alhuwail, Dari ;
Househ, Mowafa ;
Hamdi, Mounir ;
Shah, Zubair .
JOURNAL OF MEDICAL INTERNET RESEARCH, 2020, 22 (04)
[2]   Developing a fake news identification model with advanced deep language transformers for Turkish COVID-19 misinformation data [J].
Bozuyla, Mehmet ;
Ozcift, Akin .
TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2022, 30 (03) :908-926
[3]   Sentiment Analysis of COVID-19 tweets by Deep Learning Classifiers-A study to show how popularity is affecting accuracy in social media [J].
Chakraborty, Koyel ;
Bhatia, Surbhi ;
Bhattacharyya, Siddhartha ;
Platos, Jan ;
Bag, Rajib ;
Hassanien, Aboul Ella .
APPLIED SOFT COMPUTING, 2020, 97
[4]   Natural Language Processing for Sentiment Analysis [J].
Chong, Wei Yen ;
Selvaretnam, Bhawani ;
Soon, Lay-Ki .
PROCEEDINGS 2014 4TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE WITH APPLICATIONS IN ENGINEERING AND TECHNOLOGY ICAIET 2014, 2014, :212-217
[5]   Topic detection and sentiment analysis in Twitter content related to COVID-19 from Brazil and the USA [J].
Garcia, Klaifer ;
Berton, Lilian .
APPLIED SOFT COMPUTING, 2021, 101
[6]   Efficient Hyperparameter Tuning with Grid Search for Text Categorization using kNN Approach with BM25 Similarity [J].
Ghawi, Raji ;
Pfeffer, Juergen .
OPEN COMPUTER SCIENCE, 2019, 9 (01) :160-180
[7]   Deep Learning-Based Sentiment Analysis for Roman Urdu Text [J].
Ghulam, Hussain ;
Zeng, Feng ;
Li, Wenjia ;
Xiao, Yutong .
2018 INTERNATIONAL CONFERENCE ON IDENTIFICATION, INFORMATION AND KNOWLEDGE IN THE INTERNET OF THINGS, 2019, 147 :131-135
[8]   Whether the weather will help us weather the COVID-19 pandemic: Using machine learning to measure twitter users' perceptions [J].
Gupta, Marichi ;
Bansal, Aditya ;
Jain, Bhav ;
Rochelle, Jillian ;
Oak, Atharv ;
Jalali, Mohammad S. .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2021, 145
[9]   A comparison of feature extraction techniques for malware analysis [J].
Imran, Mohammad ;
Afzal, Muhammad Tanvir ;
Qadir, Muhammad Abdul .
TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2017, 25 (02) :1173-1183
[10]  
Islam Muhammad Nazrul, 2020, IEEE Trans Artif Intell, V1, P258, DOI [10.1109/tai.2021.3062771, 10.1109/TAI.2021.3062771]