Leveraging machine learning to analyze sentiment from COVID-19 tweets: A global perspective

被引:7
|
作者
Rahman, Md Mahbubar [1 ]
Khan, Nafiz Imtiaz [1 ]
Sarker, Iqbal H. [2 ]
Ahmed, Mohiuddin [3 ]
Islam, Muhammad Nazrul [1 ]
机构
[1] Mil Inst Sci & Technol MIST, Dept Comp Sci & Engn, Dhaka 1216, Bangladesh
[2] Chittagong Univ Engn & Technol, Dept Comp Sci & Engn, Chittagong, Bangladesh
[3] Edith Cowan Univ, Sch Sci, Joondalup, WA, Australia
关键词
coronavirus; COVID-19; deep neural network; machine learning; outbreak; pandemic; prediction; sentiment analysis; social media; INTERRATER RELIABILITY;
D O I
10.1002/eng2.12572
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Since the advent of the worldwide COVID-19 pandemic, analyzing public sentiment has become one of the major concerns for policy and decision-makers. While the priority is to curb the spread of the virus, mass population (user) sentiment analysis is equally important. Though sentiment analysis using different state-of-the-art technologies has been focused on during the COVID-19 pandemic, the reasons behind the variations in public sentiment are yet to be explored. Moreover, how user sentiment varies due to the COVID-19 pandemic from a cross-country perspective has been less focused on. Therefore, the objectives of this study are: to identify the most effective machine learning (ML) technique for classifying public sentiments, to analyze the variations of public sentiment across the globe, and to find the critical contributing factors to sentiment variations. To attain the objectives, 12,000 tweets, 3000 each from the USA, UK, and Bangladesh, were rigorously annotated by three independent reviewers. Based on the labeled tweets, four different boosting ML models, namely, CatBoost, gradient boost, AdaBoost, and XGBoost, are investigated. Next, the top performed ML model predicted sentiment of 300,000 data (100,000 from each country). The public perceptions have been analyzed based on the labeled data. As an outcome, the CatBoost model showed the highest (85.8%) F1-score, followed by gradient boost (84.3%), AdaBoost (78.9%), and XGBoost (83.1%). Second, it was revealed that during the time of the COVID-19 pandemic, the sentiments of the people of the three countries mainly were negative, followed by positive and neutral. Finally, this study identified a few critical concerns that impact primarily varying public sentiment around the globe: lockdown, quarantine, hospital, mask, vaccine, and the like.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] COVID-19 Public Sentiment Insights and Machine Learning for Tweets Classification
    Samuel, Jim
    Ali, G. G. Md Nawaz
    Rahman, Md Mokhlesur
    Esawi, Ek
    Samuel, Yana
    INFORMATION, 2020, 11 (06)
  • [2] Sentiment Analysis of COVID-19 Tweets by Machine Learning and Deep Learning Classifiers
    Jain, Ritanshi
    Bawa, Seema
    Sharma, Seemu
    ADVANCES IN DATA AND INFORMATION SCIENCES, 2022, 318 : 329 - 339
  • [3] Sentiment Analysis on COVID-19 Vaccine Tweets using Machine Learning and Deep Learning Algorithms
    Jain, Tarun
    Verma, Vivek Kumar
    Sharma, Akhilesh Kumar
    Saini, Bhavna
    Purohit, Nishant
    Mahdin, Hairulnizam
    Ahmad, Masitah
    Darman, Rozanawati
    Haw, Su-Cheng
    Shaharudin, Shazlyn Milleana
    Arshad, Mohammad Syafwan
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (05) : 32 - 41
  • [4] Leveraging natural language processing and geospatial time series model to analyze COVID-19 vaccination sentiment dynamics on Tweets
    Ye, Jiancheng
    Hai, Jiarui
    Wang, Zidan
    Wei, Chumei
    Song, Jiacheng
    JAMIA OPEN, 2023, 6 (02)
  • [5] Sentiment Analysis of Bangladesh-specific COVID-19 Tweets using Deep Neural Network
    Islam, Muhammad Nazrul
    Khan, Nafiz Imtiaz
    Roy, Ayon
    Rahman, Md. Mahbubar
    Mukta, Saddam Hossain
    Islam, A. K. M. Najmul
    2021 62ND INTERNATIONAL SCIENTIFIC CONFERENCE ON INFORMATION TECHNOLOGY AND MANAGEMENT SCIENCE OF RIGA TECHNICAL UNIVERSITY (ITMS), 2021,
  • [6] TClustVID: A novel machine learning classification model to investigate topics and sentiment in COVID-19 tweets
    Satu, Md Shahriare
    Khan, Md Imran
    Mahmud, Mufti
    Uddin, Shahadat
    Summers, Matthew A.
    Quinn, Julian M. W.
    Moni, Mohammad Ali
    KNOWLEDGE-BASED SYSTEMS, 2021, 226
  • [7] A Deep Learning Approach for Sentiment Classification of COVID-19 Vaccination Tweets
    Said, Haidi
    Tawfik, BenBella S.
    Makhlouf, Mohamed A.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (04) : 530 - 538
  • [8] Leveraging Tweets for Artificial Intelligence Driven Sentiment Analysis on the COVID-19 Pandemic
    Alkhaldi, Nora A.
    Asiri, Yousef
    Mashraqi, Aisha M.
    Halawani, Hanan T.
    Abdel-Khalek, Sayed
    Mansour, Romany F.
    HEALTHCARE, 2022, 10 (05)
  • [9] NLP and Machine Learning for Sentiment Analysis in COVID-19 Tweets: A Comparative Study
    Shaik, Shahedhadeennisa
    Chaitra, S.P.
    EAI Endorsed Transactions on Pervasive Health and Technology, 2024, 10
  • [10] Sentiment Analysis of Pandemic Tweets with COVID-19 as a Prototype
    Almutiri, Mashail
    Alghamdi, Mona
    Elazhary, Hanan
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (04) : 510 - 518