Leveraging machine learning to analyze sentiment from COVID-19 tweets: A global perspective

Cited by: 7
Authors
Rahman, Md Mahbubar [1]
Khan, Nafiz Imtiaz [1]
Sarker, Iqbal H. [2]
Ahmed, Mohiuddin [3]
Islam, Muhammad Nazrul [1]
Affiliations
[1] Mil Inst Sci & Technol MIST, Dept Comp Sci & Engn, Dhaka 1216, Bangladesh
[2] Chittagong Univ Engn & Technol, Dept Comp Sci & Engn, Chittagong, Bangladesh
[3] Edith Cowan Univ, Sch Sci, Joondalup, WA, Australia
Keywords
coronavirus; COVID-19; deep neural network; machine learning; outbreak; pandemic; prediction; sentiment analysis; social media; interrater reliability
DOI
10.1002/eng2.12572
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Since the advent of the worldwide COVID-19 pandemic, analyzing public sentiment has become a major concern for policy and decision-makers. While the priority is to curb the spread of the virus, analyzing the sentiment of the mass population (users) is equally important. Although sentiment analysis using various state-of-the-art technologies has received attention during the COVID-19 pandemic, the reasons behind the variations in public sentiment are yet to be explored. Moreover, how user sentiment during the pandemic varies across countries has received less attention. Therefore, the objectives of this study are: to identify the most effective machine learning (ML) technique for classifying public sentiment, to analyze the variations of public sentiment across the globe, and to find the critical factors contributing to sentiment variations. To attain these objectives, 12,000 tweets, 3,000 each from the USA, UK, and Bangladesh, were rigorously annotated by three independent reviewers. Based on the labeled tweets, four boosting ML models, namely CatBoost, gradient boost, AdaBoost, and XGBoost, were investigated. Next, the top-performing ML model predicted the sentiment of 300,000 tweets (100,000 from each country), and public perceptions were analyzed based on the labeled data. First, the CatBoost model achieved the highest F1-score (85.8%), followed by gradient boost (84.3%), XGBoost (83.1%), and AdaBoost (78.9%). Second, during the COVID-19 pandemic, the sentiments of the people of the three countries were mainly negative, followed by positive and neutral. Finally, this study identified a few critical concerns that primarily drive the variation in public sentiment around the globe: lockdown, quarantine, hospital, mask, vaccine, and the like.
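The abstract compares the four models by F1-score. As a minimal sketch of how such a score can be computed for a three-class sentiment task, the Python below averages per-class F1 over the negative, neutral, and positive labels. The macro averaging, the function name `macro_f1`, and the toy labels are assumptions made for illustration; the abstract does not state which averaging scheme the authors used, and none of the data below comes from the study.

```python
# Illustrative only: macro-averaged F1 for a three-class sentiment task.
# The averaging scheme and the toy labels are assumptions, not the study's data.

LABELS = ("negative", "neutral", "positive")


def macro_f1(y_true, y_pred, labels=LABELS):
    """Return the unweighted mean of the per-class F1 scores."""
    scores = []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(t == label and p == label for t, p in pairs)  # true positives
        fp = sum(t != label and p == label for t, p in pairs)  # false positives
        fn = sum(t == label and p != label for t, p in pairs)  # false negatives
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); treat an absent class as 0.0
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Hypothetical reviewer labels and model predictions for six tweets.
y_true = ["negative", "negative", "positive", "neutral", "positive", "negative"]
y_pred = ["negative", "positive", "positive", "neutral", "negative", "negative"]

print(f"macro F1 = {macro_f1(y_true, y_pred):.3f}")
```

Macro averaging weights each sentiment class equally, which matters when one class (here, negative) dominates the data; a weighted or micro average would give a different figure on the same predictions.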
Pages: 23