Sentiment Analysis on COVID-19 Vaccine Tweets using Machine Learning and Deep Learning Algorithms

被引:0
作者
Jain, Tarun [1 ]
Verma, Vivek Kumar [1 ]
Sharma, Akhilesh Kumar [2 ]
Saini, Bhavna [3 ]
Purohit, Nishant [1 ]
Mahdin, Hairulnizam [4 ]
Ahmad, Masitah [5 ]
Darman, Rozanawati [4 ]
Haw, Su-Cheng [6 ,7 ]
Shaharudin, Shazlyn Milleana [8 ]
Arshad, Mohammad Syafwan [9 ]
机构
[1] Manipal Univ Jaipur, Dehmi Kalan, Jaipur Ajmer Expressway, Jaipur 303007, Rajasthan, India
[2] Manipal Univ Jaipur, Sch Informat Technol, Jaipur, Rajasthan, India
[3] Cent Univ Rajasthan, Rajasthan, India
[4] Univ Tun Hussein Onn Malaysia, Fac Comp Sci & Informat Technol, Parit Raja, Malaysia
[5] Multimedia Univ, Fac Comp & Informat, Jalan Multimedia, Cyberjaya 63100, Malaysia
[6] Univ Pendidikan Sultan Idris, Fac Sci & Math, Dept Math, Perak, Malaysia
[7] Columbia Univ, Dept Stat, New York, NY USA
[8] Univ Teknol MARA Shah Alam, Fac Comp & Math Sci, Selangor, Malaysia
[9] MZR Global Sdn Bhd, Jalan Kristal K7-K, Seksyen 7,Malaysia 12, Shah Alam 40000, Selangor, Malaysia
关键词
Covid-19; vaccine; sentiment analysis; machine learning; deep learning; natural language processing;
D O I
10.14569/IJACSA.2023.0140504
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
One of the main functions of NLP (Natural Language Processing) is to analyze a sentiment or opinion of the text considered. In this research the objective is to analyze the sentiment in the form of tweets towards the Covid-19 vaccination. In this study, the collected tweets are in the form of a dataset from Kaggle that have been categorized into positive and negative depending on the polarity of the sentiment in that tweet, to visualize the overall situation. The reviews are translated into vector representations using various techniques, including Bag-Of-Words and TF-IDF to ensure the best result. Machine learning algorithms like Logistic Regression, Naive Bayes, Support Vector Machine (SVM) and others, and Deep Learning algorithms like LSTM and Bert were used to train the predictive models. The performance metrics used to test the performance of the models show that Support Vector Machine (SVM) achieved the highest accuracy of 88.7989% among the machine learning models. Compared to the related research papers the highest accuracy obtained using LSTM is 90.59 % and our model has predicted with the highest accuracy of 90.42% using BERT techniques.
引用
收藏
页码:32 / 41
页数:10
相关论文
共 29 条
[1]   RETRACTED: Deep Learning-Based Sentiment Analysis of COVID-19 Vaccination Responses from Twitter Data (Retracted Article) [J].
Alam, Kazi Nabiul ;
Khan, Md Shakib ;
Dhruba, Abdur Rab ;
Khan, Mohammad Monirujjaman ;
Al-Amri, Jehad F. ;
Masud, Mehedi ;
Rawashdeh, Majdi .
COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2021, 2021
[2]   Comparing two sentiment analysis approaches by understand the hesitancy to COVID-19 vaccine based on Twitter data in two cultures [J].
Alsabban, Malak A. .
PROCEEDINGS OF THE 13TH ACM WEB SCIENCE CONFERENCE, COMPANION VOLUME, WEBSCI 2021, 2021, :143-144
[3]   Worldwide COVID-19 Vaccines Sentiment Analysis Through Twitter Content [J].
Ansari, Md Tarique Jamal ;
Khan, Naseem Ahmad .
ELECTRONIC JOURNAL OF GENERAL MEDICINE, 2021, 18 (06)
[4]   Aspect Based Twitter Sentiment Analysis on Vaccination and Vaccine Types in COVID-19 Pandemic With Deep Learning [J].
Aygun, Irfan ;
Kaya, Buket ;
Kaya, Mehmet .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (05) :2360-2369
[5]   Public Opinions about Online Learning during COVID-19: A Sentiment Analysis Approach [J].
Bhagat, Kaushal Kumar ;
Mishra, Sanjaya ;
Dixit, Alakh ;
Chang, Chun-Yen .
SUSTAINABILITY, 2021, 13 (06)
[6]   COVID-19 Outbreak Forecasting Based on Vaccine Rates and Tweets Classification [J].
Didi, Y. ;
Walha, A. ;
Ben Halima, M. ;
Wali, A. .
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
[7]  
Dua S., MEDIUM
[8]  
Dubey AD, 2021, PUBLIC SENTIMENT ANA
[9]  
Ikonomakis M., 2005, WSEAS Transactions on Computers, V4, P966
[10]  
Joshi Bhavya, 2022, International Journal of Computers and Applications, P907, DOI 10.1080/1206212X.2021.1983289