NLP and Machine Learning for Sentiment Analysis in COVID-19 Tweets: A Comparative Study

被引:0
作者
Shaik, Shahedhadeennisa [1 ]
Chaitra, S.P. [2 ]
机构
[1] Department of Computer Science Engineering, Dayananda Sagar College of Engineering
[2] Dayananda Sagar College of Engineering, Dayananda Sagar College of Engineering
关键词
Bidi-rectional Long Short-Term Memory (BLSTM); Decision Tree Classifier; K Nearest Neighbors (KNN); Logistic Regression; Machine learning Algorithms; NLP (Natural Language Processing); Performance evaluation; Sentiment analysis; Sentiment classification;
D O I
10.4108/eetpht.10.7051
中图分类号
学科分类号
摘要
In response to the COVID-19 pandemic, a novel technique is given forassessing the sentiment of individuals using Twitter data obtained from the UCI repository. Our approach involves the identification of tweets with a discernible sentiment, followed by the application of specific data preprocessing techniques to enhance data quality. We have developed a robust model capable of effectivelydiscerning the sentiments behind these tweets. To evaluate the performance of our model, we employ four distinct machine learning algorithms: logistic regression, decision tree, k-nearest neighbor and BLSTM. We classify the tweets into three categories: positive, neutral, and negative sentiments. Our performance evaluation is based on several key metrics, including accuracy, precision, recall,and F1-score. Our experimental results indicate that our proposed model excels in accurately capturing the perceptions of individuals regarding the COVID-19 pandemic. © 2024 Shahedhadeennisa Shaik et al.
引用
收藏
相关论文
共 50 条
[1]  
Ahmad N., Siddique J., Personality assessment using Twitter tweets, Procedia Com-put. Sci, 112, pp. 1964-1973, (2017)
[2]  
Ahmad T., Ramsay A., Ahmed H., Detecting emotions in English and Arabic tweets, Information, 10, 3, (2019)
[3]  
Bandi A., Fellah A., Socio-analyzer: A sentiment analysis using social media data, inProc. 28th Int. Conf. Softw. Eng. Data Eng., in EPiC Series in Computing, 64
[4]  
Dascalu S., Sharma S., Wu R., pp. 61-67, (2019)
[5]  
Barbieri F., Saggion H., Automatic detection of irony and humour in Twitter, Proc.ICCC, pp. 155-162, (2014)
[6]  
Bhat R., Singh V. K., Naik N., Kamath C. R., Mulimani P., Kulkarni N., COVID 2019 outbreak: The disappointment in Indian teachers, Asian J. Psychiatry, 50, (2020)
[7]  
Blei D. M., Ng A. Y., Jordan M. I., Latent Dirichlet allocation, J. Mach. Learn. Res, 3, pp. 993-1022, (2003)
[8]  
Boldog P., Tekeli T., Vizi Z., Denes A., Bartha F. A., Rost G., Risk assessment of novel coronavirus COVID-19 outbreaks outside China, J. Clin. Med, 9, 2, (2020)
[9]  
Carducci G., Rizzo G., Monti D., Palumbo E., Morisio M., TwitPersonality: Compu-ting personality traits from tweets using word embeddings and supervised learning, Infor-mation, 9, 5, (2018)
[10]  
Carreras X., Marquez L., Boosting trees for anti-spam email filtering, (2001)