NLP and Machine Learning for Sentiment Analysis in COVID-19 Tweets: A Comparative Study

Cited by: 0

Authors:

Shaik, Shahedhadeennisa ^{[1]}

Chaitra, S.P. ^{[2]}

Affiliations:

[1] Department of Computer Science Engineering, Dayananda Sagar College of Engineering

[2] Dayananda Sagar College of Engineering

Source:

EAI Endorsed Transactions on Pervasive Health and Technology | 2024 / Vol. 10

Keywords:

Bidirectional Long Short-Term Memory (BLSTM); Decision Tree Classifier; K Nearest Neighbors (KNN); Logistic Regression; Machine Learning Algorithms; NLP (Natural Language Processing); Performance evaluation; Sentiment analysis; Sentiment classification

DOI:

10.4108/eetpht.10.7051

Abstract:

In response to the COVID-19 pandemic, a novel technique is presented for assessing the sentiment of individuals using Twitter data obtained from the UCI repository. Our approach identifies tweets with a discernible sentiment and then applies specific data preprocessing techniques to enhance data quality. We have developed a robust model capable of effectively discerning the sentiments behind these tweets. To evaluate the performance of our model, we employ four distinct machine learning algorithms: logistic regression, decision tree, k-nearest neighbors, and BLSTM. We classify the tweets into three categories: positive, neutral, and negative. Our performance evaluation is based on several key metrics, including accuracy, precision, recall, and F1-score. Our experimental results indicate that the proposed model accurately captures the perceptions of individuals regarding the COVID-19 pandemic. © 2024 Shahedhadeennisa Shaik et al.
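The four metrics named in the abstract can be illustrated for the three-class setting with a minimal Python sketch. The abstract does not state how per-class scores are combined, so macro-averaging (an unweighted mean over the three classes) is an assumption here, and the function name `evaluate` and the label strings are hypothetical, not taken from the paper.

```python
from collections import Counter

# Assumed label names for the three sentiment classes in the abstract.
LABELS = ("negative", "neutral", "positive")


def evaluate(y_true, y_pred, labels=LABELS):
    """Return accuracy and macro-averaged precision, recall, and F1.

    Macro-averaging is an assumption; the abstract does not specify it.
    """
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and equal in length")

    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1   # correct prediction for this class
        else:
            fp[pred] += 1    # predicted class gains a false positive
            fn[truth] += 1   # true class gains a false negative

    precisions, recalls, f1s = [], [], []
    for label in labels:
        p_den = tp[label] + fp[label]
        r_den = tp[label] + fn[label]
        precision = tp[label] / p_den if p_den else 0.0
        recall = tp[label] / r_den if r_den else 0.0
        pr_sum = precision + recall
        f1 = 2 * precision * recall / pr_sum if pr_sum else 0.0
        precisions.append(precision)
        recalls.append(recall)
        f1s.append(f1)

    n = len(labels)
    return {
        "accuracy": sum(tp.values()) / len(y_true),
        "precision": sum(precisions) / n,
        "recall": sum(recalls) / n,
        "f1": sum(f1s) / n,
    }


if __name__ == "__main__":
    # Toy labels for illustration only; not data from the study.
    truth = ["positive", "neutral", "negative", "positive"]
    preds = ["positive", "negative", "negative", "neutral"]
    print(evaluate(truth, preds))
```

The same function could score the output of any of the four classifiers, since it depends only on the true and predicted labels.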
