Analysis of Sentiments on the Onset of COVID-19 Using Machine Learning Techniques

被引:5
|
作者
Arya, Vishakha [1 ]
Mishra, Amit Kumar [1 ]
Gonzalez-Briones, Alfonso [2 ,3 ,4 ]
机构
[1] DIT Univ, Sch Comp, Dehra Dun 248, Uttarakhand, India
[2] Univ Complutense Madrid, Res Grp Agent Based Social & Interdisciplinary Ap, Madrid 28040, Spain
[3] Univ Salamanca, BISITE Res Grp, Calle Espejo S-N Edificio Multiusos I D I, Salamanca 37007, Spain
[4] Air Inst, IoT Digital Innovat Hub, Calle Segunda 4, Salamanca 37188, Spain
来源
ADCAIJ-ADVANCES IN DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE JOURNAL | 2022年 / 11卷 / 01期
关键词
Sentiment analysis; COVID-19; TF-IDF; Linear SVC; machine learning; NLTK; GBM; random forest; MENTAL-HEALTH;
D O I
10.14201/adcaij.27348
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The novel coronavirus (COVID-19) pandemic has struck the whole world and is one of the most striking topics on social media platforms. Sentiment outbreak on social media enduring various thoughts, opinions, and emotions about the COVID-19 disease, expressing views they are feeling presently. Analyzing sentiments helps to yield better results. Gathering data from different blogging sites like Facebook, Twitter, Weibo, YouTube, Instagram, etc., and Twitter is the largest repository. Videos, text, and audio were also collected from repositories. Sentiment analysis uses opinion mining to acquire the sentiments of its users and categorizes them accordingly as positive, negative, and neutral. Analytical and machine learning classification is implemented to 3586 tweets collected in different time frames. In this paper, sentiment analysis was performed on tweets accumulated during the COVID-19 pandemic, Coronavirus disease. Tweets are collected from the Twitter database using Hydrator a web-based application. Data-preprocessing removes all the noise, outliers from the raw data. With Natural Language Toolkit (NLTK), text classification for sentiment analysis and calculate the score subjective polarity, counts, and sentiment distribution. N-gram is used in textual mining -and Natural Language Processing for a continuous sequence of words in a text or document applying uni-gram, bi-gram, and tri-gram for statistical computation. Term frequency and Inverse document frequency (TF-IDF) is a feature extraction technique that converts textual data into numeric form. Vectorize data feed to our model to obtain insights from linguistic data. Linear SVC, MultinomialNB, GBM, and Random Forest classifier with Tfidf classification model applied to our proposed model. Linear Support Vector classification performs better than the other two classifiers. Results depict that RF performs better.
引用
收藏
页码:45 / 63
页数:19
相关论文
共 50 条
  • [1] Comparing tweet sentiments in megacities using machine learning techniques: In the midst of COVID-19
    Yao, Zhirui
    Yang, Junyan
    Liu, Jialin
    Keith, Michael
    Guan, Chenghe
    CITIES, 2021, 116
  • [2] Evaluating Public Sentiments of Covid-19 Vaccine Tweets Using Machine Learning Techniques
    Akpatsa, Samuel Kofi
    Lei, Hang
    Li, Xiaoyu
    Obeng, Victor-Hillary Kofi Setornyo
    INFORMATICA-AN INTERNATIONAL JOURNAL OF COMPUTING AND INFORMATICS, 2022, 46 (01): : 69 - 75
  • [3] Sentiments Analysis of Covid-19 Vaccine Tweets Using Machine Learning and Vader Lexicon Method
    Arya, Vishakha
    Mishra, Amit Kumar
    Gonzalez-Briones, Alfonso
    ADCAIJ-ADVANCES IN DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE JOURNAL, 2022, 11 (04): : 507 - 518
  • [4] Multilayer hybrid ensemble machine learning model for analysis of Covid-19 vaccine sentiments
    Jain, Vipin
    Kashyap, Kanchan Lata
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (05) : 6307 - 6319
  • [5] Multilayer hybrid ensemble machine learning model for analysis of Covid-19 vaccine sentiments
    Jain, Vipin
    Kashyap, Kanchan Lata
    Journal of Intelligent and Fuzzy Systems, 2022, 43 (05): : 6307 - 6319
  • [6] COVID-19 Mortality Prediction Using Machine Learning Techniques
    Schirato, Lindsay
    Makina, Kennedy
    Flanders, Dwayne
    Pouriyeh, Seyedamin
    Shahriar, Hossain
    2021 IEEE INTERNATIONAL CONFERENCE ON DIGITAL HEALTH (ICDH 2021), 2021, : 197 - 202
  • [7] Covid-19 analysis by using machine and deep learning
    Yadav D.
    Maheshwari H.
    Chandra U.
    Sharma A.
    Studies in Big Data, 2020, 80 : 31 - 63
  • [8] Analysis and Prediction of COVID-19 using Machine Learning
    Parthiban, M.
    Alphy, Anna
    2024 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION AND APPLIED INFORMATICS, ACCAI 2024, 2024,
  • [9] Detecting malicious COVID-19 URLs using machine learning techniques
    Ispahany, Jamil
    Islam, Rafiqul
    2021 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS WORKSHOPS AND OTHER AFFILIATED EVENTS (PERCOM WORKSHOPS), 2021, : 718 - 723
  • [10] Practical Machine Learning Techniques for COVID-19 Detection Using Chest
    Mangalmurti, Yurananatul
    Wattanapongsakorn, Naruemon
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2022, 34 (02): : 733 - 752