Deep Word Embedding and Language Model based Sentimental Analysis of COVID-19 Tweets

被引:0
作者
Qaiser, Sobia [1 ]
Akram, M. Usman [1 ]
机构
[1] Natl Univ Sci & Technol, Dept Comp & Software Engn, Islamabad, Pakistan
来源
2023 20TH INTERNATIONAL BHURBAN CONFERENCE ON APPLIED SCIENCES AND TECHNOLOGY, IBCAST 2023 | 2023年
关键词
sentiment analysis; Covid-19; tweets; coronavirus; Twitter; deep learning; deep word embedding; BERT;
D O I
10.1109/IBCAST59916.2023.10712912
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The great Covid-19 pandemic affected billions of people's lives personally and socially. The research involves the analysis of public's views and opinions shared on Twitter social media platform related to Covid-19 pandemic and its detrimental or non-detrimental effects on public's mental health by using machine learning algorithm. The main purpose of this research includes analyzing public views related to Covid-19 pandemic by classifying the Tweets collected from the Twitter social platform. The proposed approach combines deep word embedding with MiniLM as an encoder to produce word vectors of high-dimensionality to preserve the words' semantic information. The resultant word vectors were used to train the model for the classification of the tweet in five sentiments i.e. Positive, Extremely Positive, Negative, Extremely Negative, and Neutral. The methodology is tested using publicly available Kaggle dataset as well as privately collected tweets. The comparative evaluation of the models revealed that MiniLM outperformed existing BERT based counterparts and attained highest accuracy of 93% with the Kaggle dataset. This analysis can assist the medical health authorities to monitor health information, conduct, and plan interventions to lower the pandemic effect and can help government to take precautionary measures.
引用
收藏
页码:202 / 207
页数:6
相关论文
共 22 条
[1]  
Abdulaziz M, 2021, INT J ADV COMPUT SC, V12, P626
[2]  
[Anonymous], 2017, Improving the Accuracy of Pre-trained Word Embeddings for Sentiment Analysis
[3]  
Barbieri F., 2020, ACL Anthology
[4]   Sentimental Analysis of COVID-19 Tweets Using Deep Learning Models [J].
Chintalapudi, Nalini ;
Battineni, Gopi ;
Amenta, Francesco .
INFECTIOUS DISEASE REPORTS, 2021, 13 (02) :329-339
[5]  
dataverse.harvard.edu, Harvard Dataverse Dataset Online, DOI [10.7910/DVN/LW0BTB&version=9.1, DOI 10.7910/DVN/LW0BTB&VERSION=9.1]
[6]  
Deho OB, 2018, IEEE INT CONF ADAPT
[7]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[8]  
Farhan AN, 2017, 2017 4TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATICS, CONCEPTS, THEORY, AND APPLICATIONS (ICAICTA) PROCEEDINGS
[9]   COVID-19's Impact on Stock Prices Across Different Sectors-An Event Study Based on the Chinese Stock Market [J].
He, Pinglin ;
Sun, Yulong ;
Zhang, Ying ;
Li, Tao .
EMERGING MARKETS FINANCE AND TRADE, 2020, 56 (10) :2198-2212
[10]  
kaggle, Kaggle Dataset Online