Deep Sentiment Classification and Topic Discovery on Novel Coronavirus or COVID-19 Online Discussions: NLP Using LSTM Recurrent Neural Network Approach

被引:251
作者
Jelodar, Hamed [1 ]
Wang, Yongli [1 ]
Orji, Rita [2 ]
Huang, Shucheng [3 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Technol, Nanjing 210094, Peoples R China
[2] Dalhousie Univ, Fac Comp Sci, Halifax, NS, Canada
[3] Jiangsu Univ Sci & Technol, Sch Comp, Zhenjiang 212003, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantics; Natural language processing; Social networking (online); Viruses (medical); Computational modeling; COVID-19; Coronavirus; Natural Language Processing; Topic modeling; Deep Learning; HEALTH; FUZZY; MODEL;
D O I
10.1109/JBHI.2020.3001216
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Internet forums and public social media, such as online healthcare forums, provide a convenient channel for users (people/patients) concerned about health issues to discuss and share information with each other. In late December 2019, an outbreak of a novel coronavirus (infection from which results in the disease named COVID-19) was reported, and, due to the rapid spread of the virus in other parts of the world, the World Health Organization declared a state of emergency. In this paper, we used automated extraction of COVID-19-related discussions from social media and a natural language process (NLP) method based on topic modeling to uncover various issues related to COVID-19 from public opinions. Moreover, we also investigate how to use LSTM recurrent neural network for sentiment classification of COVID-19 comments. Our findings shed light on the importance of using public opinions and suitable computational techniques to understand issues surrounding COVID-19 and to guide related decision-making. In addition, experiments demonstrated that the research model achieved an accuracy of 81.15% - a higher accuracy than that of several other well-known machine-learning algorithms for COVID-19-Sentiment Classification.
引用
收藏
页码:2733 / 2742
页数:10
相关论文
共 47 条
[1]  
Abualigah L., 2020, Recent Adv NLP: Case Arabic Language, P129
[2]   Improving the Reliability of Deep Neural Networks in NLP: A Review [J].
Alshemali, Basemah ;
Kalita, Jugal .
KNOWLEDGE-BASED SYSTEMS, 2020, 191
[3]  
Balamurali A., 2020, NEURAL NETWORKS NATU, P122, DOI [DOI 10.4018/978-1-7998-1159, 10.4018/978-1-7998-1159]
[4]   Unsupervised Classification of Health Content on Reddit [J].
Barros, Joana M. ;
Buitelaar, Paul ;
Duggan, Jim ;
Rebholz-Schuhmann, Dietrich .
PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON DIGITAL PUBLIC HEALTH (DPH '19), 2019, :85-89
[6]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[7]  
Callison-Burch C., 2019, POSTGRAD MED
[8]   Detecting and Characterizing Trends in Online Mental Health Discussions [J].
Chakravorti, Dante ;
Law, Kathleen ;
Gemmell, Jonathan ;
Raicu, Daniela .
2018 18TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2018, :697-706
[9]  
Dhama K, 2020, CLIN MICROBIOL REV, V33, DOI [10.1128/CMR.00028-20, 10.1038/s41432-020-0088-4]
[10]  
Doumit S, 2011, 2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), P3068, DOI 10.1109/IJCNN.2011.6033626