A Natural Language Processing (NLP) Evaluation on COVID-19 Rumour Dataset Using Deep Learning Techniques

被引:9
作者
Fatima, Rubia [1 ]
Samad Shaikh, Naila [2 ]
Riaz, Adnan [3 ]
Ahmad, Sadique [4 ,5 ]
El-Affendi, Mohammed A. A. [4 ]
Alyamani, Khaled A. Z. [6 ]
Nabeel, Muhammad [7 ]
Ali Khan, Javed [8 ]
Yasin, Affan [1 ]
Latif, Rana M. Amir [9 ]
机构
[1] Tsinghua Univ, Sch Software, Beijing, Peoples R China
[2] Govt Degree Coll Women, Bosan Rd, Multan, Pakistan
[3] Air Univ, Fac Comp & Artificial Intelligence, Dept Creat Technol, Islamabad, Pakistan
[4] Prince Sultan Univ, Coll Comp & Informat Sci, EIAS, Data Sci & Blockchain Lab, Riyadh 11586, Saudi Arabia
[5] Bahria Univ, Dept Comp Sci, Karachi Campus, Karachi, Pakistan
[6] King Faisal Univ, Appl Coll, Abqaiq Branch, POB 4000, Al Hufuf 31982, Saudi Arabia
[7] South China Univ Technol, Sch Software Engn, Guangzhou, Peoples R China
[8] Univ Sci & Technol Bannu, Dept Software Engn, Bannu, Pakistan
[9] COMSATS Univ Islamabad, Dept Comp Sci, Sahiwal Campus, Islamabad, Pakistan
关键词
GAME;
D O I
10.1155/2022/6561622
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Context and Background: Since December 2019, the coronavirus (COVID-19) epidemic has sparked considerable alarm among the general community and significantly affected societal attitudes and perceptions. Apart from the disease itself, many people suffer from anxiety and depression due to the disease and the present threat of an outbreak. Due to the fast propagation of the virus and misleading/fake information, the issues of public discourse alter, resulting in significant confusion in certain places. Rumours are unproven facts or stories that propagate and promote sentiments of prejudice, hatred, and fear. Objective. The study's objective is to propose a novel solution to detect fake news using state-of-the-art machines and deep learning models. Furthermore, to analyse which models outperformed in detecting the fake news. Method. In the research study, we adapted a COVID-19 rumours dataset, which incorporates rumours from news websites and tweets, together with information about the rumours. It is important to analyse data utilizing Natural Language Processing (NLP) and Deep Learning (DL) approaches. Based on the accuracy, precision, recall, and the f1 score, we can assess the effectiveness of the ML and DL algorithms. Results. The data adopted from the source (mentioned in the paper) have collected 9200 comments from Google and 34,779 Twitter postings filtered for phrases connected with COVID-19-related fake news. Experiment 1. The dataset was assessed using the following three criteria: veracity, stance, and sentiment. In these terms, we have different labels, and we have applied the DL algorithms separately to each term. We have used different models in the experiment such as (i) LSTM and (ii) Temporal Convolution Networks (TCN). The TCN model has more performance on each measurement parameter in the evaluated results. So, we have used the TCN model for the practical implication for better findings. Experiment 2. In the second experiment, we have used different state-of-the-art deep learning models and algorithms such as (i) Simple RNN; (ii) LSTM + Word Embedding; (iii) Bidirectional + Word Embedding; (iv) LSTM + CNN-1D; and (v) BERT. Furthermore, we have evaluated the performance of these models on all three datasets, e.g., veracity, stance, and sentiment. Based on our second experimental evaluation, the BERT has a superior performance over the other models compared.
引用
收藏
页数:17
相关论文
共 53 条
[1]   Prediction of COVID-19 confirmed cases combining deep learning methods and Bayesian optimization [J].
Abbasimehr, Hossein ;
Paki, Reza .
CHAOS SOLITONS & FRACTALS, 2021, 142
[2]  
Agade A., 2020, Exploring the non-medical impacts of Covid-19 using natural language processing
[3]   Correlation of Chest CT and RT-PCR Testing for Coronavirus Disease 2019 (COVID-19) in China: A Report of 1014 Cases [J].
Ai, Tao ;
Yang, Zhenlu ;
Hou, Hongyan ;
Zhan, Chenao ;
Chen, Chong ;
Lv, Wenzhi ;
Tao, Qian ;
Sun, Ziyong ;
Xia, Liming .
RADIOLOGY, 2020, 296 (02) :E32-E40
[4]   Google Play Content Scraping and Knowledge Engineering using Natural Language Processing Techniques with the Analysis of User Reviews [J].
Aldabbas, Hamza ;
Bajahzar, Abdullah ;
Alruily, Meshrif ;
Qureshi, Ali Adil ;
Latif, Rana M. Amir ;
Farhan, Muhammad .
JOURNAL OF INTELLIGENT SYSTEMS, 2021, 30 (01) :192-208
[5]   Natural language processing of German clinical colorectal cancer notes for guideline-based treatment evaluation [J].
Becker, Matthias ;
Kasper, Stefan ;
Boeckmann, Britta ;
Joeckel, Karl-Heinz ;
Virchow, Isabel .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2019, 127 :141-146
[6]   Diagnosis of Diabetic Retinopathy through Retinal Fundus Images and 3D Convolutional Neural Networks with Limited Number of Samples [J].
Bin Tufail, Ahsan ;
Ullah, Inam ;
Khan, Wali Ullah ;
Asif, Muhammad ;
Ahmad, Ijaz ;
Ma, Yong-Kui ;
Khan, Rahim ;
Kalimullah ;
Ali, Md Sadek .
WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021
[7]   A Novel COVID-19 Data Set and an Effective Deep Learning Approach for the De-Identification of Italian Medical Records [J].
Catelli, Rosario ;
Gargiulo, Francesco ;
Casola, Valentina ;
De Pietro, Giuseppe ;
Fujita, Hamido ;
Esposito, Massimo .
IEEE ACCESS, 2021, 9 :19097-19110
[8]  
Chapman AB, 2020, INFECT DIS INFORMATI, P279
[9]   Artificial Intelligence in Action: Addressing the COVID-19 Pandemic with Natural Language Processing [J].
Chen, Qingyu ;
Leaman, Robert ;
Allot, Alexis ;
Luo, Ling ;
Wei, Chih-Hsuan ;
Yan, Shankai ;
Lu, Zhiyong .
ANNUAL REVIEW OF BIOMEDICAL DATA SCIENCE, VOL 4, 2021, 4 :313-339
[10]   A COVID-19 Rumor Dataset [J].
Cheng, Mingxi ;
Wang, Songli ;
Yan, Xiaofeng ;
Yang, Tianqi ;
Wang, Wenshuo ;
Huang, Zehao ;
Xiao, Xiongye ;
Nazarian, Shahin ;
Bogdan, Paul .
FRONTIERS IN PSYCHOLOGY, 2021, 12