Deep Ensemble Fake News Detection Model Using Sequential Deep Learning Technique

被引:17
作者
Ali, Abdullah Marish [1 ]
Ghaleb, Fuad A. [2 ,3 ]
Al-Rimy, Bander Ali Saleh [2 ]
Alsolami, Fawaz Jaber [1 ]
Khan, Asif Irshad [1 ]
机构
[1] King Abdulaziz Univ, Fac Comp & Informat Technol, Dept Comp Sci, Jeddah 21589, Saudi Arabia
[2] Univ Teknol Malaysia, Fac Engn, Sch Comp, Johor Baharu 81310, Malaysia
[3] Sanaa Community Coll, Dept Comp Engn & Elect, Sanaa 5695, Yemen
关键词
fake news detection; misinformation; two-stage classification; deep learning; ensemble model;
D O I
10.3390/s22186970
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Recently, fake news has been widely spread through the Internet due to the increased use of social media for communication. Fake news has become a significant concern due to its harmful impact on individual attitudes and the community's behavior. Researchers and social media service providers have commonly utilized artificial intelligence techniques in the recent few years to rein in fake news propagation. However, fake news detection is challenging due to the use of political language and the high linguistic similarities between real and fake news. In addition, most news sentences are short, therefore finding valuable representative features that machine learning classifiers can use to distinguish between fake and authentic news is difficult because both false and legitimate news have comparable language traits. Existing fake news solutions suffer from low detection performance due to improper representation and model design. This study aims at improving the detection accuracy by proposing a deep ensemble fake news detection model using the sequential deep learning technique. The proposed model was constructed in three phases. In the first phase, features were extracted from news contents, preprocessed using natural language processing techniques, enriched using n-gram, and represented using the term frequency-inverse term frequency technique. In the second phase, an ensemble model based on deep learning was constructed as follows. Multiple binary classifiers were trained using sequential deep learning networks to extract the representative hidden features that could accurately classify news types. In the third phase, a multi-class classifier was constructed based on multilayer perceptron (MLP) and trained using the features extracted from the aggregated outputs of the deep learning-based binary classifiers for final classification. The two popular and well-known datasets (LIAR and ISOT) were used with different classifiers to benchmark the proposed model. Compared with the state-of-the-art models, which use deep contextualized representation with convolutional neural network (CNN), the proposed model shows significant improvements (2.41%) in the overall performance in terms of the F1score for the LIAR dataset, which is more challenging than other datasets. Meanwhile, the proposed model achieves 100% accuracy with ISOT. The study demonstrates that traditional features extracted from news content with proper model design outperform the existing models that were constructed based on text embedding techniques.
引用
收藏
页数:18
相关论文
共 48 条
[1]   Analysis of Classifiers for Fake News Detection [J].
Agarwala, Vasu ;
Sultanaa, H. Parveen ;
Malhotra, Srijan ;
Sarkar, Amitrajit .
2ND INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ADVANCED COMPUTING ICRTAC -DISRUP - TIV INNOVATION , 2019, 2019, 165 :377-383
[2]   Detection of Online Fake News Using N-Gram Analysis and Machine Learning Techniques [J].
Ahmed, Hadeer ;
Traore, Issa ;
Saad, Sherif .
INTELLIGENT, SECURE, AND DEPENDABLE SYSTEMS IN DISTRIBUTED AND CLOUD ENVIRONMENTS (ISDDC 2017), 2017, 10618 :127-138
[3]   A Pseudo Feedback-Based Annotated TF-IDF Technique for Dynamic Crypto-Ransomware Pre-Encryption Boundary Delineation and Features Extraction [J].
Al-Rimy, Bander Ali Saleh ;
Maarof, Mohd Aiziani ;
Alazab, Mamoun ;
Alsolami, Fawaz ;
Shaid, Syed Zainudeen Mohd ;
Ghaleb, Fuad A. ;
Al-Hadhrami, Tawfik ;
Ali, Abdullah Marish .
IEEE ACCESS, 2020, 8 :140586-140598
[4]   Evaluating Intelligent Methods for Detecting COVID-19 Fake News on Social Media Platforms [J].
Alhakami, Hosam ;
Alhakami, Wajdi ;
Baz, Abdullah ;
Faizan, Mohd ;
Khan, Mohd Waris ;
Agrawal, Alka .
ELECTRONICS, 2022, 11 (15)
[5]   Social Media and Fake News in the 2016 Election [J].
Allcott, Hunt ;
Gentzkow, Matthew .
JOURNAL OF ECONOMIC PERSPECTIVES, 2017, 31 (02) :211-235
[6]  
Ansar W., 2021, INT J INFORM MANAGE, V1, P100052, DOI DOI 10.1016/J.JJIMEI.2021.100052
[7]   Fake News Detection using Bi-directional LSTM-Recurrent Neural Network [J].
Bahad, Pritika ;
Saxena, Preeti ;
Kamal, Raj .
2ND INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ADVANCED COMPUTING ICRTAC -DISRUP - TIV INNOVATION , 2019, 2019, 165 :74-82
[8]   A survey on fake news and rumour detection techniques [J].
Bondielli, Alessandro ;
Marcelloni, Francesco .
INFORMATION SCIENCES, 2019, 497 :38-55
[9]   Influence of fake news in Twitter during the 2016 US presidential election [J].
Bovet, Alexandre ;
Makse, Hernan A. .
NATURE COMMUNICATIONS, 2019, 10 (1)
[10]  
Cavnar W.B., 1994, N-gram-based text categorization, VVolume 161175