Fake News Detection Using Machine Learning and Deep Learning Methods

被引:2
作者
Saeed, Ammar [1 ]
Al Solami, Eesa [2 ]
机构
[1] COMSATS Univ Islamabad, Dept Comp Sci, Wah Cantt, Pakistan
[2] Univ Jeddah, Coll Comp Sci & Engn, Dept Cybersecur, Jeddah 21959, Saudi Arabia
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2023年 / 77卷 / 02期
关键词
Machine learning; deep learning; fake news; feature extraction; SOCIAL MEDIA; INFORMATION;
D O I
10.32604/cmc.2023.030551
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The evolution of the internet and its accessibility in the twenty-first century has resulted in a tremendous increase in the use of social media platforms. Some social media sources contribute to the propagation of fake news that has no real validity, but they accumulate over time and begin to appear in the feed of every consumer producing even more ambiguity. To sustain the value of social media, such stories must be distinguished from the true ones. As a result, an automated system is required to save time and money. The classification of fake news and misinformation from social media data corpora is the subject of this research. Several preprocessing and data improvement procedures are used to gather and preprocess two fake news datasets. Deep text features are extracted using word embedding models Word2vec and Global Vectors for Word representation while textual features are extracted using n-gram approaches named Term Frequency-Inverse Document Frequency and Bag of Words from both datasets individually. Bidirectional Encoder Representations from Transformers (BERT) is also employed to derive embedded representations from the input data. Finally, three Machine Learning (ML) and two Deep Learning (DL) algorithms are utilized for fake news classification. BERT also carries out the classification of embedded outcomes generated by it in parallel with the ML and DL models. In terms of overall performance, the DL-based Convolutional Neural Network stands out in the case of the first while BERT performs better in the case of the second dataset.
引用
收藏
页码:2079 / 2096
页数:18
相关论文
共 29 条
[21]  
Monti F, 2019, Arxiv, DOI [arXiv:1902.06673, DOI 10.48550/ARXIV.1902.06673]
[22]  
Nagi K., 2018, ICSSM P JUL, P77
[23]   Social media, knowledge translation, and action on the social determinants of health and health equity: A survey of public health practices [J].
Ndumbe-Eyoh, Sume ;
Mazzucco, Agnes .
JOURNAL OF PUBLIC HEALTH POLICY, 2016, 37 :S249-S259
[24]  
Ni R, 2020, CHIN CONTR CONF, P7492, DOI 10.23919/CCC50068.2020.9188578
[25]  
Aragao JMN, 2018, REV BRAS ENFERM, V71, P265, DOI [10.1590/00347167-2016-0604, 10.1590/0034-7167-2016-0604]
[26]  
Perez-Rosas Veronica, 2017, COLING 2018 27 INT C
[27]   A New Application of Social Impact in Social Media for Overcoming Fake News in Health [J].
Pulido, Cristina M. ;
Ruiz-Eugenio, Laura ;
Redondo-Sama, Gisela ;
Villarejo-Carballido, Beatriz .
INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2020, 17 (07)
[28]   Sentiment analysis based on improved pre-trained word embeddings [J].
Rezaeinia, Seyed Mahdi ;
Rahmani, Rouhollah ;
Ghodsi, Ali ;
Veisi, Hadi .
EXPERT SYSTEMS WITH APPLICATIONS, 2019, 117 :139-147
[29]   Information dissemination model for social media with constant updates [J].
Zhu, Hui ;
Wu, Heng ;
Cao, Jin ;
Fu, Gang ;
Li, Hui .
PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2018, 502 :469-482