Fake News Detection on Social Media Using Ensemble Methods

Cited by: 0
Authors
Ilyas, Muhammad Ali [1]
Rehman, Abdul [2]
Abbas, Assad [1]
Kim, Dongsun [3]
Naseem, Muhammad Tahir [4]
Allah, Nasro Min [5]
Affiliations
[1] COMSATS Univ, Dept Comp Sci, Islamabad 45550, Pakistan
[2] Kyungpook Natl Univ, Sch Comp Sci & Engn, Daegu 41566, South Korea
[3] Korea Univ, Dept Comp Sci & Engn, Seoul 02841, South Korea
[4] Yeungnam Univ, Dept Elect Engn, Gyongsan 38541, South Korea
[5] Imam Abdulrahman Bin Faisal Univ, Coll Comp Sci & Informat Technol, Dept Comp Sci, Dammam 34223, Saudi Arabia
Source
CMC-COMPUTERS MATERIALS & CONTINUA | 2024 / Volume 81 / Issue 03
Keywords
Fake news detection; Machine Learning (ML); Deep Learning (DL); Chi-Square; ensembling;
DOI
10.32604/cmc.2024.056291
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Subject classification code
0812;
Abstract
In an era dominated by information dissemination through various channels like newspapers, social media, radio, and television, the surge in content production, especially on social platforms, has amplified the challenge of distinguishing between truthful and deceptive information. Fake news, a prevalent issue, particularly on social media, complicates the assessment of news credibility. The pervasive spread of fake news not only misleads the public but also erodes trust in legitimate news sources, creating confusion and polarizing opinions. As the volume of information grows, individuals increasingly struggle to discern credible content from false narratives, leading to widespread misinformation and potentially harmful consequences. Despite numerous methodologies proposed for fake news detection, including knowledge-based, language-based, and machine-learning approaches, their efficacy often diminishes when confronted with high-dimensional datasets and data riddled with noise or inconsistencies. Our study addresses this challenge by evaluating the synergistic benefits of combining feature extraction and feature selection techniques in fake news detection. We employ multiple feature extraction methods, including Count Vectorizer, Bag of Words, Global Vectors for Word Representation (GloVe), Word to Vector (Word2Vec), and Term Frequency-Inverse Document Frequency (TF-IDF), alongside feature selection techniques such as Information Gain, Chi-Square, Principal Component Analysis (PCA), and Document Frequency. This comprehensive approach enhances the model's ability to identify and analyze relevant features, leading to more accurate and effective fake news detection. Our findings highlight the importance of a multi-faceted approach, offering a significant improvement in model accuracy and reliability. 
Moreover, the study emphasizes the adaptability of the proposed ensemble model across diverse datasets, reinforcing its potential for broader application in real-world scenarios. We introduce a pioneering ensemble technique that leverages both machine-learning and deep-learning classifiers. To identify the optimal ensemble configuration, we systematically tested various combinations. Experimental evaluations conducted on three diverse datasets related to fake news demonstrate the exceptional performance of our proposed ensemble model. Achieving remarkable accuracy levels of 97%, 99%, and 98% on Dataset 1, Dataset 2, and Dataset 3, respectively, our approach showcases robustness and effectiveness in discerning fake news amidst the complexities of contemporary information landscapes. This research contributes to the advancement of fake news detection methodologies and underscores the significance of integrating feature extraction and feature selection strategies for enhanced performance, especially in the context of intricate, high-dimensional datasets.
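The abstract's idea of pairing feature extraction with feature selection and then combining classifier outputs can be illustrated with a minimal sketch. It assumes a bag-of-words presence representation, the standard 2x2 chi-square statistic for ranking terms, and simple hard (majority) voting. The abstract does not specify the classifiers, the voting scheme, or the datasets, so the function names (`chi_square_scores`, `select_top_k`, `majority_vote`) and the four-document toy corpus here are hypothetical and do not reproduce the paper's method or results.

```python
from collections import Counter


def tokenize(text):
    # Naive whitespace tokenizer; a real pipeline would normalize further.
    return text.lower().split()


def chi_square_scores(docs, labels):
    """Chi-square statistic of each term against a binary label (1 = fake).

    Uses document-level presence/absence, i.e. a 2x2 contingency table
    per term: N * (ad - bc)^2 / ((a+b)(c+d)(a+c)(b+d)).
    """
    n = len(docs)
    n_fake = sum(labels)
    n_real = n - n_fake
    in_fake, in_real = Counter(), Counter()
    for doc, label in zip(docs, labels):
        for term in set(tokenize(doc)):
            (in_fake if label == 1 else in_real)[term] += 1

    scores = {}
    for term in set(in_fake) | set(in_real):
        a = in_fake[term]   # fake documents containing the term
        b = in_real[term]   # real documents containing the term
        c = n_fake - a      # fake documents without the term
        d = n_real - b      # real documents without the term
        denom = (a + b) * (c + d) * (a + c) * (b + d)
        scores[term] = n * (a * d - b * c) ** 2 / denom if denom else 0.0
    return scores


def select_top_k(scores, k):
    # Highest score first; ties broken alphabetically for determinism.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:k]]


def majority_vote(predictions):
    """Hard voting: `predictions` holds one list of labels per classifier."""
    return [Counter(votes).most_common(1)[0][0] for votes in zip(*predictions)]


# Hypothetical toy corpus, invented for illustration only.
docs = [
    "shocking miracle cure revealed",
    "miracle pill shocking doctors",
    "council approves budget report",
    "budget report released today",
]
labels = [1, 1, 0, 0]

scores = chi_square_scores(docs, labels)
top_terms = select_top_k(scores, 4)
```

In this sketch, `top_terms` would become the reduced vocabulary passed to each classifier, and `majority_vote` would combine the per-document predictions of an odd number of classifiers into a single label.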
Pages: 4525-4549
Number of pages: 25