Fake News Detection on Social Media Using Ensemble Methods

Cited by: 0
Authors
Ilyas, Muhammad Ali [1]
Rehman, Abdul [2]
Abbas, Assad [1]
Kim, Dongsun [3]
Naseem, Muhammad Tahir [4]
Allah, Nasro Min [5]
Affiliations
[1] COMSATS Univ, Dept Comp Sci, Islamabad 45550, Pakistan
[2] Kyungpook Natl Univ, Sch Comp Sci & Engn, Daegu 41566, South Korea
[3] Korea Univ, Dept Comp Sci & Engn, Seoul 02841, South Korea
[4] Yeungnam Univ, Dept Elect Engn, Gyongsan 38541, South Korea
[5] Imam Abdulrahman Bin Faisal Univ, Coll Comp Sci & Informat Technol, Dept Comp Sci, Dammam 34223, Saudi Arabia
Source
CMC-COMPUTERS MATERIALS & CONTINUA | 2024 / Vol. 81 / No. 03
Keywords
Fake news detection; Machine Learning (ML); Deep Learning (DL); Chi-Square; ensembling;
DOI
10.32604/cmc.2024.056291
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
In an era dominated by information dissemination through various channels like newspapers, social media, radio, and television, the surge in content production, especially on social platforms, has amplified the challenge of distinguishing between truthful and deceptive information. Fake news, a prevalent issue, particularly on social media, complicates the assessment of news credibility. The pervasive spread of fake news not only misleads the public but also erodes trust in legitimate news sources, creating confusion and polarizing opinions. As the volume of information grows, individuals increasingly struggle to discern credible content from false narratives, leading to widespread misinformation and potentially harmful consequences. Despite numerous methodologies proposed for fake news detection, including knowledge-based, language-based, and machine-learning approaches, their efficacy often diminishes when confronted with high-dimensional datasets and data riddled with noise or inconsistencies. Our study addresses this challenge by evaluating the synergistic benefits of combining feature extraction and feature selection techniques in fake news detection. We employ multiple feature extraction methods, including Count Vectorizer, Bag of Words, Global Vectors for Word Representation (GloVe), Word to Vector (Word2Vec), and Term Frequency-Inverse Document Frequency (TF-IDF), alongside feature selection techniques such as Information Gain, Chi-Square, Principal Component Analysis (PCA), and Document Frequency. This comprehensive approach enhances the model's ability to identify and analyze relevant features, leading to more accurate and effective fake news detection. Our findings highlight the importance of a multi-faceted approach, offering a significant improvement in model accuracy and reliability. 
Moreover, the study emphasizes the adaptability of the proposed ensemble model across diverse datasets, reinforcing its potential for broader application in real-world scenarios. We introduce a pioneering ensemble technique that leverages both machine-learning and deep-learning classifiers. To identify the optimal ensemble configuration, we systematically tested various combinations. Experimental evaluations conducted on three diverse datasets related to fake news demonstrate the exceptional performance of our proposed ensemble model. Achieving remarkable accuracy levels of 97%, 99%, and 98% on Dataset 1, Dataset 2, and Dataset 3, respectively, our approach showcases robustness and effectiveness in discerning fake news amidst the complexities of contemporary information landscapes. This research contributes to the advancement of fake news detection methodologies and underscores the significance of integrating feature extraction and feature selection strategies for enhanced performance, especially in the context of intricate, high-dimensional datasets.
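The abstract names Chi-Square feature selection and classifier ensembling but gives no implementation details. The following is a minimal, standard-library-only sketch of both ideas, not the authors' pipeline. It assumes binary labels (1 = fake, 0 = real), whitespace tokenization, the classic 2x2 term-presence chi-square statistic, and hard majority voting; the toy corpus and all function names are hypothetical and chosen only for illustration.

```python
from collections import Counter


def tokenize(text):
    """Lowercase whitespace tokenizer (an assumption; the paper's preprocessing is not given)."""
    return text.lower().split()


def chi_square_scores(docs, labels):
    """Score each term by the chi-square statistic of its 2x2 contingency
    table (term present/absent vs. label fake/real)."""
    n = len(docs)
    n_fake = sum(labels)
    n_real = n - n_fake
    in_fake, in_real = Counter(), Counter()
    for text, label in zip(docs, labels):
        for term in set(tokenize(text)):  # presence, not raw frequency
            (in_fake if label == 1 else in_real)[term] += 1

    scores = {}
    for term in set(in_fake) | set(in_real):
        a = in_fake[term]   # fake docs containing the term
        b = in_real[term]   # real docs containing the term
        c = n_fake - a      # fake docs without the term
        d = n_real - b      # real docs without the term
        denom = (a + b) * (c + d) * (a + c) * (b + d)
        scores[term] = n * (a * d - b * c) ** 2 / denom if denom else 0.0
    return scores


def select_top_k(scores, k):
    """Keep the k highest-scoring terms; ties are broken alphabetically."""
    return sorted(scores, key=lambda t: (-scores[t], t))[:k]


def majority_vote(votes):
    """Hard voting: return the most common label among the classifiers' votes."""
    return Counter(votes).most_common(1)[0][0]


def ensemble_predict(per_model_predictions):
    """Combine one prediction list per model into one label per document."""
    return [majority_vote(column) for column in zip(*per_model_predictions)]


# Hypothetical toy corpus, for illustration only.
docs = [
    "shocking miracle cure doctors hate",
    "shocking secret celebrity scandal",
    "council approves city budget",
    "city budget report released",
]
labels = [1, 1, 0, 0]

scores = chi_square_scores(docs, labels)
selected_terms = select_top_k(scores, 3)

# Predictions from three hypothetical classifiers on three documents.
combined = ensemble_predict([[1, 0, 1], [1, 1, 0], [0, 0, 1]])
```

Terms that occur in only one class receive the highest scores, so selection discards words that carry little class information before any classifier is trained. Hard voting is only one way to combine models; the abstract does not say whether the authors used hard voting, soft voting, or stacking.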
Pages: 4525-4549
Page count: 25