An Empirical Comparison of Machine Learning and Deep Learning Models for Automated Fake News Detection

Cited by: 0

Authors:

Tian, Yexin ^{[1]}

Xu, Shuo ^{[2]}

Cao, Yuchen ^{[3]}

Wang, Zhongyan ^{[4]}

Wei, Zijing ^{[5]}

Affiliations:

[1] Georgia Inst Technol, Coll Comp, Atlanta, GA 30332 USA

[2] Univ Calif San Diego, Comp Sci & Engn Dept, La Jolla, CA 92093 USA

[3] Northeastern Univ, Khoury Coll Comp Sci, Seattle, WA 98109 USA

[4] NYU, Ctr Data Sci, New York, NY 10011 USA

[5] Univ Illinois, Coll Liberal Arts & Sci, Urbana, IL 61801 USA

Source:

MATHEMATICS | 2025 / Vol. 13 / Issue 13

Keywords:

fake news detection; natural language processing; machine learning; deep learning; text classification; model interpretability;

DOI:

10.3390/math13132086

Chinese Library Classification (CLC) number:

O1 [Mathematics];

Subject classification code:

0701 ; 070101 ;

Abstract:

Detecting fake news is a critical challenge in natural language processing (NLP), demanding solutions that balance accuracy, interpretability, and computational efficiency. Despite advances in NLP, systematic empirical benchmarks that directly compare classical and deep models across varying input richness, with careful attention to interpretability and computational tradeoffs, remain scarce. In this study, we systematically evaluate the mathematical foundations and empirical performance of five representative models for automated fake news classification: three classical machine learning algorithms (Logistic Regression, Random Forest, and Light Gradient Boosting Machine) and two state-of-the-art deep learning architectures (A Lite Bidirectional Encoder Representations from Transformers (ALBERT) and Gated Recurrent Units (GRUs)). Using the large-scale WELFake dataset, we conduct experiments under both headline-only and headline-plus-content input scenarios, assessing each model's ability to capture linguistic, contextual, and semantic cues. We analyze each model's optimization framework, decision boundaries, and feature importance mechanisms, highlighting the empirical tradeoffs between representational capacity, generalization, and interpretability. Our results show that transformer-based models, especially ALBERT, achieve state-of-the-art performance (macro F1 up to 0.99) when given rich context, while classical ensembles remain viable in resource-constrained settings. These findings directly inform practical fake news detection.

Pages: 24