In-Depth Analysis of Phishing Email Detection: Evaluating the Performance of Machine Learning and Deep Learning Models Across Multiple Datasets

被引：0

作者：

Alhuzali, Abeer ^{[1
]}

Alloqmani, Ahad ^{[1
]}

Aljabri, Manar ^{[1
]}

Alharbi, Fatemah ^{[2
]}

机构：

[1] King Abdulaziz Univ, Fac Comp & Informat Technol, Dept Comp Sci, Jeddah 21589, Saudi Arabia

[2] Taibah Univ, Coll Comp Sci & Engn, Comp Sci Dept, Yanbu 46522, Saudi Arabia

来源：

APPLIED SCIENCES-BASEL | 2025年 / 15卷 / 06期

关键词：

phishing email detection; cybersecurity; artificial intelligence (AI); deep learning (DL); machine learning (ML); spam filtering; threat detection; transformer models;

D O I：

10.3390/app15063396

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

Phishing emails remain a primary vector for cyberattacks, necessitating advanced detection mechanisms. Existing studies often focus on limited datasets or a small number of models, lacking a comprehensive evaluation approach. This study develops a novel framework for implementing and testing phishing email detection models to address this gap. A total of fourteen machine learning (ML) and deep learning (DL) models are evaluated across ten datasets, including nine publicly available datasets and a merged dataset created for this study. The evaluation is conducted using multiple performance metrics to ensure a comprehensive comparison. Experimental results demonstrate that DL models consistently outperform their ML counterparts in both accuracy and robustness. Notably, transformer-based models BERT and RoBERTa achieve the highest detection accuracies of 98.99% and 99.08%, respectively, on the balanced merged dataset, outperforming traditional ML approaches by an average margin of 4.7%. These findings highlight the superiority of DL in phishing detection and emphasize the potential of AI-driven solutions in strengthening email security systems. This study provides a benchmark for future research and sets the stage for advancements in cybersecurity innovation.

引用

页数：30