In-Depth Analysis of Phishing Email Detection: Evaluating the Performance of Machine Learning and Deep Learning Models Across Multiple Datasets

被引:0
|
作者
Alhuzali, Abeer [1 ]
Alloqmani, Ahad [1 ]
Aljabri, Manar [1 ]
Alharbi, Fatemah [2 ]
机构
[1] King Abdulaziz Univ, Fac Comp & Informat Technol, Dept Comp Sci, Jeddah 21589, Saudi Arabia
[2] Taibah Univ, Coll Comp Sci & Engn, Comp Sci Dept, Yanbu 46522, Saudi Arabia
来源
APPLIED SCIENCES-BASEL | 2025年 / 15卷 / 06期
关键词
phishing email detection; cybersecurity; artificial intelligence (AI); deep learning (DL); machine learning (ML); spam filtering; threat detection; transformer models;
D O I
10.3390/app15063396
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Phishing emails remain a primary vector for cyberattacks, necessitating advanced detection mechanisms. Existing studies often focus on limited datasets or a small number of models, lacking a comprehensive evaluation approach. This study develops a novel framework for implementing and testing phishing email detection models to address this gap. A total of fourteen machine learning (ML) and deep learning (DL) models are evaluated across ten datasets, including nine publicly available datasets and a merged dataset created for this study. The evaluation is conducted using multiple performance metrics to ensure a comprehensive comparison. Experimental results demonstrate that DL models consistently outperform their ML counterparts in both accuracy and robustness. Notably, transformer-based models BERT and RoBERTa achieve the highest detection accuracies of 98.99% and 99.08%, respectively, on the balanced merged dataset, outperforming traditional ML approaches by an average margin of 4.7%. These findings highlight the superiority of DL in phishing detection and emphasize the potential of AI-driven solutions in strengthening email security systems. This study provides a benchmark for future research and sets the stage for advancements in cybersecurity innovation.
引用
收藏
页数:30
相关论文
共 50 条
  • [21] Across the Spectrum In-Depth Review AI-Based Models for Phishing Detection
    Ahmad, Shakeel
    Zaman, Muhammad
    Al-Shamayleh, Ahmad Sami
    Ahmad, Rahiel
    Abdulhamid, Shafi'I Muhammad
    Ergen, Ismail
    Akhunzada, Adnan
    IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2025, 6 : 2065 - 2089
  • [22] A Comparison of Natural Language Processing and Machine Learning Methods for Phishing Email Detection
    Bountakas, Panagiotis
    Koutroumpouchos, Konstantinos
    Xenakis, Christos
    ARES 2021: 16TH INTERNATIONAL CONFERENCE ON AVAILABILITY, RELIABILITY AND SECURITY, 2021,
  • [23] Heuristic machine learning approaches for identifying phishing threats across web and email platforms
    Jayaprakash, Ramprasath
    Natarajan, Krishnaraj
    Daniel, J. Alfred
    Chinnappan, Chandru Vignesh
    Giri, Jayant
    Qin, Hong
    Mallik, Saurav
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2024, 7
  • [24] Phishing Website Detection Using Deep Learning Models
    Zara, Ume
    Ayyub, Kashif
    Khan, Hikmat Ullah
    Daud, Ali
    Alsahfi, Tariq
    Ahmad, Saima Gulzar
    IEEE ACCESS, 2024, 12 : 167072 - 167087
  • [25] Cloud-based email phishing attack using machine and deep learning algorithm
    Umer Ahmed Butt
    Rashid Amin
    Hamza Aldabbas
    Senthilkumar Mohan
    Bader Alouffi
    Ali Ahmadian
    Complex & Intelligent Systems, 2023, 9 : 3043 - 3070
  • [26] Cloud-based email phishing attack using machine and deep learning algorithm
    Butt, Umer Ahmed
    Amin, Rashid
    Aldabbas, Hamza
    Mohan, Senthilkumar
    Alouffi, Bader
    Ahmadian, Ali
    COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (03) : 3043 - 3070
  • [27] Trust in Intrusion Detection Systems: An Investigation of Performance Analysis for Machine Learning and Deep Learning Models
    Mahbooba, Basim
    Sahal, Radhya
    Alosaimi, Wael
    Serrano, Martin
    COMPLEXITY, 2021, 2021
  • [28] Business Email Compromise Phishing Detection Based on Machine Learning: A Systematic Literature Review
    Atlam, Hany F.
    Oluwatimilehin, Olayonu
    ELECTRONICS, 2023, 12 (01)
  • [29] Efficient machine learning models across multiple datasets for autism spectrum disorder diagnoses
    Rashed, Amr E. Eldin
    Bahgat, Waleed M.
    Ahmed, Ali
    Farrag, Tamer Ahmed
    Atwa, Ahmed E. Mansour
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 100
  • [30] DeepEPhishNet: a deep learning framework for email phishing detection using word embedding algorithms
    Somesha, M.
    Pais, Alwyn Roshan
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2024, 49 (03):