Enhancing Phishing Detection: A Machine Learning Approach With Feature Selection and Deep Learning Models

Cited by: 1
Authors
Nayak, Ganesh S. [1]
Muniyal, Balachandra [1]
Belavagi, Manjula C. [1]
Affiliations
[1] Manipal Acad Higher Educ, Manipal Inst Technol, Dept Informat & Commun Technol, Manipal 576104, Karnataka, India
Keywords
Phishing; Feature extraction; Accuracy; Electronic mail; Computer security; Deep learning; Machine learning; Uniform resource locators; Random forests; Optimization; Phishing detection; cybersecurity; deep learning; neural networks; feature selection; hyperparameter optimization; real-time detection; TabNet; wide and deep model;
DOI
10.1109/ACCESS.2025.3543738
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
With the rise in cybercrime, phishing remains a significant concern as it targets individuals with fake websites, causing victims to disclose their private information. The effective implementation of phishing detection relies on cost efficiency, with the increased feature extraction factor contributing to these costs. This research analyzes a dataset containing 58,645 URLs, examining 111 features of the latest phishing websites dataset to identify the differences between phishing sites and legitimate sites. Astonishingly, using only 14 characteristics, the feedforward model achieved a remarkable accuracy of 94.46%, confirming the efficiency of Machine Learning in phishing detection. Through the exploitation of a multiple classifier collection, including Deep Neural Network (DNN), Wide and Deep, and TabNet, this research advances ongoing efforts to improve the accuracy and efficiency of phishing detection mechanisms and enhance cybersecurity defenses against malicious activities. The methodology introduces a new metric called the 'anti-phishing score,' which evaluates performance based on false positives and negatives, beyond traditional model accuracy. The model was trained through a robust design of extensive experimentation and hyperparameter-sensitive grid search, ensuring an optimized configuration for phishing detection. Furthermore, the trained model was validated on a new dataset to evaluate its generalizability, enhancing its practical applicability. Through the integration of feature selection principles, advanced algorithmic techniques, and comprehensive evaluation approaches, this research offers a robust approach to phishing detection, considering the evolving nature of cyber threats. The findings provide a beneficial framework for cybersecurity specialists and researchers, enabling more effective preventive measures against phishing attacks.
Pages: 33308-33320
Page count: 13