Enhanced Feature Selection Using Genetic Algorithm for Machine-Learning-Based Phishing URL Detection

被引:3
作者
Kocyigit, Emre [1 ]
Korkmaz, Mehmet [2 ]
Sahingoz, Ozgur Koray [3 ]
Diri, Banu [2 ]
机构
[1] Univ Luxembourg, Interdisciplinary Ctr Secur Reliabil & Trust SnT, 2 Ave Univ, L-4365 Esch Sur Alzette, Luxembourg, Luxembourg
[2] Yildiz Tech Univ, Dept Comp Engn, TR-34349 Besiktas, Istanbul, Turkiye
[3] Biruni Univ, Dept Comp Engn, TR-34100 Istanbul, Turkiye
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 14期
关键词
feature selection; genetic algorithm; phishing detection;
D O I
10.3390/app14146081
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In recent years, the importance of computer security has increased due to the rapid advancement of digital technology, widespread Internet use, and increased sophistication of cyberattacks. Machine learning has gained great interest in securing data systems because it offers the capability of automatically detecting and responding to security threats in real time, which is crucial for maintaining the security of computer systems and protecting data from malicious attacks. This study concentrates on phishing attack detection systems, a prevalent cyber-threat. These systems assess the features of the incoming requests to identify whether they are malicious or not. Although the number of features is increasing in these systems, feature selection has become an essential pre-processing phase that identifies the most important features of a set of available features to prevent overfitting problems, improve model performance, reduce computational cost, and decrease training and execution time. Leveraging genetic algorithms, known for simulating natural selection to identify optimal solutions, we propose a novel feature selection method, based on genetic algorithms and locally optimized, that is applied to a URL-based phishing detection system with machine learning models. Our research demonstrates that the proposed technique offers a promising strategy for improving the performance of machine learning models.
引用
收藏
页数:21
相关论文
共 47 条
[1]   Intelligent phishing detection scheme using deep learning algorithms [J].
Adebowale, Moruf Akin ;
Lwin, Khin T. ;
Hossain, M. A. .
JOURNAL OF ENTERPRISE INFORMATION MANAGEMENT, 2023, 36 (03) :747-766
[2]   Metaheuristic Algorithms on Feature Selection: A Survey of One Decade of Research (2009-2019) [J].
Agrawal, Prachi ;
Abutarboush, Hattan F. ;
Ganesh, Talari ;
Mohamed, Ali Wagdy .
IEEE ACCESS, 2021, 9 :26766-26791
[3]   Hybrid Filter and Genetic Algorithm-Based Feature Selection for Improving Cancer Classification in High-Dimensional Microarray Data [J].
Ali, Waleed ;
Saeed, Faisal .
PROCESSES, 2023, 11 (02)
[4]   Hybrid intelligent phishing website prediction using deep neural networks with genetic algorithm-based feature selection and weighting [J].
Ali, Waleed ;
Ahmed, Adel A. .
IET INFORMATION SECURITY, 2019, 13 (06) :659-669
[5]  
[Anonymous], 2023, Phishing Activity Trends Report 4th Quarter 2022
[6]  
[Anonymous], 2023, 2023 State of the Phish ReportPhishing Stats and Trends
[7]  
Belkarkor S., 2022, P ADV MACH INT COMP, P69
[8]  
Catak Ferhat Ozgur, 2015, WSEAS Transactions on Information Science and Applications, V12, P290
[9]   Applications of deep learning for phishing detection: a systematic literature review [J].
Catal, Cagatay ;
Giray, Gorkem ;
Tekinerdogan, Bedir ;
Kumar, Sandeep ;
Shukla, Suyash .
KNOWLEDGE AND INFORMATION SYSTEMS, 2022, 64 (06) :1457-1500
[10]   A survey on feature selection methods [J].
Chandrashekar, Girish ;
Sahin, Ferat .
COMPUTERS & ELECTRICAL ENGINEERING, 2014, 40 (01) :16-28