Detecting Phishing Domains Using Machine Learning

被引:19
作者
Alnemari, Shouq [1 ]
Alshammari, Majid [1 ]
机构
[1] Taif Univ, Coll Comp & Informat Technol, Taif 26571, Saudi Arabia
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 08期
关键词
phishing detection; machine learning; phishing domains; artificial neural networks; support vector machine; decision tree; random forest; FEATURE-SELECTION; ALGORITHM; PROTECTION; ENSEMBLE; WEBSITES; FEATURES;
D O I
10.3390/app13084649
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Phishing is an online threat where an attacker impersonates an authentic and trustworthy organization to obtain sensitive information from a victim. One example of such is trolling, which has long been considered a problem. However, recent advances in phishing detection, such as machine learning-based methods, have assisted in combatting these attacks. Therefore, this paper develops and compares four models for investigating the efficiency of using machine learning to detect phishing domains. It also compares the most accurate model of the four with existing solutions in the literature. These models were developed using artificial neural networks (ANNs), support vector machines (SVMs), decision trees (DTs), and random forest (RF) techniques. Moreover, the uniform resource locator's (URL's) UCI phishing domains dataset is used as a benchmark to evaluate the models. Our findings show that the model based on the random forest technique is the most accurate of the other four techniques and outperforms other solutions in the literature.
引用
收藏
页数:16
相关论文
共 71 条
[11]   Phishing website detection using support vector machines and nature-inspired optimization algorithms [J].
Anupam, Sagnik ;
Kar, Arpan Kumar .
TELECOMMUNICATION SYSTEMS, 2021, 76 (01) :17-32
[12]   A comprehensive survey of AI-enabled phishing attacks detection techniques [J].
Basit, Abdul ;
Zafar, Maham ;
Liu, Xuan ;
Javed, Abdul Rehman ;
Jalil, Zunera ;
Kifayat, Kashif .
TELECOMMUNICATION SYSTEMS, 2021, 76 (01) :139-154
[13]   An iterative boosting-based ensemble for streaming data classification [J].
Bertini Junior, Joao Roberto ;
Nicoletti, Maria do Carmo .
INFORMATION FUSION, 2019, 45 :66-78
[14]  
Bleau H., 2017, Global fraud and cybercrime forecast
[15]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[16]  
Brownlee J., 2020, Train-test split for evaluating machine learning algorithms
[17]   Cybersecurity education: Evolution of the discipline and analysis of master programs [J].
Cabaj, Krzysztof ;
Domingos, Duke ;
Kotulski, Zbigniew ;
Respicio, Ana .
COMPUTERS & SECURITY, 2018, 75 :24-35
[18]   AI@ntiPhish - Machine Learning Mechanisms for Cyber-Phishing Attack [J].
Chen, Yu-Hung ;
Chen, Jiann-Liang .
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (05) :878-887
[19]   A new hybrid ensemble feature selection framework for machine learning-based phishing detection system [J].
Chiew, Kang Leng ;
Tan, Choon Lin ;
Wong, KokSheik ;
Yong, Kelvin S. C. ;
Tiong, Wei King .
INFORMATION SCIENCES, 2019, 484 :153-166
[20]   Re-engineering Cybersecurity Education in the US: An Analysis of the Critical Factors [J].
Conklin, Wm Arthur ;
Cline, Raymond E., Jr. ;
Roosa, Tiffany .
2014 47TH HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES (HICSS), 2014, :2006-2014