Detecting Phishing Domains Using Machine Learning

Cited by: 19
Authors
Alnemari, Shouq [1]
Alshammari, Majid [1]
Affiliations
[1] Taif Univ, Coll Comp & Informat Technol, Taif 26571, Saudi Arabia
Source
APPLIED SCIENCES-BASEL | 2023 / Vol. 13 / Issue 08
Keywords
phishing detection; machine learning; phishing domains; artificial neural networks; support vector machine; decision tree; random forest; FEATURE-SELECTION; ALGORITHM; PROTECTION; ENSEMBLE; WEBSITES; FEATURES
DOI
10.3390/app13084649
Chinese Library Classification
O6 [Chemistry]
Discipline classification code
0703
Abstract
Phishing is an online threat where an attacker impersonates an authentic and trustworthy organization to obtain sensitive information from a victim. One example of such is trolling, which has long been considered a problem. However, recent advances in phishing detection, such as machine learning-based methods, have assisted in combatting these attacks. Therefore, this paper develops and compares four models for investigating the efficiency of using machine learning to detect phishing domains. It also compares the most accurate model of the four with existing solutions in the literature. These models were developed using artificial neural networks (ANNs), support vector machines (SVMs), decision trees (DTs), and random forest (RF) techniques. Moreover, the uniform resource locator's (URL's) UCI phishing domains dataset is used as a benchmark to evaluate the models. Our findings show that the model based on the random forest technique is the most accurate of the four and outperforms other solutions in the literature.
Pages: 16
Related papers
71 in total
[21]  
corporatefinanceinstitute, BAGG BOOTSTR AGGR OV
[22]  
Creswell J.W., 2011, PsycEXTRA Dataset
[23]  
Cristianini N., 2000, INTRO SUPPORT VECTOR
[24]  
datacamp, ADABOOST CLSS PYTH D
[25]   An In-Depth Benchmarking and Evaluation of Phishing Detection Research for Security Needs [J].
El Aassal, Ayman;
Baki, Shahryar;
Das, Avisha;
Verma, Rakesh M.
IEEE ACCESS, 2020, 8 (08) :22170-22192
[26]  
Friedman Jerome H, 2017, The Elements of Statistical Learning Data Mining, Inference, and Prediction
[27]   A Survey on Ensemble Learning for Data Stream Classification [J].
Gomes, Heitor Murilo;
Barddal, Jean Paul;
Enembreck, Fabricio;
Bifet, Albert.
ACM COMPUTING SURVEYS, 2017, 50 (02)
[28]   Fighting against phishing attacks: state of the art and future challenges [J].
Gupta, B. B.;
Tewari, Aakanksha;
Jain, Ankit Kumar;
Agrawal, Dharma P.
NEURAL COMPUTING & APPLICATIONS, 2017, 28 (12) :3629-3654
[29]  
Hulten G.J., 2014, FINDING PHISHING SIT
[30]  
Hutchinson S., 2018, INT C MACHINE LEARNI, V251, DOI 10.1007/978-3-030-00557-3_46