Phishing Website Detection from URLs Using Classical Machine Learning ANN Model

被引:5
作者
Salloum, Said [1 ,2 ]
Gaber, Tarek [1 ,3 ]
Vadera, Sunil [1 ]
Shaalan, Khaled [4 ]
机构
[1] Univ Salford, Sch Sci Engn & Environm, Salford, Lancs, England
[2] Univ Sharjah, Machine Learning & NLP Res Grp, Sharjah, U Arab Emirates
[3] Suez Canal Univ, Fac Comp & Informat, Ismailia 41522, Egypt
[4] British Univ Dubai, Fac Engn & IT, Dubai, U Arab Emirates
来源
SECURITY AND PRIVACY IN COMMUNICATION NETWORKS, SECURECOMM 2021, PT II | 2021年 / 399卷
关键词
Fraud protection; Cybersecurity; Machine learning; Phishing Detection; URL;
D O I
10.1007/978-3-030-90022-9_28
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Phishing is a serious form of online fraud made up of spoofed websites that attempt to gain users' sensitive information by tricking them into believing that they are visiting a legitimate site. Phishing attacks can be detected many ways, including a user's awareness of fraud protection, blacklisting websites, analyzing the suspected characteristics, or comparing them to recent attempts that followed similar patterns. The purpose of this paper is to create classification models using features extracted from websites to study and classify phishing websites. In order to train the system, we use two datasets consisting of 58,645 and 88,647 URLs labeled as "Phishing" or "Legitimate". A diverse range of machine learning models such as "XGBOOST, Support Vector Machine (SVM), Random Forest (RF), k-nearest neighbor (KNN), Artificial neural network (ANN), Logistic Regression (LR), Decision tree (DT), and Gaussian naive Bayes (NB)" classifiers are evaluated. ANN provided the best performance with 97.63% accuracy for detecting phishing URLs in experiments. Such a study would be valuable to the scientific community, especially to researchers who work on phishing attack detection and prevention.
引用
收藏
页码:509 / 523
页数:15
相关论文
共 37 条
[1]  
[Anonymous], 2012, P STUD FAC RES DAY C
[2]  
[Anonymous], 2007, P 16 INT C WORLD WID, DOI DOI 10.1145/1242572.1242660
[3]  
Anti-Phishing Working Group, PHISH ACT TRENDS REP
[4]   Going Spear Phishing: Exploring Embedded Training and Awareness [J].
Caputo, Deanna D. ;
Pfleeger, Shari Lawrence ;
Freeman, Jesse D. ;
Johnson, M. Eric .
IEEE SECURITY & PRIVACY, 2014, 12 (01) :28-38
[5]   A new hybrid ensemble feature selection framework for machine learning-based phishing detection system [J].
Chiew, Kang Leng ;
Tan, Choon Lin ;
Wong, KokSheik ;
Yong, Kelvin S. C. ;
Tiong, Wei King .
INFORMATION SCIENCES, 2019, 484 :153-166
[6]  
Christou O., ARXIV200506599
[7]  
Das Bhattacharjee S, 2017, 2017 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS (ISI), P107, DOI 10.1109/ISI.2017.8004883
[8]  
Ferreira R., 2018, Soc. Netw., V7, P97, DOI 10.4236/sn.2018.72008
[9]   From Feature Engineering and Topics Models to Enhanced Prediction Rates in Phishing Detection [J].
Gualberto, Eder S. ;
De Sousa, Rafael T., Jr. ;
Vieira, Thiago P. De B. ;
Da Costa, Joao Paulo C. L. ;
Duque, Claudio G. .
IEEE ACCESS, 2020, 8 :76368-76385
[10]  
Gunawardena S, 2013, PROCEEDINGS OF THE 2013 8TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION (ICCSE 2013), P383