Phishing detection based on machine learning and feature selection methods

被引:12
作者
Almseidin M. [1 ]
Abu Zuraiq A.M. [2 ]
Al-kasassbeh M. [2 ]
Alnidami N. [3 ]
机构
[1] Department of Information Technology, University of Miskolc, Miskolc
[2] Princess Sumaya University for Technology, Amman
[3] National Information Technology Center, Amman
关键词
Feature selection; Machine learning; Multilayer perceptron; Phishing detection; Random forest;
D O I
10.3991/ijim.v13i12.11411
中图分类号
学科分类号
摘要
With increasing technology developments, the Internet has become everywhere and accessible by everyone. There are a considerable number of web-pages with different benefits. Despite this enormous number, not all of these sites are legitimate. There are so-called phishing sites that deceive users into serving their interests. This paper dealt with this problem using machine learning algorithms in addition to employing a novel dataset that related to phishing detection, which contains 5000 legitimate web-pages and 5000 phishing ones. In order to obtain the best results, various machine learning algorithms were tested. Then J48, Random forest, and Multilayer perceptron were chosen. Different feature selection tools were employed to the dataset in order to improve the efficiency of the models. The best result of the experiment achieved by utilizing 20 features out of 48 features and applying it to Random forest algorithm. The accuracy was 98.11%. © 2019, International Association of Online Engineering.
引用
收藏
页码:71 / 183
页数:112
相关论文
共 23 条
[1]  
Phishing activity trends report 4 th quarter, (2018)
[2]  
24, (2019)
[3]  
Shirazi H., Unbiased phishing detection using domain name based features
[4]  
Altaher A., Phishing websites classification using hybrid svm and knn approach, International Journal of Advanced Computer Science and Applications, 8, 6, pp. 90-95, (2017)
[5]  
Chen Y.-S., Yu Y.-H., Liu H.-S., Wang P., Detect phishing by checking content consistency, Proceedings of the 2014 IEEE 15th International Conference on Information Reuse and Integration (IEEE IRI2014), pp. 109-119, (2014)
[6]  
Abdelhamid N., Ayesh A., Thabtah F., Phishing detection based associative classification data mining, Expert Systems with Applications, 41, 13, pp. 5948-5959, (2014)
[7]  
Moghimi M., Varjani A.Y., New rule-based phishing detection method, Expert systems with applications, 53, pp. 231-242, (2016)
[8]  
Aburrous M., Hossain M.A., Dahal K., Thabtah F., Intelligent phishing detection system for e-banking using fuzzy data mining, Expert systems with applications, 37, 12, pp. 7913-7921, (2010)
[9]  
Chiew K.L., Chang E.H., Tiong W.K., Et al., Utilisation of website logo for phishing detection, Computers & Security, 54, pp. 16-26, (2015)
[10]  
Solanki J., Vaishnav R.G., Website phishing detection using heuristic based approach, Proceedings of the third international conference on advances in computing, electronics and electrical technology, (2015)