Particle Swarm Optimization-Based Feature Weighting for Improving Intelligent Phishing Website Detection

被引:45
作者
Ali, Waleed [1 ]
Malebary, Sharaf [1 ]
机构
[1] King Abdulaziz Univ, Fac Comp & Informat Technol, Informat Technol Dept, Jeddah 21589, Saudi Arabia
关键词
Feature weighting; machine learning; particle swarm optimization; phishing website; FEATURE-SELECTION; CLASSIFICATION;
D O I
10.1109/ACCESS.2020.3003569
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Over the last few years, web phishing attacks have been constantly evolving causing customers to lose trust in e-commerce and online services. Various tools and systems based on a blacklist of phishing websites are applied to detect the phishing websites. Unfortunately, the fast evolution of technology has led to the born of more sophisticated methods when building websites to attract users. Thus, the latest and newly deployed phishing websites; for example, zero-day phishing websites, cannot be detected by using these blacklist-based approaches. Several recent research studies have been adopting machine learning techniques to identify phishing websites and utilizing them as an early alarm method to identify such threats. However, the important website features have been selected based on human experience or frequency analysis of website features in most of these approaches. In this paper, intelligent phishing website detection using particle swarm optimization-based feature weighting is proposed to enhance the detection of phishing websites. The proposed approach suggests utilizing particle swarm optimization (PSO) to weight various website features effectively to achieve higher accuracy when detecting phishing websites. In particular, the proposed PSO-based website feature weighting is used to differentiate between the various features in websites, based on how important they contribute towards recognizing the phishing from legitimate websites. The experimental results indicated that the proposed PSO-based feature weighting achieved outstanding improvements in terms of classification accuracy, true positive and negative rates, and false positive and negative rates of the machine learning models using only fewer websites features utilized in the detection of phishing websites.
引用
收藏
页码:116766 / 116780
页数:15
相关论文
共 43 条
[1]   Phishing detection based Associative Classification data mining [J].
Abdelhamid, Neda ;
Ayesh, Aladdin ;
Thabtah, Fadi .
EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (13) :5948-5959
[2]   A new feature selection method to improve the document clustering using particle swarm optimization algorithm [J].
Abualigah, Laith Mohammad ;
Khader, Ahamad Tajudin ;
Hanandeh, Essam Said .
JOURNAL OF COMPUTATIONAL SCIENCE, 2018, 25 :456-466
[3]  
Aburrous Maher, 2010, Proceedings of the Seventh International Conference on Information Technology: New Generations (ITNG 2010), P176, DOI 10.1109/ITNG.2010.117
[4]   Phishing environments, techniques, and countermeasures: A survey [J].
Aleroud, Ahmed ;
Zhou, Lina .
COMPUTERS & SECURITY, 2017, 68 :160-196
[5]  
Ali W., 2018, IJCSNS, V19, P15
[6]   Hybrid intelligent phishing website prediction using deep neural networks with genetic algorithm-based feature selection and weighting [J].
Ali, Waleed ;
Ahmed, Adel A. .
IET INFORMATION SECURITY, 2019, 13 (06) :659-669
[7]  
Ali W, 2017, INT J ADV COMPUT SC, V8, P72
[8]  
[Anonymous], 2017, Computer Science
[9]  
APWG, Phishing Activity Trends Report
[10]  
Aydin M, 2020, 2020 10TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), P774, DOI [10.1109/CCWC47524.2020.9031125, 10.1109/ccwc47524.2020.9031125]