Light gradient boosting machine-based phishing webpage detection model using phisher website features of mimic URLs

被引:13
作者
Oram, Etuari [1 ]
Dash, Pandit Byomakesha [2 ]
Naik, Bighnaraj [1 ]
Nayak, Janmenjoy [3 ]
Vimal, S. [4 ]
Nataraj, Sathees Kumar [5 ]
机构
[1] Veer Surendra Sai Univ Technol, Dept Comp Applicat, Burla 768018, Odisha, India
[2] Aditya Inst Technol & Management AITAM, Dept Informat Technol, Tekkali 532201, India
[3] Aditya Inst Technol & Management AITAM, Dept Comp Sci & Engn, Tekkali 532201, India
[4] Ramco Inst Technol, Dept Comp Sci & Engn, Rajapalayam 626117, Tamil Nadu, India
[5] AMA Int Univ Bahrain, Coll Engn, Dept Mechatron, Salmabad, Bahrain
关键词
Security; Email phishing; Ensemble learning; Light gradient boosting machine; Exclusive feature bundling; Gradient-based one-side sampling; FEATURE-SELECTION;
D O I
10.1016/j.patrec.2021.09.018
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the advent of the 20th century, the popularity of digital service usages is increasing every day. The internet has always been a popular communication method, and phishing webpages have been a challenging issue for more than two decades. Especially, E-commerce and other global companies face enormous challenges due to phishing of websites. Many developed countries have reported substantial economic loss due to unwanted phishing activities. With the exponential increase of digital communica-tions, these phishing activities are going to be increased. There is a need for an effective intrinsic phish-ing detection technique. Phishing websites have some unique features by which they can be identified. In this research, a Light gradient boosting machine-based phishing email detection model using phisher websites' features of mimic URLs has been proposed. The primary objective is to develop a highly secured and accurate model for successful identification of security breach through websites phishing. With the performance comparison of other ensemble as well as state-of-the-art machine learning models, the pro-posed model resulted high performance accuracy and proved to a robust approach for phishing activity. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:100 / 106
页数:7
相关论文
共 22 条
[1]  
Abdelhamid N, 2017, 2017 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS (ISI), P72, DOI 10.1109/ISI.2017.8004877
[2]   Classification of Phishing Email Using Random Forest Machine Learning Technique [J].
Akinyelu, Andronicus A. ;
Adewumi, Aderemi O. .
JOURNAL OF APPLIED MATHEMATICS, 2014,
[3]  
Ali W, 2017, INT J ADV COMPUT SC, V8, P72
[4]  
[Anonymous], 2015, PHISHING DATASET MAC
[5]   Data Analytics: Intelligent Anti-Phishing Techniques Based on Machine Learning [J].
Baadel, Said ;
Lu, Joan .
JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2019, 18 (01)
[6]  
BUBER E, 2017, 2017 INT ART INT DAT, P1, DOI DOI 10.1109/IDAP.2017.8090317
[7]   A new hybrid ensemble feature selection framework for machine learning-based phishing detection system [J].
Chiew, Kang Leng ;
Tan, Choon Lin ;
Wong, KokSheik ;
Yong, Kelvin S. C. ;
Tiong, Wei King .
INFORMATION SCIENCES, 2019, 484 :153-166
[8]   Phishing Email Feature Selection Approach [J].
Hamid, Isredza Rahmi A. ;
Abawajy, Jemal .
TRUSTCOM 2011: 2011 INTERNATIONAL JOINT CONFERENCE OF IEEE TRUSTCOM-11/IEEE ICESS-11/FCST-11, 2011, :916-921
[9]   Phishing Websites Prediction Using Classification Techniques [J].
Ibrahim, Dyana Rashid ;
Hadi, Ali Hussein .
2017 INTERNATIONAL CONFERENCE ON NEW TRENDS IN COMPUTING SCIENCES (ICTCS), 2017, :133-137
[10]  
Jain Ankit Kumar, 2018, Cyber Security. Proceedings of CSI 2015. Advances in Intelligent Systems and Computing (AISC 729), P467, DOI 10.1007/978-981-10-8536-9_44