AI Meta-Learners and Extra-Trees Algorithm for the Detection of Phishing Websites

被引:88
作者
Alsariera, Yazan Ahmad [1 ]
Adeyemo, Victor Elijah [2 ]
Balogun, Abdullateef Oluwagbemiga [3 ,4 ]
Alazzawi, Ammar Kareem [3 ]
机构
[1] Northern Border Univ, Fac Sci, Dept Comp Sci, Ar Ar 73222, Saudi Arabia
[2] Leeds Beckett Univ, Sch Built Environm Engn & Comp, Leeds LS6 3QS, W Yorkshire, England
[3] Univ Teknol PETRONAS, Fac Sci & IT, Dept Comp & Informat Sci, Seri Iskandar 32610, Perak, Malaysia
[4] Univ Ilorin, Fac Commun & Informat Sci, Dept Comp Sci, Ilorin 1515, Nigeria
关键词
Artificial intelligence (AI); cyber security; extra trees; phishing; phishing website detection; meta; -; earners; FEATURE-SELECTION; ENSEMBLE;
D O I
10.1109/ACCESS.2020.3013699
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Phishing is a type of social web-engineering attack in cyberspace where criminals steal valuable data or information from insensitive or uninformed users of the internet. Existing countermeasures in the form of anti-phishing software and computational methods for detecting phishing activities have proven to be effective. However, new methods are deployed by hackers to thwart these countermeasures. Due to the evolving nature of phishing attacks, the need for novel and efficient countermeasures becomes crucial as the effect of phishing attacks are often fatal and disastrous. Artificial Intelligence (AI) schemes have been the cornerstone of modern countermeasures used for mitigating phishing attacks. AI-based phishing countermeasures or methods possess their shortcomings particularly the high false alarm rate and the inability to interpret how most phishing methods perform their function. This study proposed four (4) meta-learner models (AdaBoost-Extra Tree (ABET), Bagging - Extra tree (BET), Rotation Forest - Extra Tree (RoFBET) and LogitBoost-Extra Tree (LBET)) developed using the extra-tree base classifier. The proposed AI-based meta-learners were fitted on phishing website datasets (currently with the newest features) and their performances were evaluated. The models achieved a detection accuracy not lower than 97% with a drastically low false-positive rate of not more 0.028. In addition, the proposed models outperform existing ML-based models in phishing attack detection. Hence, we recommend the adoption of meta-learners when building phishing attack detection models.
引用
收藏
页码:142532 / 142542
页数:11
相关论文
共 31 条
[1]   Conditioning factors determination for mapping and prediction of landslide susceptibility using machine learning algorithms [J].
Al-Najjar, Husam A. H. ;
Kalantar, Bahareh ;
Pradhan, Biswjaeet ;
Saeidi, Vahideh .
EARTH RESOURCES AND ENVIRONMENTAL REMOTE SENSING/GIS APPLICATIONS X, 2019, 11156
[2]   Hybrid intelligent phishing website prediction using deep neural networks with genetic algorithm-based feature selection and weighting [J].
Ali, Waleed ;
Ahmed, Adel A. .
IET INFORMATION SECURITY, 2019, 13 (06) :659-669
[3]  
Alqahtani M., 2019, P INT C COMP INF SCI, P1
[4]  
[Anonymous], 2019, RECENT FINDINGS INTE, DOI DOI 10.1016/B978-0-08-102292-4.00015-1
[5]  
[Anonymous], 2017, NEUROCOMPUTING, DOI DOI 10.1016/J.NEUCOM.2016.09.077
[6]  
[Anonymous], 2018, IEEE INT SYMP PARAL, DOI DOI 10.1109/BDCLOUD.2018.00114
[7]  
Balogun AO, 2019, AFRICAN J COMPUT ICT, V12, P30
[8]   A simple plug-in bagging ensemble based on threshold-moving for classifying binary and multiclass imbalanced data [J].
Collell, Guillem ;
Prelec, Drazen ;
Patil, Kaustubh R. .
NEUROCOMPUTING, 2018, 275 :330-340
[9]  
Dedakia M., 2015, J ENG COMPUTERS APPL, V4, P209
[10]  
Elijah AV, 2019, INT J ADV COMPUT SC, V10, P520