A hybrid DNN–LSTM model for detecting phishing URLs

被引:0
作者
Alper Ozcan
Cagatay Catal
Emrah Donmez
Behcet Senturk
机构
[1] Nisantasi University,Department of Computer Engineering
[2] Akdeniz University,Department of Computer Engineering
[3] Qatar University,Department of Computer Science and Engineering
[4] Turgut Ozal University,Department of Management Information Systems
[5] Ankara University,Department of Computer Science and Engineering
来源
Neural Computing and Applications | 2023年 / 35卷
关键词
Phishing; Machine learning; Deep learning; Phishing detection;
D O I
暂无
中图分类号
学科分类号
摘要
Phishing is an attack targeting to imitate the official websites of corporations such as banks, e-commerce, financial institutions, and governmental institutions. Phishing websites aim to access and retrieve users’ important information such as personal identification, social security number, password, e-mail, credit card, and other account information. Several anti-phishing techniques have been developed to cope with the increasing number of phishing attacks so far. Machine learning and particularly, deep learning algorithms are nowadays the most crucial techniques used to detect and prevent phishing attacks because of their strong learning abilities on massive datasets and their state-of-the-art results in many classification problems. Previously, two types of feature extraction techniques [i.e., character embedding-based and manual natural language processing (NLP) feature extraction] were used in isolation. However, researchers did not consolidate these features and therefore, the performance was not remarkable. Unlike previous works, our study presented an approach that utilizes both feature extraction techniques. We discussed how to combine these feature extraction techniques to fully utilize from the available data. This paper proposes hybrid deep learning models based on long short-term memory and deep neural network algorithms for detecting phishing uniform resource locator and evaluates the performance of the models on phishing datasets. The proposed hybrid deep learning models utilize both character embedding and NLP features, thereby simultaneously exploiting deep connections between characters and revealing NLP-based high-level connections. Experimental results showed that the proposed models achieve superior performance than the other phishing detection models in terms of accuracy metric.
引用
收藏
页码:4957 / 4973
页数:16
相关论文
共 180 条
[1]  
Akinyelu AA(2019)Machine learning and nature inspired based phishing detection: a literature survey Int J Artif Intell Tools 28 1930002-196
[2]  
Aleroud Ahmed(2017)Phishing environments, techniques, and countermeasures: a survey Comput Secur 68 160-185
[3]  
Zhou Lina(1992)An introduction to kernel and nearestneighbor nonparametric regression Am Stat 46 175-458
[4]  
Altman NS(2018)Computational intelligence approach for modeling hydrogen production: a review Eng Appl Comput Fluid Mech 12 438-32
[5]  
Ardabili SF(2020)Deep learning-based appearance features extraction for automated carp species identification Aquacult Eng 89 102053-20
[6]  
Najafi B(2001)Random forests Mach Learn 45 5-1092
[7]  
Shamshirband S(2018)A survey of phishing attacks: their types, vectors and technical approaches Expert Syst Appl 106 1-30
[8]  
Bidgoli BM(2020)Heuristic-based strategy for phishing prediction: a survey of url-based approach Comput Secur 88 101613-446
[9]  
Deo RC(2020)A survey of deep learning and its applications: a new paradigm to machine learning Arch Comput Methods Eng 27 1071-2819
[10]  
Chau KW(2020)Detecting internet of things attacks using distributed deep learning J Netw Comput Appl 163 102662-2159