Applications of deep learning for phishing detection: a systematic literature review

被引:44
作者
Catal, Cagatay [1 ]
Giray, Gorkem
Tekinerdogan, Bedir [2 ]
Kumar, Sandeep [3 ]
Shukla, Suyash [3 ]
机构
[1] Qatar Univ, Dept Comp Sci & Engn, Doha, Qatar
[2] Wageningen Univ & Res, Informat Technol Grp, Wageningen, Netherlands
[3] Indian Inst Technol Roorkee, Dept Comp Sci & Engn, Roorkee, Uttar Pradesh, India
关键词
Phishing detection; Malicious URL prediction; Deep learning; Machine learning; Systematic literature review (SLR); Cybersecurity; CONVOLUTIONAL NEURAL-NETWORK; FEATURE-SELECTION; CLASSIFICATION; MODEL; ATTACKS;
D O I
10.1007/s10115-022-01672-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Phishing attacks aim to steal confidential information using sophisticated methods, techniques, and tools such as phishing through content injection, social engineering, online social networks, and mobile applications. To avoid and mitigate the risks of these attacks, several phishing detection approaches were developed, among which deep learning algorithms provided promising results. However, the results and the corresponding lessons learned are fragmented over many different studies and there is a lack of a systematic overview of the use of deep learning algorithms in phishing detection. Hence, we performed a systematic literature review (SLR) to identify, assess, and synthesize the results on deep learning approaches for phishing detection as reported by the selected scientific publications. We address nine research questions and provide an overview of how deep learning algorithms have been used for phishing detection from several aspects. In total, 43 journal articles were selected from electronic databases to derive the answers for the defined research questions. Our SLR study shows that except for one study, all the provided models applied supervised deep learning algorithms. The widely used data sources were URL-related data, third party information on the website, website content-related data, and email. The most used deep learning algorithms were deep neural networks (DNN), convolutional neural networks, and recurrent neural networks/long short-term memory networks. DNN and hybrid deep learning algorithms provided the best performance among other deep learning-based algorithms. 72% of the studies did not apply any feature selection algorithm to build the prediction model. PhishTank was the most used dataset among other datasets. While Keras and Tensorflow were the most preferred deep learning frameworks, 46% of the articles did not mention any framework. This study also highlights several challenges for phishing detection to pave the way for further research.
引用
收藏
页码:1457 / 1500
页数:44
相关论文
共 50 条
[21]   Deep Learning-Based Network Intrusion Detection Systems: A Systematic Literature Review [J].
Mutembei, Leonard L. ;
Senekane, Makhamisa C. ;
van Zyl, Terence .
ARTIFICIAL INTELLIGENCE RESEARCH, SACAIR 2024, 2025, 2326 :207-234
[22]   Machine and Deep Learning-based XSS Detection Approaches: A Systematic Literature Review [J].
Thajeel, Isam Kareem ;
Samsudin, Khairulmizam ;
Hashim, Shaiful Jahari ;
Hashim, Fazirulhisyam .
JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (07)
[23]   Crop mapping using supervised machine learning and deep learning: a systematic literature review [J].
Alami Machichi, Mouad ;
Mansouri, Loubna El ;
Imani, Yasmina ;
Bourja, Omar ;
Lahlou, Ouiam ;
Zennayi, Yahya ;
Bourzeix, Francois ;
Hanade Houmma, Ismaguil ;
Hadria, Rachid .
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (08) :2717-2753
[24]   Deep Learning Methods for Malware and Intrusion Detection: A Systematic Literature Review [J].
Ali, Rahman ;
Ali, Asmat ;
Iqbal, Farkhund ;
Hussain, Mohammed ;
Ullah, Farhan .
SECURITY AND COMMUNICATION NETWORKS, 2022, 2022
[25]   Fake News Detection Using Deep Learning: A Systematic Literature Review [J].
Alnabhan, Mohammad Q. ;
Branco, Paula .
IEEE ACCESS, 2024, 12 :114435-114459
[26]   Applications of deep learning in detection of glaucoma: A systematic review [J].
Mirzania, Delaram ;
Thompson, Atalie C. ;
Muir, Kelly W. .
EUROPEAN JOURNAL OF OPHTHALMOLOGY, 2021, 31 (04) :1618-1642
[27]   Business Email Compromise Phishing Detection Based on Machine Learning: A Systematic Literature Review [J].
Atlam, Hany F. ;
Oluwatimilehin, Olayonu .
ELECTRONICS, 2023, 12 (01)
[28]   DEPHIDES: Deep Learning Based Phishing Detection System [J].
Sahingoz, Ozgur Koray ;
Buber, Ebubekir ;
Kugu, Emin .
IEEE ACCESS, 2024, 12 :8052-8070
[29]   A systematic literature review on the significance of deep learning and machine learning in predicting Alzheimer's disease [J].
Kaur, Arshdeep ;
Mittal, Meenakshi ;
Bhatti, Jasvinder Singh ;
Thareja, Suresh ;
Singh, Satwinder .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2024, 154
[30]   A systematic review of literature on credit card cyber fraud detection using machine and deep learning [J].
Btoush E.A.L.M. ;
Zhou X. ;
Gururajan R. ;
Chan K.C. ;
Genrich R. ;
Sankaran P. .
PeerJ Computer Science, 2023, 9