Detection of malicious URLs using Temporal Convolutional Network and Multi-Head Self-Attention mechanism

被引:0
|
作者
Nguyet Quang Do [1 ]
Selamat, Ali [1 ,2 ,3 ,4 ]
Krejcar, Ondrej [1 ,4 ,5 ]
Fujita, Hamido [1 ,6 ,7 ]
机构
[1] Univ Teknol Malaysia, Malaysia Japan Int Inst Technol MJIIT, Kuala Lumpur 50088, Wilayah Perseku, Malaysia
[2] Univ Teknol Malaysia, Fac Comp, Johor Baharu 81310, Johor, Malaysia
[3] Univ Teknol Malaysia, Media & Games Ctr Excellence MagicX, Johor Baharu 81310, Johor, Malaysia
[4] Univ Hradec Kralove, Fac Informat & Management, Ctr Basic & Appl Res, Rokitanskeho 62, Hradec Kralove 50003, Czech Republic
[5] Skoda Auto Univ, Karmeli 1457, Mlada Boleslav 29301, Czech Republic
[6] Univ Hradec Kralove, Fac Sci, Rokitanskeho 62, Hradec Kralove 50003, Czech Republic
[7] Princess Nourah Bint Abdulrahman Univ, Coll Sci, Riyadh, Saudi Arabia
关键词
Phishing detection; Malicious URL; Natural language processing; Deep learning; Temporal convolutional network; Attention mechanism;
D O I
10.1016/j.asoc.2024.112540
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Natural Language Processing (NLP) and Deep Learning (DL) have achieved remarkable results in various fields and have also been proven to be effective in detecting phishing webpages. Inspired by the great success of NLP and DL models in phishing detection-related tasks, we examined the application of these techniques in classifying malicious and benign URLs (Uniform Resource Locator). We found that the existing NLPbased solutions mainly used Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN) for phishing URL detection. However, CNN performs poorly when handling non-spatial data, while RNN cannot capture the long-distance dependency and has higher computational complexity. To overcome these issues, this paper proposes a phishing detection model based on Temporal Convolutional Network (TCN) to address the limitations of the conventional CNN and/or RNN algorithms. The proposed model used character-level and word-level embedding methods to obtain the feature representations of the input URLs. Then, TCN was employed for feature extraction and further enhanced with Multi-Head Self-Attention (MHSA) mechanism to classify legitimate and phishing websites. We conducted several experiments to validate the performance of the proposed model and measured various evaluation metrics. The obtained results showed that our solution performed better than other baseline models in classifying malicious URLs, achieving an accuracy of 98.78%. This implied the proposed approach provided an effective and efficient solution for detecting phishing URLs.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] A malicious network traffic detection model based on bidirectional temporal convolutional network with multi-head self-attention mechanism
    Cai, Saihua
    Xu, Han
    Liu, Mingjie
    Chen, Zhilin
    Zhang, Guofeng
    COMPUTERS & SECURITY, 2024, 136
  • [2] GCN-MHSA: A novel malicious traffic detection method based on graph convolutional neural network and multi-head self-attention mechanism
    Chen, Jinfu
    Xie, Haodi
    Cai, Saihua
    Song, Luo
    Geng, Bo
    Guo, Wuhao
    COMPUTERS & SECURITY, 2024, 147
  • [3] Epilepsy detection based on multi-head self-attention mechanism
    Ru, Yandong
    An, Gaoyang
    Wei, Zheng
    Chen, Hongming
    PLOS ONE, 2024, 19 (06):
  • [4] Multi-head enhanced self-attention network for novelty detection
    Zhang, Yingying
    Gong, Yuxin
    Zhu, Haogang
    Bai, Xiao
    Tang, Wenzhong
    PATTERN RECOGNITION, 2020, 107
  • [5] Remaining Useful Life Prediction of Bearings Based on Multi-head Self-attention Mechanism, Multi-scale Temporal Convolutional Network and Convolutional Neural Network
    Wei, Hao
    Gu, Yu
    Zhang, Qinghua
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 3027 - 3032
  • [6] Intelligent Micro-Kick Detection Using a Multi-Head Self-Attention Network
    Zhang, Dezhi
    Sun, Weifeng
    Dai, Yongshou
    Wang, Dongyue
    Guo, Yanliang
    Gong, Chentao
    PROCESSES, 2025, 13 (02)
  • [7] An Anomaly Detection Approach Based on Bidirectional Temporal Convolutional Network and Multi-Head Attention Mechanism
    Wang, Rui
    Li, Jiayao
    INFORMATION TECHNOLOGY AND CONTROL, 2024, 53 (01): : 37 - 52
  • [8] CPMA: Spatio-Temporal Network Prediction Model Based on Convolutional Parallel Multi-head Self-attention
    Liu, Tiantian
    You, Xin
    Ma, Ming
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT II, ICIC 2024, 2024, 14876 : 113 - 124
  • [9] Text summarization based on multi-head self-attention mechanism and pointer network
    Qiu, Dong
    Yang, Bing
    COMPLEX & INTELLIGENT SYSTEMS, 2022, 8 (01) : 555 - 567
  • [10] Text summarization based on multi-head self-attention mechanism and pointer network
    Dong Qiu
    Bing Yang
    Complex & Intelligent Systems, 2022, 8 : 555 - 567