Detection of malicious URLs using Temporal Convolutional Network and Multi-Head Self-Attention mechanism

被引：0

作者：

Nguyet Quang Do ^{[1
]}

Selamat, Ali ^{[1
,2
,3
,4
]}

Krejcar, Ondrej ^{[1
,4
,5
]}

Fujita, Hamido ^{[1
,6
,7
]}

机构：

[1] Univ Teknol Malaysia, Malaysia Japan Int Inst Technol MJIIT, Kuala Lumpur 50088, Wilayah Perseku, Malaysia

[2] Univ Teknol Malaysia, Fac Comp, Johor Baharu 81310, Johor, Malaysia

[3] Univ Teknol Malaysia, Media & Games Ctr Excellence MagicX, Johor Baharu 81310, Johor, Malaysia

[4] Univ Hradec Kralove, Fac Informat & Management, Ctr Basic & Appl Res, Rokitanskeho 62, Hradec Kralove 50003, Czech Republic

[5] Skoda Auto Univ, Karmeli 1457, Mlada Boleslav 29301, Czech Republic

[6] Univ Hradec Kralove, Fac Sci, Rokitanskeho 62, Hradec Kralove 50003, Czech Republic

[7] Princess Nourah Bint Abdulrahman Univ, Coll Sci, Riyadh, Saudi Arabia

来源：

APPLIED SOFT COMPUTING | 2025年 / 169卷

关键词：

Phishing detection; Malicious URL; Natural language processing; Deep learning; Temporal convolutional network; Attention mechanism;

D O I：

10.1016/j.asoc.2024.112540

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Natural Language Processing (NLP) and Deep Learning (DL) have achieved remarkable results in various fields and have also been proven to be effective in detecting phishing webpages. Inspired by the great success of NLP and DL models in phishing detection-related tasks, we examined the application of these techniques in classifying malicious and benign URLs (Uniform Resource Locator). We found that the existing NLPbased solutions mainly used Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN) for phishing URL detection. However, CNN performs poorly when handling non-spatial data, while RNN cannot capture the long-distance dependency and has higher computational complexity. To overcome these issues, this paper proposes a phishing detection model based on Temporal Convolutional Network (TCN) to address the limitations of the conventional CNN and/or RNN algorithms. The proposed model used character-level and word-level embedding methods to obtain the feature representations of the input URLs. Then, TCN was employed for feature extraction and further enhanced with Multi-Head Self-Attention (MHSA) mechanism to classify legitimate and phishing websites. We conducted several experiments to validate the performance of the proposed model and measured various evaluation metrics. The obtained results showed that our solution performed better than other baseline models in classifying malicious URLs, achieving an accuracy of 98.78%. This implied the proposed approach provided an effective and efficient solution for detecting phishing URLs.

引用

页数：17

共 50 条

[1] A malicious network traffic detection model based on bidirectional temporal convolutional network with multi-head self-attention mechanism
Cai, Saihua
Xu, Han
Liu, Mingjie
Chen, Zhilin
Zhang, Guofeng
COMPUTERS & SECURITY, 2024, 136
[2] GCN-MHSA: A novel malicious traffic detection method based on graph convolutional neural network and multi-head self-attention mechanism
Chen, Jinfu
Xie, Haodi
Cai, Saihua
Song, Luo
Geng, Bo
Guo, Wuhao
COMPUTERS & SECURITY, 2024, 147
[3] Epilepsy detection based on multi-head self-attention mechanism
Ru, Yandong
An, Gaoyang
Wei, Zheng
Chen, Hongming
PLOS ONE, 2024, 19 (06):
[4] Multi-head enhanced self-attention network for novelty detection
Zhang, Yingying
Gong, Yuxin
Zhu, Haogang
Bai, Xiao
Tang, Wenzhong
PATTERN RECOGNITION, 2020, 107
[5] Remaining Useful Life Prediction of Bearings Based on Multi-head Self-attention Mechanism, Multi-scale Temporal Convolutional Network and Convolutional Neural Network
Wei, Hao
Gu, Yu
Zhang, Qinghua
2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 3027 - 3032
[6] Intelligent Micro-Kick Detection Using a Multi-Head Self-Attention Network
Zhang, Dezhi
Sun, Weifeng
Dai, Yongshou
Wang, Dongyue
Guo, Yanliang
Gong, Chentao
PROCESSES, 2025, 13 (02)
[7] An Anomaly Detection Approach Based on Bidirectional Temporal Convolutional Network and Multi-Head Attention Mechanism
Wang, Rui
Li, Jiayao
INFORMATION TECHNOLOGY AND CONTROL, 2024, 53 (01): : 37 - 52
[8] CPMA: Spatio-Temporal Network Prediction Model Based on Convolutional Parallel Multi-head Self-attention
Liu, Tiantian
You, Xin
Ma, Ming
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT II, ICIC 2024, 2024, 14876 : 113 - 124
[9] Text summarization based on multi-head self-attention mechanism and pointer network
Qiu, Dong
Yang, Bing
COMPLEX & INTELLIGENT SYSTEMS, 2022, 8 (01) : 555 - 567
[10] Text summarization based on multi-head self-attention mechanism and pointer network
Dong Qiu
Bing Yang
Complex & Intelligent Systems, 2022, 8 : 555 - 567

← 1 2 3 4 5 →