A malicious network traffic detection model based on bidirectional temporal convolutional network with multi-head self-attention mechanism

被引:8
|
作者
Cai, Saihua [1 ,2 ]
Xu, Han [1 ]
Liu, Mingjie [1 ]
Chen, Zhilin [1 ]
Zhang, Guofeng [3 ]
机构
[1] Jiangsu Univ, Sch Comp Sci & Commun Engn, Zhenjiang 212013, Peoples R China
[2] Jiangsu Univ, Jiangsu Key Lab Secur Technol Ind Cyberspace, Zhenjiang 212013, Peoples R China
[3] Taishan Univ, Sch Informat Sci & Technol, Tai An 271000, Peoples R China
基金
中国国家自然科学基金;
关键词
Malicious network traffic detection; Bidirectional temporal convolutional network; Multi -head self -attention mechanism; Cross -entropy loss function; Deep learning;
D O I
10.1016/j.cose.2023.103580
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The increasingly frequent network intrusions have brought serious impacts to the production and life, thus malicious network traffic detection has received more and more attention in recent years. However, the traditional rule matching-based and machine learning-based malicious network traffic detection methods have the problems of relying on human experience as well as low detection efficiency. The continuous development of deep learning technology provides new ideas to solve malicious network traffic detection, and the deep learning models are also widely used in the field of malicious network traffic detection. Compared with other deep learning models, bidirectional temporal convolutional network (BiTCN) has achieved better detection results due to its ability to obtain bidirectional semantic features of network traffic, but it does not consider the different meanings as well as different importance of different subsequence segments in network traffic sequences; In addition, the loss function used in BiTCN is the negative log likelihood function, which may lead to overfitting problems when facing multi-classification problems and data imbalance problems. To solve these problems, this paper proposes a malicious network traffic detection model based on BiTCN and multi-head self-attention (MHSA) mechanism, namely BiTCN_MHSA, it innovatively uses the MHSA mechanism to assign different weights to different subsequences of network traffic, thus making the model more focused on the characteristics of malicious network traffic as well as improving the efficiency of processing global network traffic; Moreover, it also changes its loss function to a cross-entropy loss function to penalize misclassification more severely, thereby speeding up the convergence. Finally, extensive experiments are conduced to evaluate the efficiency of proposed BiTCN_MHSA model on two public network traffic, the experimental results verify that the proposed BiTCN_MHSA model outperforms six state-of-the-arts in precision, recall, F1-measure and accuracy.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Detection of malicious URLs using Temporal Convolutional Network and Multi-Head Self-Attention mechanism
    Nguyet Quang Do
    Selamat, Ali
    Krejcar, Ondrej
    Fujita, Hamido
    APPLIED SOFT COMPUTING, 2025, 169
  • [2] An Anomaly Detection Approach Based on Bidirectional Temporal Convolutional Network and Multi-Head Attention Mechanism
    Wang, Rui
    Li, Jiayao
    INFORMATION TECHNOLOGY AND CONTROL, 2024, 53 (01): : 37 - 52
  • [3] GCN-MHSA: A novel malicious traffic detection method based on graph convolutional neural network and multi-head self-attention mechanism
    Chen, Jinfu
    Xie, Haodi
    Cai, Saihua
    Song, Luo
    Geng, Bo
    Guo, Wuhao
    COMPUTERS & SECURITY, 2024, 147
  • [4] TLS-MHSA: An Efficient Detection Model for Encrypted Malicious Traffic based on Multi-Head Self-Attention Mechanism
    Chen, Jinfu
    Song, Luo
    Cai, Saihua
    Xie, Haodi
    Yin, Shang
    Ahmad, Bilal
    ACM TRANSACTIONS ON PRIVACY AND SECURITY, 2023, 26 (04)
  • [5] CPMA: Spatio-Temporal Network Prediction Model Based on Convolutional Parallel Multi-head Self-attention
    Liu, Tiantian
    You, Xin
    Ma, Ming
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT II, ICIC 2024, 2024, 14876 : 113 - 124
  • [6] Remaining Useful Life Prediction of Bearings Based on Multi-head Self-attention Mechanism, Multi-scale Temporal Convolutional Network and Convolutional Neural Network
    Wei, Hao
    Gu, Yu
    Zhang, Qinghua
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 3027 - 3032
  • [7] AttenEpilepsy: A 2D convolutional network model based on multi-head self-attention
    Ma, Shuang
    Wang, Haifeng
    Yu, Zhihao
    Du, Luyao
    Zhang, Ming
    Fu, Qingxi
    ENGINEERING ANALYSIS WITH BOUNDARY ELEMENTS, 2024, 169
  • [8] MSASGCN : Multi-Head Self-Attention Spatiotemporal Graph Convolutional Network for Traffic Flow Forecasting
    Cao, Yang
    Liu, Detian
    Yin, Qizheng
    Xue, Fei
    Tang, Hengliang
    JOURNAL OF ADVANCED TRANSPORTATION, 2022, 2022
  • [9] Multi-head enhanced self-attention network for novelty detection
    Zhang, Yingying
    Gong, Yuxin
    Zhu, Haogang
    Bai, Xiao
    Tang, Wenzhong
    PATTERN RECOGNITION, 2020, 107
  • [10] Text summarization based on multi-head self-attention mechanism and pointer network
    Qiu, Dong
    Yang, Bing
    COMPLEX & INTELLIGENT SYSTEMS, 2022, 8 (01) : 555 - 567