A New Data-Balancing Approach Based on Generative Adversarial Network for Network Intrusion Detection System

被引:9
作者
Jamoos, Mohammad [1 ,2 ]
Mora, Antonio M. [1 ]
AlKhanafseh, Mohammad [3 ]
Surakhi, Ola [4 ]
机构
[1] Univ Granada, Dept Signal Theory Telemat & Commun, Granada 18012, Spain
[2] Al Quds Univ, Sch Sci & Technol, Dept Comp Sci, POB 51000, Jerusalem 51000, Palestine
[3] Birzeit Univ, Dept Comp Sci, Birzeit POB 14, Birzeit, Palestine
[4] Amer Univ Madaba, Dept Comp Sci, Madaba 11821, Jordan
关键词
Generative Adversarial Network; Intrusion Detection System; imbalanced dataset; machine learning; unsupervised learning;
D O I
10.3390/electronics12132851
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An intrusion detection system (IDS) plays a critical role in maintaining network security by continuously monitoring network traffic and host systems to detect any potential security breaches or suspicious activities. With the recent surge in cyberattacks, there is a growing need for automated and intelligent IDSs. Many of these systems are designed to learn the normal patterns of network traffic, enabling them to identify any deviations from the norm, which can be indicative of anomalous or malicious behavior. Machine learning methods have proven to be effective in detecting malicious payloads in network traffic. However, the increasing volume of data generated by IDSs poses significant security risks and emphasizes the need for stronger network security measures. The performance of traditional machine learning methods heavily relies on the dataset and its balanced distribution. Unfortunately, many IDS datasets suffer from imbalanced class distributions, which hampers the effectiveness of machine learning techniques and leads to missed detection and false alarms in conventional IDSs. To address this challenge, this paper proposes a novel model-based generative adversarial network (GAN) called TDCGAN, which aims to improve the detection rate of the minority class in imbalanced datasets while maintaining efficiency. The TDCGAN model comprises a generator and three discriminators, with an election layer incorporated at the end of the architecture. This allows for the selection of the optimal outcome from the discriminators' outputs. The UGR'16 dataset is employed for evaluation and benchmarking purposes. Various machine learning algorithms are used for comparison to demonstrate the efficacy of the proposed TDCGAN model. Experimental results reveal that TDCGAN offers an effective solution for addressing imbalanced intrusion detection and outperforms other traditionally used oversampling techniques. By leveraging the power of GANs and incorporating an election layer, TDCGAN demonstrates superior performance in detecting security threats in imbalanced IDS datasets.
引用
收藏
页数:15
相关论文
共 30 条
[1]  
Alkhanafseh Mohammad Y., 2022, 2022 International Conference on Emerging Trends in Computing and Engineering Applications (ETCEA), P1, DOI 10.1109/ETCEA57049.2022.10009698
[2]  
Alsameraee AAA, 2021, Samarra Journal of Pure and Applied Science, V2, P132, DOI [10.54153/sjpas.2020.v2i3.86, 10.54153/sjpas.2020.v2i3.86, DOI 10.54153/SJPAS.2020.V2I3.86]
[3]   Network Intrusion Detection Model Based on CNN and GRU [J].
Cao, Bo ;
Li, Chenghai ;
Song, Yafei ;
Qin, Yueyi ;
Chen, Chen .
APPLIED SCIENCES-BASEL, 2022, 12 (09)
[4]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[5]   Prefix-preserving IP address anonymization: measurement-based security evaluation and a new cryptography-based scheme [J].
Fan, JL ;
Xu, J ;
Ammar, MH ;
Moon, SB .
COMPUTER NETWORKS, 2004, 46 (02) :253-272
[6]  
Goodfellow I.J., 2014, Generative adversarial networks, DOI [DOI 10.48550/ARXIV.1406.2661, 10.48550/arXiv.1406.2661, 10.48550/ARXIV.1406.2661]
[7]  
Haag P., 2011, NFDUMP NETFLOW PROCE
[8]   A hybrid intrusion detection system based on ABC-AFS algorithm for misuse and anomaly detection [J].
Hajisalem, Vajiheh ;
Babaie, Shahram .
COMPUTER NETWORKS, 2018, 136 :37-50
[9]   ADASYN: Adaptive Synthetic Sampling Approach for Imbalanced Learning [J].
He, Haibo ;
Bai, Yang ;
Garcia, Edwardo A. ;
Li, Shutao .
2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, :1322-1328
[10]   A GAN-Based Anomaly Detection Approach for Imbalanced Industrial Time Series [J].
Jiang, Wenqian ;
Hong, Yang ;
Zhou, Beitong ;
He, Xin ;
Cheng, Cheng .
IEEE ACCESS, 2019, 7 :143608-143619