A Novel Study: GAN-Based Minority Class Balancing and Machine-Learning-Based Network Intruder Detection Using Chi-Square Feature Selection

被引:7
作者
Alabrah, Amerah [1 ]
机构
[1] King Saud Univ, Coll Comp & Informat Sci, Dept Informat Syst, Riyadh 11451, Saudi Arabia
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 22期
关键词
chi-square; class imbalance; feature selection; GAN; network intruder detection; machine learning; CLASSIFICATION;
D O I
10.3390/app122211662
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The network security problem becomes a routine problem for networks and cyber security specialists. The increased data on every minute not only creates big data problems, but also it expands the network size on the cloud and other computing technologies. Due to the big size and data, the network becomes more vulnerable to cyber-attacks. However, the detection of cyber-attacks on networks before or on time is a challenging task to solve. Therefore, the network intruder detection system (NIDS) is used to detect it. The network provided data-based NIDS were proposed previously, but still needed improvements. From the network data, it is also essential to find the most contributing features to avoid overfitting and lack of confidence in NIDS. The previously proposed solutions of NIDS mostly ignored the class imbalance problems that were normally found in the training of machine learning (ML) methods used in NIDS. However, few studies have tried to solve class imbalance and feature selection separately by achieving significant results on different datasets. The performance of these NIDS needs improvements in terms of classification and class balancing robust solutions. Therefore, to solve the class imbalance problem of minority classes in public datasets of NIDS and to select the most significant features, the proposed study gives a framework. In this framework, the minority class instances are generated using Generative Adversarial Network (GAN) model hyperparameter optimization and then the chi-square method of feature selection is applied to the fed six ML classifiers. The binary and multi-class classifications are applied on the UNSW-NB15 dataset with three versions of it. The comparative analysis on binary, multi-class classifications showed dominance as compared to previous studies in terms of accuracy (98.14%, 87.44%), precision (98.14%, 87.81%), F1-score (98.14%, 86.79%), Geometric-Mean (0.976, 0.923) and Area Under Cover (0.976, 0.94).
引用
收藏
页数:17
相关论文
共 30 条
  • [1] Aditsania A, 2017, 2017 3RD INTERNATIONAL CONFERENCE ON SCIENCE IN INFORMATION TECHNOLOGY (ICSITECH), P533, DOI 10.1109/ICSITech.2017.8257170
  • [2] A Feature Selection Model for Network Intrusion Detection System Based on PSO, GWO, FFA and GA Algorithms
    Almomani, Omar
    [J]. SYMMETRY-BASEL, 2020, 12 (06): : 1 - 20
  • [3] Comparing Oversampling Techniques to Handle the Class Imbalance Problem: A Customer Churn Prediction Case Study
    Amin, Adnan
    Anwar, Sajid
    Adnan, Awais
    Nawaz, Muhammad
    Howard, Newton
    Qadir, Junaid
    Hawalah, Ahmad
    Hussain, Amir
    [J]. IEEE ACCESS, 2016, 4 : 7940 - 7957
  • [4] Anderson J.P., 1980, Computer Security Threat Monitoring and Surveillance
  • [5] Ashrapov I., 2020, arXiv
  • [6] Ashrapov I., 2020, GANS TABULAR DATA
  • [7] Bayerl P.S., 2017, COMMUNITY POLICING A
  • [8] SMOTE: Synthetic minority over-sampling technique
    Chawla, Nitesh V.
    Bowyer, Kevin W.
    Hall, Lawrence O.
    Kegelmeyer, W. Philip
    [J]. 2002, American Association for Artificial Intelligence (16)
  • [9] Ensemble based collaborative and distributed intrusion detection systems: A survey
    Folino, Gianluigi
    Sabatino, Pietro
    [J]. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2016, 66 : 1 - 16
  • [10] A Deep Learning Model for Network Intrusion Detection with Imbalanced Data
    Fu, Yanfang
    Du, Yishuai
    Cao, Zijian
    Li, Qiang
    Xiang, Wei
    [J]. ELECTRONICS, 2022, 11 (06)