An Agile Approach to Identify Single and Hybrid Normalization for Enhancing Machine Learning-Based Network Intrusion Detection

被引:25
作者
Siddiqi, Murtaza Ahmed [1 ]
Pak, Wooguil [1 ]
机构
[1] Yeungnam Univ, Dept Informat & Commun, Gyongsan 38541, South Korea
基金
新加坡国家研究基金会;
关键词
Intrusion detection; Mathematical models; Feature extraction; Training; Standards; Statistical analysis; Numerical models; Anomaly detection; Bot-IoT; CIC-IDS; 2017; intrusion detection; IoT; ISCX-IDS; 2012; normalization; NSL KDD; skewness; scaling; transformation; UNSW-NB15;
D O I
10.1109/ACCESS.2021.3118361
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Detecting intrusion in network traffic has remained a problematic task for years. Progress in the field of machine learning is paving the way for enhancing intrusion detection systems. Due to this progress intrusion detection has become an integral part of network security. Intrusion detection has achieved high detection accuracy with the help of supervised machine learning methods. A key factor in enhancing the performance of supervised classifiers is how data is augmented for training the classification model. Data in real-world networks or publicly available datasets are not always normally (Gaussian) distributed. Instead, the distributions of variables are more likely to be skewed. To achieve a high detection rate, data normalization or transformation plays an important role for machine learning-based intrusion detection systems. Several methods are available to normalize the attributes of the data before training a classification model. However, opting for the most suitable normalization technique is still a questionable task. In this paper, a statistical method is proposed that can identify the most suitable normalization method for the dataset. The normalization method identified by the proposed approach gives the highest accuracy for an intrusion detection system. To highlight the efficiency of the proposed method, five different datasets were used with two different feature selection methods. The datasets belong to both Internet of things and traditional network environments. The proposed method is also able to identify hybrid normalizations to achieve even improved intrusion detection results.
引用
收藏
页码:137494 / 137513
页数:20
相关论文
共 59 条
  • [1] Abdulrazaq, 2019, P INT C ADV SCI ENG, P94
  • [2] Mobile Encrypted Traffic Classification Using Deep Learning: Experimental Evaluation, Lessons Learned, and Challenges
    Aceto, Giuseppe
    Ciuonzo, Domenico
    Montieri, Antonio
    Pescape, Antonio
    [J]. IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2019, 16 (02): : 445 - 458
  • [3] Ahemad, 2019, DATA SCI
  • [4] Albulayhi K, 2021, 2021 IEEE WORLD AI IOT CONGRESS (AIIOT), P187, DOI [10.1109/AIIoT52608.2021.9454168, 10.1109/AIIOT52608.2021.9454168]
  • [5] [Anonymous], Minitab Statistical Software
  • [6] Antoch J, 2004, P COMP STAT, P753
  • [7] Bevans R., 2021, P VALUE EXPLAINED
  • [8] A Hierarchical Hybrid Intrusion Detection Approach in IoT Scenarios
    Bovenzi, Giampaolo
    Aceto, Giuseppe
    Ciuonzo, Domenico
    Persico, Valerio
    Pescape, Antonio
    [J]. 2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [9] Brownlee J., 2020, MACHINE LEARNING MAS
  • [10] Brownlee J., 2020, How to Scale Data With Outliers For Machine Learning