Optimal feature selection for machine learning based intrusion detection system by exploiting attribute dependence

被引:13
作者
Dubey, Ghanshyam Prasad [1 ]
Bhujade, Rakesh Kumar [1 ]
机构
[1] Mandsaur Univ, Dept CSE, Mandsaur, MP, India
关键词
Feature selection; Mutual information; Correlation; Intrusion detection; Machine learning;
D O I
10.1016/j.matpr.2021.04.643
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Feature Engineering plays an important role in the development of a Machine Learning-based Classifier; especially for Intrusion Detection Systems. It helps in reducing the dimensions of the available datasets, training time, and computation costs; yet improves the performance and detection accuracy of the model. Feature Selection is the most common technique used for reducing the dimensionality of the available dataset. The higher the dimensions of the dataset; the more will be the training time required by the Machine Learning model to process (train and test) the dataset. This paper proposes two approaches for constructing an optimal feature subset, termed Dense_FR and Sparse_FR; to reduce the dimensions of the dataset, based on Kendall's Correlation Coefficient and Mutual Information. Mutual Information is an important and common metric used for Feature Selection. It tries to reduce the amount of uncertainty by incorporating additional attributes. Kendall's Correlation Coefficient is a stricter and consistent correlation coefficient when compared to Pearson's Coefficient or Spearman's Coefficient. The names Dense_FR and Sparse_FR justify the number of features generated in the optimal feature subsets; there are fewer features in the optimal subset generated by the Sparse_FR approach when compared to the Dense_FR approach. Results show that the proposed approaches improve the performance of classification. (c) 2021 Elsevier Ltd. All rights reserved. Selection and peer-review under responsibility of the scientific committee of the Technology Innovation in Mechanical Engineering-2021.
引用
收藏
页码:6325 / 6331
页数:7
相关论文
共 22 条
[1]   Network intrusion detection system: A systematic study of machine learning and deep learning approaches [J].
Ahmad, Zeeshan ;
Shahid Khan, Adnan ;
Wai Shiang, Cheah ;
Abdullah, Johari ;
Ahmad, Farhan .
TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2021, 32 (01)
[2]   A feature reduced intrusion detection system using ANN classifier [J].
Akashdeep ;
Manzoor, Ishfaq ;
Kumar, Neeraj .
EXPERT SYSTEMS WITH APPLICATIONS, 2017, 88 :249-257
[3]   Deep learning approaches for anomaly-based intrusion detection systems: A survey, taxonomy, and open issues [J].
Aldweesh, Arwa ;
Derhab, Abdelouahid ;
Emam, Ahmed Z. .
KNOWLEDGE-BASED SYSTEMS, 2020, 189
[4]   Intrusion Detection Systems, Issues, Challenges, and Needs [J].
Aljanabi, Mohammad ;
Ismail, Mohd Arfian ;
Ali, Ahmed Hussein .
INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2021, 14 (01) :560-571
[5]  
Almseidin M, 2017, I S INTELL SYST INFO, P277, DOI 10.1109/SISY.2017.8080566
[6]  
Anish Halimaa A., 2019, 2019 3rd International Conference on Trends in Electronics and Informatics (ICOEI). Proceedings, P916, DOI 10.1109/ICOEI.2019.8862784
[7]   Feature selection in machine learning: A new perspective [J].
Cai, Jie ;
Luo, Jiawei ;
Wang, Shulin ;
Yang, Sheng .
NEUROCOMPUTING, 2018, 300 :70-79
[8]  
Guha R, 2020, 2020 IEEE CALCUTTA CONFERENCE (CALCON), P54, DOI [10.1109/CALCON49167.2020.9106516, 10.1109/calcon49167.2020.9106516]
[9]   Intrusion Detection System (IDS): Anomaly Detection using Outlier Detection Approach [J].
Jabez, J. ;
Muthukumar, B. .
INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION AND CONVERGENCE (ICCC 2015), 2015, 48 :338-346
[10]  
Karatas G, 2018, 2018 INTERNATIONAL CONGRESS ON BIG DATA, DEEP LEARNING AND FIGHTING CYBER TERRORISM (IBIGDELFT), P113, DOI 10.1109/IBIGDELFT.2018.8625278