Optimal feature selection for machine learning based intrusion detection system by exploiting attribute dependence

Cited by: 13
Authors
Dubey, Ghanshyam Prasad [1 ]
Bhujade, Rakesh Kumar [1 ]
Affiliations
[1] Mandsaur Univ, Dept CSE, Mandsaur, MP, India
Keywords
Feature selection; Mutual information; Correlation; Intrusion detection; Machine learning;
DOI
10.1016/j.matpr.2021.04.643
Chinese Library Classification (CLC) number
T [Industrial Technology];
Discipline classification code
08;
Abstract
Feature Engineering plays an important role in the development of a Machine Learning-based Classifier, especially for Intrusion Detection Systems. It reduces the dimensionality of the available datasets, the training time, and the computation cost, while improving the performance and detection accuracy of the model. Feature Selection is the most common technique for reducing the dimensionality of a dataset. The higher the dimensionality of the dataset, the more time the Machine Learning model needs to process (train and test on) it. This paper proposes two approaches, termed Dense_FR and Sparse_FR, for constructing an optimal feature subset that reduces the dimensions of the dataset, based on Kendall's Correlation Coefficient and Mutual Information. Mutual Information is an important and common metric for Feature Selection; it quantifies the reduction in uncertainty obtained by incorporating additional attributes. Kendall's Correlation Coefficient is a stricter and more consistent correlation coefficient than Pearson's Coefficient or Spearman's Coefficient. The names Dense_FR and Sparse_FR reflect the number of features in the resulting optimal subsets: the Sparse_FR approach yields fewer features than the Dense_FR approach. Results show that the proposed approaches improve classification performance. (c) 2021 Elsevier Ltd. All rights reserved. Selection and peer-review under responsibility of the scientific committee of the Technology Innovation in Mechanical Engineering-2021.
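The abstract names Mutual Information and Kendall's Correlation Coefficient as the two ingredients but does not spell out the Dense_FR or Sparse_FR procedures. The following is therefore only a minimal sketch of one generic way such a filter could combine the two measures, not the authors' method. It assumes discrete feature values, ranks features by mutual information with the class label, and drops any feature whose absolute Kendall tau-b with an already kept feature reaches a threshold. The function names, the toy data, and the `tau_threshold` value are all hypothetical.

```python
import math
from collections import Counter
from itertools import combinations


def kendall_tau_b(x, y):
    """Kendall's tau-b for two equal-length numeric sequences (O(n^2) pairwise form)."""
    concordant = discordant = tied_x_only = tied_y_only = 0
    for i, j in combinations(range(len(x)), 2):
        dx, dy = x[i] - x[j], y[i] - y[j]
        if dx == 0 and dy == 0:
            continue  # tied in both variables: excluded from both denominator terms
        if dx == 0:
            tied_x_only += 1
        elif dy == 0:
            tied_y_only += 1
        elif dx * dy > 0:
            concordant += 1
        else:
            discordant += 1
    untied = concordant + discordant
    denom = math.sqrt((untied + tied_x_only) * (untied + tied_y_only))
    return (concordant - discordant) / denom if denom else 0.0


def mutual_information(x, y):
    """Mutual information in bits between two discrete sequences."""
    n = len(x)
    count_x, count_y, count_xy = Counter(x), Counter(y), Counter(zip(x, y))
    return sum(
        (c / n) * math.log2(c * n / (count_x[a] * count_y[b]))
        for (a, b), c in count_xy.items()
    )


def select_features(columns, labels, tau_threshold=0.8):
    """Hypothetical filter: rank by MI with the label, skip redundant features."""
    ranked = sorted(
        columns,
        key=lambda name: mutual_information(columns[name], labels),
        reverse=True,
    )
    kept = []
    for name in ranked:
        # Keep a feature only if it is not strongly rank-correlated with a kept one.
        if all(
            abs(kendall_tau_b(columns[name], columns[k])) < tau_threshold
            for k in kept
        ):
            kept.append(name)
    return kept


# Toy data (invented for illustration): f2 duplicates f1, f3 is weakly informative.
columns = {
    "f1": [0, 0, 1, 1, 2, 2],
    "f2": [0, 0, 1, 1, 2, 2],
    "f3": [1, 0, 1, 0, 1, 0],
}
labels = [0, 0, 0, 1, 1, 1]

selected = select_features(columns, labels)
print(selected)
```

In this sketch a lower `tau_threshold` removes more features and a higher one removes fewer. That is one plausible way to obtain sparser or denser subsets, but whether the paper's two approaches differ in this way is not stated in the abstract.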
Pages: 6325-6331
Number of pages: 7
Related papers
50 in total
  • [1] INTRUSION DETECTION BASED ON MACHINE LEARNING AND FEATURE SELECTION
    Alaoui, Souad
    El Gonnouni, Amina
    Lyhyaoui, Abdelouahid
    MENDEL 2011 - 17TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING, 2011, : 199 - 206
  • [2] Lightweight Intrusion Detection Based on Hybrid Feature Selection Machine Learning
    Xia, Guoxin
    Zhao, Yanqiao
    Han, Chaohui
    Zhao, Xiaosong
    Zhang, Lei
    39TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION, YAC 2024, 2024, : 1392 - 1395
  • [3] Automatic Feature Extraction and Selection For Machine Learning Based Intrusion Detection
    Liu, Jinjie
    Chung, Sun Sunnie
    2019 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI 2019), 2019, : 1400 - 1405
  • [4] Machine learning-based intrusion detection: feature selection versus feature extraction
    Ngo, Vu-Duc
    Vuong, Tuan-Cuong
    Van Luong, Thien
    Tran, Hung
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (03): : 2365 - 2379
  • [5] INTRUSION DETECTION SYSTEM BASED ON FEATURE SELECTION AND SUPPORT VECTOR MACHINE
    Zhang Xue-qin
    Gu Chun-hua
    Lin Jia-jun
    2006 FIRST INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND NETWORKING IN CHINA, 2006,
  • [6] Optimizing IoT intrusion detection system: feature selection versus feature extraction in machine learning
    Li, Jing
    Othman, Mohd Shahizan
    Chen, Hewan
    Yusuf, Lizawati Mi
    JOURNAL OF BIG DATA, 2024, 11 (01)
  • [7] Optimizing IoT intrusion detection system: feature selection versus feature extraction in machine learning
    Jing Li
    Mohd Shahizan Othman
    Hewan Chen
    Lizawati Mi Yusuf
    Journal of Big Data, 11
  • [8] Feature Selection and Intrusion Detection in Cloud Environment based on Machine Learning Algorithms
    Javadpour, Amir
    Abharian, Sanaz Kazemi
    Wang, Guojun
    2017 15TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS AND 2017 16TH IEEE INTERNATIONAL CONFERENCE ON UBIQUITOUS COMPUTING AND COMMUNICATIONS (ISPA/IUCC 2017), 2017, : 1417 - 1421
  • [9] Network Intrusion Detection Leveraging Machine Learning and Feature Selection
    Ali, Arshid
    Shaukat, Shahtaj
    Tayyab, Muhammad
    Khan, Muazzam A.
    Khan, Jan Sher
    Arshad
    Ahmad, Jawad
    2020 IEEE 17TH INTERNATIONAL CONFERENCE ON SMART COMMUNITIES: IMPROVING QUALITY OF LIFE USING ICT, IOT AND AI (IEEEHONET 2020), 2020, : 49 - 53
  • [10] Robust machine learning based Intrusion detection system using simple statistical techniques in feature selection
    Kaushik, Sunil
    Bhardwaj, Akashdeep
    Almogren, Ahmad
    Bharany, Salil
    Altameem, Ayman
    Rehman, Ateeq Ur
    Hussen, Seada
    Hamam, Habib
    SCIENTIFIC REPORTS, 2025, 15 (01):