Feature selection for intrusion detection using new multi-objective estimation of distribution algorithms

被引:35
作者
Maza, Sofiane [1 ]
Touahria, Mohamed [1 ]
机构
[1] Univ Ferhat Abbas Setif 1, Dept Comp Sci, Setif 19000, Algeria
关键词
Intrusion detection system; Feature selection; Multi-Objective optimization; Estimation of distribution algorithms; Mutual information; PARTICLE SWARM OPTIMIZATION; MUTUAL INFORMATION; GENETIC ALGORITHM; CLASSIFICATION; MODEL; SUBSET; HYBRID;
D O I
10.1007/s10489-019-01503-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The manipulation of a large number of features has become a critical problem in Intrusion Detection Systems(IDS). Therefore, Feature Selection (FS) is integrated to select the significant features, in order to avoid the computational complexity, and improve the classification performance. In this paper, we present a new multi-objective feature selection algorithm MOEDAFS (Multi-Objective Estimation of Distribution Algorithms (EDA) for Feature Selection). The MOEDAFS is based on EDA and Mutual Information (MI). EDA is used to explore the search space and MI is integrated as a probabilistic model to guide the search by modeling the redundancy and relevance relations between features. Therefore, we propose four probabilistic models for MOEDAFS. MOEDAFS selects the better feature subsets (non-dominated solutions) that have a better detection accuracy and smaller number of features. MOEDAFS uses two objective functions (minimizing classification Error Rate (ER) and minimizing the Number of Features(NF)). In order to demonstrate the performance of MOEDAFS, a comparative study is designed by internal and external comparison on NSL-KDD dataset. Internal comparison is performed between the four versions of MOEDAFS. External comparison is organized against some well-known deterministic, metaheuristic, and multi-objective feature selection algorithms that have a single and Multi-solution. Experimental results demonstrate that MOEDAFS outperforms recent algorithms.
引用
收藏
页码:4237 / 4257
页数:21
相关论文
共 58 条
[21]   Minimum redundancy feature selection from microarray gene expression data [J].
Ding, C ;
Peng, HC .
PROCEEDINGS OF THE 2003 IEEE BIOINFORMATICS CONFERENCE, 2003, :523-528
[22]  
Dong S., 2018, P INT C WIR COMM SIG, P1
[23]  
Eid HF, 2013, COMM COM INF SC, V381, P240
[24]   Comparative Study on Feature Selection Methods rooted in Swarm Intelligence for Intrusion Detection [J].
Enache, Adriana-Cristina ;
Sgarciu, Valentin ;
Togan, Mihai .
2017 21ST INTERNATIONAL CONFERENCE ON CONTROL SYSTEMS AND COMPUTER SCIENCE (CSCS), 2017, :239-244
[25]   Normalized Mutual Information Feature Selection [J].
Estevez, Pablo. A. ;
Tesmer, Michel ;
Perez, Claudio A. ;
Zurada, Jacek A. .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2009, 20 (02) :189-201
[26]   Feature subset selection wrapper based on mutual information and rough sets [J].
Foithong, Sombut ;
Pinngern, Ouen ;
Attachoo, Boonwat .
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (01) :574-584
[27]   An introduction and survey of estimation of distribution algorithms [J].
Hauschild, Mark ;
Pelikan, Martin .
SWARM AND EVOLUTIONARY COMPUTATION, 2011, 1 (03) :111-128
[28]  
Hooks D, 2018, 2018 IEEE 4 INT C BI, P287
[29]   A feature selection approach to find optimal feature subsets for the network intrusion detection system [J].
Kang, Seung-Ho ;
Kim, Kuinam J. .
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2016, 19 (01) :325-333
[30]   Multiobjective Estimation of Distribution Algorithm Based on Joint Modeling of Objectives and Variables [J].
Karshenas, Hossein ;
Santana, Roberto ;
Bielza, Concha ;
Larranaga, Pedro .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2014, 18 (04) :519-542