Normalized Mutual Information Feature Selection

被引:869
作者
Estevez, Pablo. A. [1 ]
Tesmer, Michel [2 ]
Perez, Claudio A. [1 ]
Zurada, Jacek A. [3 ]
机构
[1] Univ Chile, Dept Elect Engn, Santiago 8370451, Chile
[2] Cruz Blanca SA, Santiago, Chile
[3] Univ Louisville, Dept Comp & Elect Engn, Louisville, KY 40208 USA
来源
IEEE TRANSACTIONS ON NEURAL NETWORKS | 2009年 / 20卷 / 02期
关键词
Feature selection; genetic algorithms; multilayer perceptron (MLP) neural networks; normalized mutual information (MI); GENETIC ALGORITHMS; RELEVANCE;
D O I
10.1109/TNN.2008.2005601
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A filter method of feature selection based on mutual information, called normalized mutual information feature selection (NMIFS), is presented. NMIFS is an enhancement over Battiti's MIFS, MIFS-U, and mRMR methods. The average normalized mutual information is proposed as a measure of redundancy among features. NMIFS outperformed MIFS, MIFS-U, and mRMR on several artificial and benchmark date sets without requiring a user-defined parameter. In addition, MIFS is combined with a genetic algorithm to form a hybrid filter/wrapper method called GAMIFS. This includes an initialization procedure and a mutation operator based on NMIFS to speed up the convergence of the genetic algorithm. GAMIFS overcomes the limitations of incremental search algorithms that are unable to find dependencies between groups of features.
引用
收藏
页码:189 / 201
页数:13
相关论文
共 47 条
[1]  
[Anonymous], Journal of machine learning research
[2]   USING MUTUAL INFORMATION FOR SELECTING FEATURES IN SUPERVISED NEURAL-NET LEARNING [J].
BATTITI, R .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (04) :537-550
[3]  
Bins J, 2001, EIGHTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOL II, PROCEEDINGS, P159, DOI 10.1109/ICCV.2001.937619
[4]   SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation [J].
Blewitt, Marnie E. ;
Gendrel, Anne-Valerie ;
Pang, Zhenyi ;
Sparrow, Duncan B. ;
Whitelaw, Nadia ;
Craig, Jeffrey M. ;
Apedaile, Anwyn ;
Hilton, Douglas J. ;
Dunwoodie, Sally L. ;
Brockdorff, Neil ;
Kay, Graham F. ;
Whitelaw, Emma .
NATURE GENETICS, 2008, 40 (05) :663-669
[5]   Feature selection, mutual information, and the classification of high-dimensional patterns [J].
Bonev, Boyan ;
Escolano, Francisco ;
Cazorla, Miguel .
PATTERN ANALYSIS AND APPLICATIONS, 2008, 11 (3-4) :309-319
[6]  
Box G. E. P., 2003, TIME SERIES ANAL
[7]   FAST GENETIC SELECTION OF FEATURES FOR NEURAL NETWORK CLASSIFIERS [J].
BRILL, FZ ;
BROWN, DE ;
MARTIN, WN .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1992, 3 (02) :324-328
[8]   Estimating optimal feature subsets using efficient estimation of high-dimensional mutual information [J].
Chow, TWS ;
Huang, D .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2005, 16 (01) :213-224
[9]  
COVER T. M., 2006, Elements of information theory, V2nd
[10]  
Dash M., 1997, Intelligent Data Analysis, V1