Global mutual information-based feature selection approach using single-objective and multi-objective optimization

被引:55
作者
Han, Min [1 ]
Ren, Weijie [1 ]
机构
[1] Dalian Univ Technol, Fac Elect Informat & Elect Engn, Dalian 116024, Liaoning, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature selection; Mutual information; Search strategy; Multi-objective optimization; Time series; PARTICLE SWARM OPTIMIZATION; MULTIVARIATE TIME-SERIES; INPUT VARIABLE SELECTION; GENETIC ALGORITHM; REDUNDANCY; PREDICTION; CLASSIFICATION; RELEVANCE; DIAGNOSIS; FRAMEWORK;
D O I
10.1016/j.neucom.2015.06.016
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection is an important preprocessing step in data mining. Mutual information-based feature selection is a kind of popular and effective approaches. In general, most existing mutual information-based techniques are greedy methods, which are proven to be efficient but suboptimal. In this paper, mutual information-based feature selection is transformed into a global optimization problem, which provides a new idea for solving feature selection problems. First, a single-objective feature selection algorithm combining relevance and redundancy is presented, which has well global searching ability and high computational efficiency. Furthermore, to improve the performance of feature selection, we propose a multi-objective feature selection algorithm. The method can meet different requirements and achieve a tradeoff among multiple conflicting objectives. On this basis, a hybrid feature selection framework is adopted for obtaining a final solution. We compare the performance of our algorithm with related methods on both synthetic and real datasets. Simulation results show the effectiveness and practicality of the proposed method. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:47 / 54
页数:8
相关论文
共 42 条
[1]   Investigation of Feature Selection Techniques for Improving Efficiency of Power Transformer Condition Assessment [J].
Ashkezari, Atefeh Dehghani ;
Ma, Hui ;
Saha, Tapan K. ;
Cui, Yi .
IEEE TRANSACTIONS ON DIELECTRICS AND ELECTRICAL INSULATION, 2014, 21 (02) :836-844
[2]  
Battiti R., 1994, NEURAL NETW IEEE T, V5, P537, DOI DOI 10.1109/72.298224
[3]  
Brown G, 2012, J MACH LEARN RES, V13, P27
[4]   Optimal sub-models selection algorithm for combination forecasting model [J].
Che, JinXing .
NEUROCOMPUTING, 2015, 151 :364-375
[5]   A fast and elitist multiobjective genetic algorithm: NSGA-II [J].
Deb, K ;
Pratap, A ;
Agarwal, S ;
Meyarivan, T .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2002, 6 (02) :182-197
[6]   Mutual information-based feature selection for multilabel classification [J].
Doquire, Gauthier ;
Verleysen, Michel .
NEUROCOMPUTING, 2013, 122 :148-155
[7]   Normalized Mutual Information Feature Selection [J].
Estevez, Pablo. A. ;
Tesmer, Michel ;
Perez, Claudio A. ;
Zurada, Jacek A. .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2009, 20 (02) :189-201
[8]   Mutual-Information-Based Semi-Supervised Hyperspectral Band Selection With High Discrimination, High Information, and Low Redundancy [J].
Feng, Jie ;
Jiao, Licheng ;
Liu, Fang ;
Sun, Tao ;
Zhang, Xiangrong .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2015, 53 (05) :2956-2969
[9]   MULTIVARIATE ADAPTIVE REGRESSION SPLINES [J].
FRIEDMAN, JH .
ANNALS OF STATISTICS, 1991, 19 (01) :1-67
[10]   An evaluation framework for input variable selection algorithms for environmental data-driven models [J].
Galelli, Stefano ;
Humphrey, Greer B. ;
Maier, Holger R. ;
Castelletti, Andrea ;
Dandy, Graeme C. ;
Gibbs, Matthew S. .
ENVIRONMENTAL MODELLING & SOFTWARE, 2014, 62 :33-51