High-order conditional mutual information maximization for dealing with high-order dependencies in feature selection

Cited by: 23
Authors
Souza, Francisco [1]
Premebida, Cristiano [1]
Araujo, Rui [1]
Affiliations
[1] Univ Coimbra, Inst Syst & Robot, Coimbra, Portugal
Keywords
Feature selection; Mutual information; Information theory; Pattern recognition; SHRINKAGE;
DOI
10.1016/j.patcog.2022.108895
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
This paper presents a novel feature selection method based on conditional mutual information (CMI). The proposed High Order Conditional Mutual Information Maximization (HOCMIM) method incorporates high-order dependencies into the feature selection procedure and has a straightforward interpretation due to its bottom-up derivation. HOCMIM is derived from the chain expansion of the CMI and expressed as a maximization problem, which is solved using a greedy search procedure that speeds up the entire feature selection process. The experiments are run on a set of 20 benchmark datasets. HOCMIM is compared with eighteen state-of-the-art feature selection algorithms using the results of two supervised learning classifiers (Support Vector Machine and K-Nearest Neighbor). HOCMIM achieves the best results in terms of accuracy and is shown to be faster than its high-order feature selection counterparts. (c) 2022 Elsevier Ltd. All rights reserved.
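The abstract describes greedy selection driven by conditional mutual information but does not give the HOCMIM criterion itself. The sketch below is therefore not HOCMIM: it illustrates the general idea with a simpler first-order, CMIM-style rule (pick the feature whose worst-case CMI with the class, given any single already-selected feature, is largest), assuming discrete features and plug-in entropy estimates. All function names (`entropy`, `mutual_info`, `cond_mutual_info`, `greedy_cmi_selection`) are hypothetical and chosen for illustration only.

```python
from collections import Counter

import numpy as np


def entropy(*cols):
    """Plug-in joint entropy (in nats) of one or more discrete columns."""
    joint = list(zip(*cols))
    counts = np.array(list(Counter(joint).values()), dtype=float)
    p = counts / len(joint)
    return float(-np.sum(p * np.log(p)))


def mutual_info(x, y):
    """I(X;Y) = H(X) + H(Y) - H(X,Y)."""
    return entropy(x) + entropy(y) - entropy(x, y)


def cond_mutual_info(x, y, z):
    """I(X;Y|Z) = H(X,Z) + H(Y,Z) - H(X,Y,Z) - H(Z)."""
    return entropy(x, z) + entropy(y, z) - entropy(x, y, z) - entropy(z)


def greedy_cmi_selection(X, y, k):
    """Greedy forward selection with a first-order CMIM-style score.

    Assumption: this is an illustrative stand-in, not the HOCMIM criterion.
    The first feature maximizes I(X_i;Y); each later feature maximizes
    min over selected s of I(X_i;Y|X_s).
    """
    remaining = list(range(X.shape[1]))
    selected = []
    while len(selected) < k and remaining:
        scores = {}
        for i in remaining:
            if not selected:
                scores[i] = mutual_info(X[:, i], y)
            else:
                scores[i] = min(
                    cond_mutual_info(X[:, i], y, X[:, s]) for s in selected
                )
        best = max(scores, key=scores.get)
        selected.append(best)
        remaining.remove(best)
    return selected
```

Conditioning on one selected feature at a time keeps each estimate low-dimensional; a high-order method would instead condition on several selected features jointly, which is where the dependencies mentioned in the abstract come in.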
Pages: 13