R2CI: Information theoretic-guided feature selection with multiple correlations

被引:31
作者
Wan, Jihong [1 ,2 ,3 ]
Chen, Hongmei [1 ,2 ,3 ]
Li, Tianrui [1 ,2 ,3 ]
Huang, Wei [1 ,2 ,3 ]
Li, Min [1 ,2 ,3 ,4 ]
Luo, Chuan [5 ]
机构
[1] Southwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu 611756, Peoples R China
[2] Southwest Jiaotong Univ, Natl Engn Lab Integrated Transportat Big Data App, Chengdu 611756, Peoples R China
[3] Southwest Jiaotong Univ, Mfg Ind Chains Collaborat & Informat Support Tech, Chengdu 611756, Peoples R China
[4] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Computat Intelligence, Chongqing 400065, Peoples R China
[5] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature selection; Information theory; Relevance; Redundancy; Complementarity; Interaction; MUTUAL INFORMATION; RELEVANCE;
D O I
10.1016/j.patcog.2022.108603
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Information theoretic-guided feature selection approaches (ITFSs), which exploit the uncertainty of information to measure the correlation of features, aim to select the most informative features. However, most previous approaches suffer from two drawbacks. 1) Complementarity and interaction are not valued, leading to features with potential discriminatory information for learning tasks such as classification not being excavated and affecting the effectiveness of learning. 2) The various correlations that exist between features for the class have not been fully considered, and their differentiation and relationships have not been well reflected. To address the former issue, guided by information theory, the complementarity and interaction between features are studied. For the latter, firstly, some ITFSs are reviewed and analyzed in terms of feature correlation. The analysis reveals that considering feature multi-correlation is absent in the selection process. Motivated by this problem, a feature selection algorithm with class-based relevance, redundancy, complementarity, and interaction (R2CI) is designed for the first time. Moreover, the distinctions and connections among different correlations are also explored. The results of comparisons and hypothesis test against competitive algorithms show that R2CI has significant advantages in most cases.(c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:16
相关论文
共 40 条
[1]   USING MUTUAL INFORMATION FOR SELECTING FEATURES IN SUPERVISED NEURAL-NET LEARNING [J].
BATTITI, R .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (04) :537-550
[2]   Feature selection using Joint Mutual Information Maximisation [J].
Bennasar, Mohamed ;
Hicks, Yulia ;
Setchi, Rossitza .
EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (22) :8520-8532
[3]   Feature Interaction Maximisation [J].
Bennasar, Mohamed ;
Setchi, Rossitza ;
Hicks, Yulia .
PATTERN RECOGNITION LETTERS, 2013, 34 (14) :1630-1635
[4]   Maximum relevance minimum common redundancy feature selection for nonlinear data [J].
Che, Jinxing ;
Yang, Youlong ;
Li, Li ;
Bai, Xuying ;
Zhang, Shenghu ;
Deng, Chengzhi .
INFORMATION SCIENCES, 2017, 409 :68-86
[5]   Feature selection with redundancy-complementariness dispersion [J].
Chen, Zhijun ;
Wu, Chaozhong ;
Zhang, Yishi ;
Huang, Zhen ;
Ran, Bin ;
Zhong, Ming ;
Lyu, Nengchao .
KNOWLEDGE-BASED SYSTEMS, 2015, 89 :203-217
[6]   A survey on text generation using generative adversarial networks [J].
de Rosa, Gustavo H. ;
Papa, Joao P. .
PATTERN RECOGNITION, 2021, 119
[7]  
El Akadi A, 2008, INT J COMPUT SCI NET, V8, P116
[8]   Normalized Mutual Information Feature Selection [J].
Estevez, Pablo. A. ;
Tesmer, Michel ;
Perez, Claudio A. ;
Zurada, Jacek A. .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2009, 20 (02) :189-201
[9]  
Fleuret F, 2004, J MACH LEARN RES, V5, P1531
[10]   Feature selection considering the composition of feature relevancy [J].
Gao, Wanfu ;
Hu, Liang ;
Zhang, Ping ;
He, Jialong .
PATTERN RECOGNITION LETTERS, 2018, 112 :70-74