Similarity-based constraint score for feature selection

被引:6
作者
Salmi, Abderezak [1 ]
Hammouche, Kamal [1 ]
Macaire, Ludovic [2 ]
机构
[1] Univ Mouloud Mammeri, Lab Vis Artificielle & Automat Syst LVAAS, Tizi Ouzou, Algeria
[2] Univ Lille, UMR 9189, Cent Lille, CRIStAL Ctr Rech Informat Signal & Automat Lille, F-59000 Lille, France
关键词
Constraint score; Feature selection; Pairwise constraints; Similarity matrix; SUPERVISED FEATURE-SELECTION; RELEVANCE; EFFICIENT;
D O I
10.1016/j.knosys.2020.106429
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To avoid the curse of dimensionality resulting from a large number of features, the most relevant features should be selected. Several scores involving must-link and cannot-link constraints have been proposed to estimate the relevance of features. However, these constraint scores evaluate features one by one and ignore any correlation between them. In addition, they compute distance in the high-dimensional original feature space to evaluate similarity between samples. So, they would be corrupted by the curse of dimensionality. To deal with these drawbacks, we propose a new constraint score based on a similarity matrix that is computed in the selected feature subspace and that makes it possible to evaluate the relevance of a feature subset at once. Experiments on benchmark databases demonstrate the improvement brought by the proposed constraint score in the context of both supervised and semi-supervised learnings. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:15
相关论文
共 23 条
[1]  
[Anonymous], 2001, P 18 INT C MACH LEAR
[2]   Ensemble constrained Laplacian score for efficient and robust semi-supervised feature selection [J].
Benabdeslem, Khalid ;
Elghazel, Haytham ;
Hindawi, Mohammed .
KNOWLEDGE AND INFORMATION SYSTEMS, 2016, 49 (03) :1161-1185
[3]   Efficient Semi-Supervised Feature Selection: Constraint, Relevance, and Redundancy [J].
Benabdeslem, Khalid ;
Hindawi, Mohammed .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (05) :1131-1143
[4]  
Benabdeslem K, 2011, LECT NOTES ARTIF INT, V6911, P204, DOI 10.1007/978-3-642-23780-5_23
[5]   Feature selection in machine learning: A new perspective [J].
Cai, Jie ;
Luo, Jiawei ;
Wang, Shulin ;
Yang, Sheng .
NEUROCOMPUTING, 2018, 300 :70-79
[6]  
He X, 2005, P ADV NEUR INF PROC, P507, DOI [10.5555/2976248.2976312, DOI 10.5555/2976248.2976312]
[7]   Constraint Score Evaluation for Spectral Feature Selection [J].
Kalakech, Mariam ;
Biela, Philippe ;
Hamad, Denis ;
Macaire, Ludovic .
NEURAL PROCESSING LETTERS, 2013, 38 (02) :155-175
[8]   Constraint scores for semi-supervised feature selection: A comparative study [J].
Kalakech, Mariam ;
Biela, Philippe ;
Macaire, Ludovic ;
Hamad, Denis .
PATTERN RECOGNITION LETTERS, 2011, 32 (05) :656-665
[9]  
Kamvar S.D., 2003, INT JOINT C ART INT
[10]  
Lichman M., 2013, UCI machine learning repository