Semi-supervised neighborhood discrimination index for feature selection

被引:31
作者
Pang, Qing-Qing [1 ,2 ]
Zhang, Li [1 ,2 ,3 ]
机构
[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Jiangsu, Peoples R China
[2] Soochow Univ, Joint Int Res Lab Machine Learning & Neuromorph C, Suzhou 215006, Jiangsu, Peoples R China
[3] Soochow Univ, Prov Key Lab Comp Informat Proc Technol, Suzhou 215006, Jiangsu, Peoples R China
关键词
Semi-supervised; Feature selection; Neighborhood discriminant index; MUTUAL INFORMATION; REGRESSION;
D O I
10.1016/j.knosys.2020.106224
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neighborhood discriminant index (NDI) is an effective feature selection method for supervised learning. In reality, it is easy to obtain unlabeled data and is costly to tag them all. Thus, the given dataset commonly has only a small amount of tagged samples and a large amount of unlabeled ones, which cannot be handled by supervised learning methods. For this situation, we propose a semi-supervised feature selection method called semi-supervised neighborhood discriminant index (SSNDI) that combines NDI and the Laplacian score method to effectively deal with both labeled and unlabeled samples. The goal of SSNDI is to find an optimal feature subset that has a good ability to keep local geometrical structure and to distinguish samples belonging to different classes. In SSNDI, the classical Laplacian score method is modified to cooperate the iterative form of NDI. In each iteration, SSNDI picks up an important feature according to the new criterion that is a mixture of NDI and the modified Laplacian score. Extensive experiments are conducted on UCI and microarray gene datasets. Experimental results confirm that SSNDI can achieve a better performance than NDI and the other state-of-the-art semi-supervised methods. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:11
相关论文
共 41 条
[11]   Class-specific mutual information variation for feature selection [J].
Gao, Wanfu ;
Hu, Liang ;
Zhang, Ping .
PATTERN RECOGNITION, 2018, 79 :328-339
[12]  
GARNAUT R, 1992, ECONOMIC REFORM AND INTERNATIONALISATION: CHINA AND THE PACIFIC REGION, P1
[13]  
Han, 2011, P 27 C UNC ART INT U, P266
[14]   Semisupervised Feature Selection via Spline Regression for Video Semantic Recognition [J].
Han, Yahong ;
Yang, Yi ;
Yan, Yan ;
Ma, Zhigang ;
Sebe, Nicu ;
Zhou, Xiaofang .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (02) :252-264
[15]  
He X, 2005, P ADV NEUR INF PROC, P507, DOI [10.5555/2976248.2976312, DOI 10.5555/2976248.2976312]
[16]   Measuring relevance between discrete and continuous features based on neighborhood mutual information [J].
Hu, Qinghua ;
Zhang, Lei ;
Zhang, David ;
Pan, Wei ;
An, Shuang ;
Pedrycz, Witold .
EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (09) :10737-10750
[17]   Feature clustering based support vector machine recursive feature elimination for gene selection [J].
Huang, Xiaojuan ;
Zhang, Li ;
Wang, Bangjun ;
Li, Fanzhang ;
Zhang, Zhao .
APPLIED INTELLIGENCE, 2018, 48 (03) :594-607
[18]   Semi-Supervised Maximum Discriminative Local Margin for Gene Selection [J].
Li, Zejun ;
Liao, Bo ;
Cai, Lijun ;
Chen, Min ;
Liu, Wenhua .
SCIENTIFIC REPORTS, 2018, 8
[19]   Semi-supervised multi-view clustering with Graph-regularized Partially Shared Non-negative Matrix Factorization [J].
Liang, Naiyao ;
Yang, Zuyuan ;
Li, Zhenni ;
Xie, Shengli ;
Su, Chun-Yi .
KNOWLEDGE-BASED SYSTEMS, 2020, 190
[20]   Online Multi-label Group Feature Selection [J].
Liu, Jinghua ;
Lin, Yaojin ;
Wu, Shunxiang ;
Wang, Chenxi .
KNOWLEDGE-BASED SYSTEMS, 2018, 143 :42-57