Generalized Uncorrelated Regression with Adaptive Graph for Unsupervised Feature Selection

被引:148
作者
Li, Xuelong [1 ,2 ]
Zhang, Han [1 ,2 ]
Zhang, Rui [3 ,4 ]
Liu, Yun [5 ]
Nie, Feiping [1 ,2 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Shaanxi, Peoples R China
[2] Northwestern Polytech Univ, Ctr OPTical IMagery Anal & Learning OPTIMAL, Xian 710072, Shaanxi, Peoples R China
[3] Chinese Acad Sci, Xian Inst Opt & Precis Mech, State Key Lab Transient Opt & Photon, Ctr OPTical IMagery Anal & Learning, Xian 710119, Shaanxi, Peoples R China
[4] Chinese Acad Sci, Haixi Inst, Quanzhou Inst Equipment Mfg, Quanzhou 362000, Peoples R China
[5] Univ Texas Arlington, Dept Comp Sci & Engn, Arlington, TX 76019 USA
基金
中国国家自然科学基金;
关键词
Generalized uncorrelated constraint; maximum entropy; regression model; unsupervised feature selection; RECOGNITION; FRAMEWORK; FACE;
D O I
10.1109/TNNLS.2018.2868847
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unsupervised feature selection always occupies a key position as a preprocessing in the tasks of classification or clustering due to the existence of extra essential features within high-dimensional data. Although lots of efforts have been made, the existing methods neglect to consider the redundancy of features, and thus select redundant features. In this brief, by virtue of a generalized uncorrelated constraint, we present an improved sparse regression model [generalized uncorrelated regression model (GURM)] for seeking the uncorrelated yet discriminative features. Benefited from this, the structure of data is kept in the Stiefel manifold, which avoids the potential trivial solution triggered by a conventional ridge regression model. Besides that, the uncorrelated constraint equips the model with the closed-form solution. In addition, we also incorporate a graph regularization term based on the principle of maximum entropy into the GURM model (URAFS), so as to embed the local geometric structure of data into the manifold learning. An efficient algorithm is designed to perform URAFS by virtue of the existing generalized powered iteration method. Extensive experiments on eight benchmark data sets among seven state-of-the-art methods on the task of clustering are conducted to verify the effectiveness and superiority of the proposed method.
引用
收藏
页码:1587 / 1595
页数:9
相关论文
共 35 条
[1]  
[Anonymous], 2004, TECH REP
[2]  
[Anonymous], 2010, P 16 ACM SIGKDD INT, DOI [10.1145/1835804.1835848, DOI 10.1145/1835804.1835848]
[3]  
[Anonymous], 2012, P AAAI C ART INT
[4]  
[Anonymous], P NAT C ART INT
[5]  
[Anonymous], P NAT ACAD SCI US
[6]  
[Anonymous], 2014, IEEE INT JOINT C BIO, DOI DOI 10.1109/BTAS.2014.6996222
[7]  
[Anonymous], 2002, Journal of Machine Learning Research
[8]  
[Anonymous], 2013, P 22 INT JOINT C ART, DOI DOI 10.5555/2540128.2540361
[9]  
Gu Q., 2011, Proceedings of the Twenty-Second international joint conference on Artificial Intelligence, V2, P1294
[10]   Feature Selection Based on Structured Sparsity: A Comprehensive Study [J].
Gui, Jie ;
Sun, Zhenan ;
Ji, Shuiwang ;
Tao, Dacheng ;
Tan, Tieniu .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (07) :1490-1507