Adaptive Unsupervised Feature Selection With Structure Regularization

Cited by: 200
Authors
Luo, Minnan [1]
Nie, Feiping [2]
Chang, Xiaojun [3]
Yang, Yi [4]
Hauptmann, Alexander G. [3]
Zheng, Qinghua [1]
Affiliations
[1] Xi An Jiao Tong Univ, Dept Comp Sci, SPKLSTN Lab, Xian 710049, Shaanxi, Peoples R China
[2] Northwestern Polytech Univ, Ctr OPTical Imagery Anal & Learning, Xian 710000, Shaanxi, Peoples R China
[3] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
[4] Univ Technol Sydney, Ctr Quantum Computat & Intelligent Syst, Ultimo, NSW 2007, Australia
Keywords
Adaptive neighbors; dimension reduction; local linear embedding; structure regularization; unsupervised feature selection; NONLINEAR DIMENSIONALITY REDUCTION; CLASSIFICATION; ALGORITHM
DOI
10.1109/TNNLS.2017.2650978
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Feature selection is one of the most important dimension reduction techniques because of its efficiency and interpretability. Since large-scale practical data are usually collected without labels, and labeling these data is expensive and time-consuming, unsupervised feature selection has become a ubiquitous and challenging problem. Without label information, the fundamental problem of unsupervised feature selection lies in how to characterize the geometric structure of the original feature space and produce a faithful feature subset that preserves the intrinsic structure accurately. In this paper, we characterize the intrinsic local structure by an adaptive reconstruction graph and simultaneously consider its multiconnected-components (multi-cluster) structure by imposing a rank constraint on the corresponding Laplacian matrix. To achieve a desirable feature subset, we learn the optimal reconstruction graph and selective matrix simultaneously, instead of using a predetermined graph. We exploit an efficient alternating optimization algorithm to solve the proposed challenging problem, together with theoretical analyses of its convergence and computational complexity. Finally, extensive experiments on clustering tasks are conducted over several benchmark data sets to verify the effectiveness and superiority of the proposed unsupervised feature selection algorithm.
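The rank constraint mentioned in the abstract rests on a standard fact from spectral graph theory: for a graph on n nodes with nonnegative symmetric weights, the Laplacian has rank n - c exactly when the graph has c connected components. The sketch below only illustrates that fact on a hand-made 5-node graph. The weight matrix `W` and the helper names `laplacian` and `count_components` are assumptions made for this example, not taken from the paper, and the code does not implement the paper's optimization algorithm.

```python
import numpy as np


def laplacian(W):
    """Unnormalized graph Laplacian L = D - W for a weight matrix W."""
    W = (W + W.T) / 2.0              # symmetrize, in case W is not symmetric
    return np.diag(W.sum(axis=1)) - W


def count_components(W, tol=1e-8):
    """Count connected components as the multiplicity of eigenvalue 0 of L."""
    eigvals = np.linalg.eigvalsh(laplacian(W))   # L is symmetric, so eigvalsh applies
    return int(np.sum(eigvals < tol))


# Hypothetical toy graph: nodes {0, 1, 2} form one cluster, nodes {3, 4} another.
W = np.zeros((5, 5))
W[0, 1] = W[1, 0] = 1.0
W[1, 2] = W[2, 1] = 0.5
W[3, 4] = W[4, 3] = 2.0

n = W.shape[0]
c = count_components(W)
rank = np.linalg.matrix_rank(laplacian(W))
print(f"components = {c}, rank(L) = {rank}, n - c = {n - c}")
```

Under this reading, requiring rank(L) = n - c on a learned graph forces the graph to split into exactly c groups, which is how a rank constraint can encode a multi-cluster structure.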
Pages: 944-956
Page count: 13