Unsupervised feature selection via low-rank approximation and structure learning

被引:57
作者
Wang, Shiping [1 ,2 ]
Wang, Han [2 ]
机构
[1] Fuzhou Univ, Coll Math & Comp Sci, Fuzhou 350116, Peoples R China
[2] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore, Singapore
基金
中国国家自然科学基金;
关键词
Machine learning; Feature selection; Unsupervised learning; Low-rank approximation; Structure learning;
D O I
10.1016/j.knosys.2017.03.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection is an important research topic in machine learning and computer vision in that it can reduce the dimensionality of input data and improve the performance of learning algorithms. Low-rank approximation techniques can well exploit the low-rank property of input data, which coincides with the internal consistency of dimensionality reduction. In this paper, we propose an efficient unsupervised feature selection algorithm, which incorporates low-rank approximation as well as structure learning. First, using the self-representation of data matrix, we formalize the feature selection problem as a matrix factorization with low-rank constraints. This matrix factorization formulation also embeds structure learning regularization as well as a sparse regularized term. Second, we present an effective technique to approximate low-rank constraints and propose a convergent algorithm in a batch mode. This technique can serve as an algorithmic framework for general low-rank recovery problems as well. Finally, the proposed algorithm is validated in twelve publicly available datasets from machine learning repository. Extensive experimental results demonstrate that the proposed method is capable to achieve competitive performance compared to existing state-of-the-art feature selection methods in terms of clustering performance. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:70 / 79
页数:10
相关论文
共 46 条
[1]  
[Anonymous], IEEE TRANS SYST MAN
[2]  
[Anonymous], IEEE T CYBERN
[3]  
[Anonymous], 2011, IJCAI INT JOINT C AR
[4]  
[Anonymous], IEEE T NEURAL NETWOR
[5]  
[Anonymous], 2013, IJCAI
[6]  
[Anonymous], 2011, P 28 INT C MACH LEAR
[7]  
[Anonymous], 2012, ARXIV12071404
[8]   Graph Regularized Nonnegative Matrix Factorization for Data Representation [J].
Cai, Deng ;
He, Xiaofei ;
Han, Jiawei ;
Huang, Thomas S. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (08) :1548-1560
[9]   Consistency-based search in feature selection [J].
Dash, M ;
Liu, HA .
ARTIFICIAL INTELLIGENCE, 2003, 151 (1-2) :155-176
[10]   Hessian eigenmaps: Locally linear embedding techniques for high-dimensional data [J].
Donoho, DL ;
Grimes, C .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2003, 100 (10) :5591-5596