Robust unsupervised feature selection via matrix factorization

被引：46

作者：

Du, Shiqiang ^{[1
,2
]}

Ma, Yide ^{[1
]}

Li, Shouliang ^{[1
]}

Ma, Yurun ^{[1
]}

机构：

[1] Lanzhou Univ, Sch Informat Sci & Engn, Lanzhou 730000, Peoples R China

[2] Northwest Univ Nationalities, Sch Math & Comp Sci, Lanzhou 730030, Peoples R China

来源：

NEUROCOMPUTING | 2017年 / 241卷

基金：

高等学校博士学科点专项科研基金; 中国国家自然科学基金;

关键词：

Unsupervised feature selection; Matrix factorization; Manifold regularization; l(2,1)-norm; LOW-RANK REPRESENTATION; FORMULATION; INFORMATION;

D O I：

10.1016/j.neucom.2017.02.034

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Dimensionality reduction is a challenging task for high-dimensional data processing in machine learning and data mining. It can help to reduce computation time, save storage space and improve the performance of learning algorithms. As an effective dimension reduction technique, unsupervised feature selection aims at finding a subset of features to retain the most relevant information. In this paper, we propose a novel unsupervised feature selection method, called Robust Unsupervised Feature Selection via Matrix Factorization (RUFSM), in which robust discriminative feature selection and robust clustering are performed simultaneously under l(2),(1)-norm while the local manifold structures of data are preserved. The advantages of this work are three-fold. Firstly, both the latent orthogonal cluster centers and the sparse representation of the projected data points based on matrix factorization are predicted for selecting robust discriminative features. Secondly, the feature selection and the clustering are performed simultaneously to guarantee an overall optimum. Thirdly, an efficient iterative update algorithm, which is based on Alternating Direction Method of Multipliers (ADMM), is used for RUFSM optimization. Compared with several state-of-the-art unsupervised feature selection methods, the proposed algorithm comes with better clustering performance for almost all datasets we have experimented with here. (C) 2017 Elsevier B.V. All rights reserved.

引用

页码：115 / 127

页数：13

共 48 条

[11] Locally Consistent Concept Factorization for Document Clustering [J].

Cai, Deng ;

He, Xiaofei ;

Han, Jiawei .

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (06) :902-913

[12] Graph Regularized Nonnegative Matrix Factorization for Data Representation [J].

Cai, Deng ;

He, Xiaofei ;

Han, Jiawei ;

Huang, Thomas S. .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (08) :1548-1560

[13]

Chang XJ, 2014, AAAI CONF ARTIF INTE, P1171

[14]

Duch W, 2004, IEEE IJCNN, P1415

[15]

Dy JG, 2004, J MACH LEARN RES, V5, P845

[16]

Eldén L, 2007, FUND ALGORITHMS, V4, pIX, DOI 10.1137/1.9780898718867

[17] Sparse Subspace Clustering: Algorithm, Theory, and Applications [J].

Elhamifar, Ehsan ;

Vidal, Rene .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (11) :2765-2781

[18] Finding Structure with Randomness: Probabilistic Algorithms for Constructing Approximate Matrix Decompositions [J].

Halko, N. ;

Martinsson, P. G. ;

Tropp, J. A. .

SIAM REVIEW, 2011, 53 (02) :217-288

[19]

He X., 2005, P 18 INT C NEUR INF, P507

[20] Robust Manifold Nonnegative Matrix Factorization [J].

Huang, Jin ;

Nie, Feiping ;

Huang, Heng ;

Ding, Chris .

ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2014, 8 (03)

← 1 2 3 4 5 →