A robust dimensionality reduction and matrix factorization framework for data clustering

被引:14
作者
Li, Ruyue [1 ]
Zhang, Lefei [1 ,2 ]
Du, Bo [1 ]
机构
[1] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Sch Comp Sci, Wuhan 430072, Hubei, Peoples R China
[2] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan 430072, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
Clustering; Matrix factorization; Dimensionality reduction; Manifold regularization;
D O I
10.1016/j.patrec.2019.10.006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most existing Non-negative Matrix Factorization (NMF) related data clustering techniques directly decompose the original feature space while have not well considered the fact that the low dimensional feature space is always embedded in the high dimensional feature space and can better reveal the spatial distribution of data. In this letter, we propose a new matrix factorization model, which unites the objectives of clustering and dimensionality reduction simultaneously. In the proposed framework, the clustering based on matrix factorization is actually executed on the embedded subspace which may provide more accurate and reasonable solutions. Furthermore, we use the l(2,1)-norm instead of the conventional l(2)-norm to enhance the clustering results and make the clustering framework more robust to the noises and outliers. Meanwhile, in order to preserve as much as possible local similarity of the data, we have also employed an affinity matrix with special learning to introduce the manifold learning into the cluster indicator matrix. An optimization procedure based on Augmented Lagrangian Method (ALM) is devised to effectively solve the proposed problem and explicitly show the clustering results. Experimental results on the benchmark datasets with different proprieties exhibit the superior performance of the proposed method. (C) 2019 Published by Elsevier B.V.
引用
收藏
页码:440 / 446
页数:7
相关论文
共 32 条
  • [1] A Semi-NMF-PCA Unified Framework for Data Clustering
    Allab, Kais
    Labiod, Lazhar
    Nadif, Mohamed
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2017, 29 (01) : 2 - 16
  • [2] [Anonymous], IEEE T NEURAL NETW L
  • [3] [Anonymous], 2018, IEEE T NEURAL NETWOR, DOI DOI 10.1109/TNNLS.2018.2868847
  • [4] [Anonymous], 2008, IEEE T PATTERN ANAL, DOI DOI 10.1109/tpami.2008.277
  • [5] [Anonymous], 2015, IEEE T NEURAL NETWOR
  • [6] Graph Regularized Nonnegative Matrix Factorization for Data Representation
    Cai, Deng
    He, Xiaofei
    Han, Jiawei
    Huang, Thomas S.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (08) : 1548 - 1560
  • [7] Robust clustering of noisy high-dimensional gene expression data for patients subtyping
    Coretto, Pietro
    Serra, Angela
    Tagliaferri, Roberto
    [J]. BIOINFORMATICS, 2018, 34 (23) : 4064 - 4072
  • [9] Structured sparse K-means clustering via Laplacian smoothing
    Gong, Weikang
    Zhao, Renbo
    Grunewald, Stefan
    [J]. PATTERN RECOGNITION LETTERS, 2018, 112 : 63 - 69
  • [10] Joint Color-Spatial-Directional Clustering and Region Merging (JCSD-RM) for Unsupervised RGB-D Image Segmentation
    Hasnat, Md. Abul
    Alata, Olivier
    Tremeau, Alain
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (11) : 2255 - 2268