Robust multi-view data clustering with multi-view capped-norm K-means

被引:64
作者
Huang, Shudong [1 ]
Ren, Yazhou [1 ]
Xu, Zenglin [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, SMILE Lab, Chengdu 610031, Sichuan, Peoples R China
关键词
Multi-view clustering; Capped-norm; Robust clustering; MATRIX FACTORIZATION; MODELS; SCALE;
D O I
10.1016/j.neucom.2018.05.072
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Real-world data sets are often comprised of multiple representations or views which provide different and complementary aspects of information. Multi-view clustering is an important approach to analyze multi-view data in a unsupervised way. Previous studies have shown that better clustering accuracy can be achieved using integrated information from all the views rather than just relying on each view individually. That is, the hidden patterns in data can be better explored by discovering the common latent structure shared by multiple views. However, traditional multi-view clustering methods are usually sensitive to noises and outliers, which greatly impair the clustering performance in practical problems. Furthermore, existing multi-view clustering methods, e.g. graph-based methods, are with high computational complexity due to the kernel/affinity matrix construction or the eigendecomposition. To address these problems, we propose a novel robust multi-view clustering method to integrate heterogeneous representations of data. To make our method robust to the noises and outliers, especially the extreme data outliers, we utilize the capped-norm loss as the objective. The proposed method is of low complexity, and in the same level as the classic K-means algorithm, which is a major advantage for unsupervised learning. We derive a new efficient optimization algorithm to solve the multi-view clustering problem. Finally, extensive experiments on benchmark data sets show that our proposed method consistently outperforms the state-of-the-art clustering methods. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:197 / 208
页数:12
相关论文
共 52 条
[1]  
[Anonymous], 2011, INT C MACHINE LEARNI
[2]  
[Anonymous], EXP SYST APPL
[3]  
[Anonymous], P INT JOINT C NEUR N
[4]  
[Anonymous], 2013, COMPUTATIONAL IMAGIN
[5]  
[Anonymous], 2012, Adv. Neural Inf. Process. Syst
[6]  
[Anonymous], 2011, P 25 AAAI C ART INT
[7]  
[Anonymous], P INT C MACH LEARN
[8]  
[Anonymous], 2013, P 2013 SIAM INT C DA
[9]  
[Anonymous], 2011, P 20 ACM INT C INF K, DOI DOI 10.1145/2063576.2063676
[10]   Multi-view clustering [J].
Bickel, S ;
Scheffer, T .
FOURTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2004, :19-26