LSEC: Large-scale spectral ensemble clustering

被引:6
作者
Li, Hongmin [1 ]
Ye, Xiucai [1 ]
Imakura, Akira [1 ]
Sakurai, Tetsuya [1 ]
机构
[1] Univ Tsukuba, Dept Comp Sci, Tsukuba, Ibaraki, Japan
关键词
Ensemble clustering; spectral clustering; landmark selection; approximate similarity computation; large-scale clustering; COMBINING MULTIPLE CLUSTERINGS;
D O I
10.3233/IDA-216240
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A fundamental problem in machine learning is ensemble clustering, that is, combining multiple base clusterings to obtain improved clustering result. However, most of the existing methods are unsuitable for large-scale ensemble clustering tasks owing to efficiency bottlenecks. In this paper, we propose a large-scale spectral ensemble clustering (LSEC) method to balance efficiency and effectiveness. In LSEC, a large-scale spectral clustering-based efficient ensemble generation framework is designed to generate various base clusterings with low computational complexity. Thereafter, all the base clusterings are combined using a bipartite graph partition-based consensus function to obtain improved consensus clustering results. The LSEC method achieves a lower computational complexity than most existing ensemble clustering methods. Experiments conducted on ten large-scale datasets demonstrate the efficiency and effectiveness of the LSEC method. The MATLAB code of the proposed method and experimental datasets are available at https://github.com/Li-Hongmin/MyPaperWithCode.
引用
收藏
页码:59 / 77
页数:19
相关论文
共 45 条
[1]  
[Anonymous], 2003, ANN INT ACM SIGIR C
[2]  
Asuncion A., 2007, Uci machine learning repository
[3]   Comparative accuracies of artificial neural networks and discriminant analysis in predicting forest cover types from cartographic variables [J].
Blackard, JA ;
Dean, DJ .
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 1999, 24 (03) :131-151
[4]   Large Scale Spectral Clustering Via Landmark-Based Sparse Representation [J].
Cai, Deng ;
Chen, Xinlei .
IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (08) :1669-1680
[5]   Speed up kernel discriminant analysis [J].
Cai, Deng ;
He, Xiaofei ;
Han, Jiawei .
VLDB JOURNAL, 2011, 20 (01) :21-33
[6]   Graph Regularized Nonnegative Matrix Factorization for Data Representation [J].
Cai, Deng ;
He, Xiaofei ;
Han, Jiawei ;
Huang, Thomas S. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (08) :1548-1560
[7]   Parallel Spectral Clustering in Distributed Systems [J].
Chen, Wen-Yen ;
Song, Yangqiu ;
Bai, Hongjie ;
Lin, Chih-Jen ;
Chang, Edward Y. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (03) :568-586
[8]  
Demsar J, 2006, J MACH LEARN RES, V7, P1
[9]  
Fern XZ, 2004, P 21 INT C MACH LEAR
[10]   Spectral grouping using the Nystrom method [J].
Fowlkes, C ;
Belongie, S ;
Chung, F ;
Malik, J .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2004, 26 (02) :214-225