Anchor-based fast spectral ensemble clustering

被引:11
作者
Zhang, Runxin [1 ,2 ]
Hang, Shuaijun [3 ]
Sun, Zhensheng [4 ]
Nie, Feiping [1 ,2 ,3 ]
Wang, Rong [1 ,2 ]
Li, Xuelong [5 ]
机构
[1] Northwestern Polytech Univ, Sch Artificial Intelligence Opt & Elect iOPEN, Xian 710072, Shaanxi, Peoples R China
[2] Northwestern Polytech Univ, Key Lab Intelligent Interact & Applicat, Minist Ind & Informat Technol, Xian 710072, Shaanxi, Peoples R China
[3] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Shaanxi, Peoples R China
[4] Rocket Force Univ Engn, Zhijian Lab, Xian 710025, Shaanxi, Peoples R China
[5] China Telecom Corp Ltd, Inst Artificial Intelligence TeleAI, Beijing 100033, Peoples R China
基金
中国国家自然科学基金;
关键词
Ensemble clustering; Spectral clustering; Anchor graph; Bipartite graph; MULTIVIEW; ALGORITHM;
D O I
10.1016/j.inffus.2024.102587
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ensemble clustering can obtain better and more robust results by fusing multiple base clusterings, which has received extensive attention. Although many representative algorithms have emerged in recent years, this field still has two tricky problems. First, spectral clustering can identify clusters of arbitrary shapes, but the high time and space complexity limit its application in generating base clusterings. Most existing algorithms utilize k-means to generate base clusterings, and the clustering effect on nonlinearly separable datasets needs further improvement. Second, ensemble clustering algorithms should generate multiple base clusterings. Even if low-complexity algorithms are applied, the running time is also long, which seriously affects the application of ensemble clustering algorithms on large-scale datasets. To tackle these problems, we propose a fast Knearest neighbors approximation method, construct an anchor graph to approximate the similarity matrix, and use singular value decomposition (SVD) instead of eigenvalue decomposition (EVD) to reduce the time and space complexity of conventional spectral clustering. At the same time, we obtain multiple base clusterings by running spectral embedding once. Finally, we convert these base clusterings into a bipartite graph and use transfer cut to get the final clustering results. The proposed algorithms significantly reduce the running time of ensemble clustering. Experimental results on large-scale datasets fully prove the efficiency and superiority of our proposed algorithm.
引用
收藏
页数:13
相关论文
共 50 条
[1]  
Ahmadian S, 2018, 2018 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), P1139, DOI 10.1109/ASONAM.2018.8508723
[2]   Hierarchical Image Segmentation Using Correlation Clustering [J].
Alush, Amir ;
Goldberger, Jacob .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 27 (06) :1358-1367
[3]   A multiple k -means clustering ensemble algorithm to find nonlinearly separable clusters [J].
Bai, Liang ;
Liang, Jiye ;
Cao, Fuyuan .
INFORMATION FUSION, 2020, 61 :36-47
[4]  
Baraldi A, 1999, IEEE T SYST MAN CY B, V29, P778, DOI 10.1109/3477.809032
[5]   Large Scale Spectral Clustering Via Landmark-Based Sparse Representation [J].
Cai, Deng ;
Chen, Xinlei .
IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (08) :1669-1680
[6]   Discriminative Hierarchical K-Means Tree for Large-Scale Image Classification [J].
Chen, Shizhi ;
Yang, Xiaodong ;
Tian, Yingli .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (09) :2200-2205
[7]   Parallel Spectral Clustering in Distributed Systems [J].
Chen, Wen-Yen ;
Song, Yangqiu ;
Bai, Hongjie ;
Lin, Chih-Jen ;
Chang, Edward Y. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (03) :568-586
[8]   Fast and Accurate Hierarchical Clustering Based on Growing Multilayer Topology Training [J].
Cheung, Yiu-ming ;
Zhang, Yiqun .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (03) :876-890
[9]  
Diday E., 1981, Digital Image Processing. Proceedings of the NATO Advanced Study Institute, P19
[10]   Multi-view spectral clustering via constrained nonnegative embedding [J].
El Hajjar, S. ;
Dornaika, F. ;
Abdallah, F. .
INFORMATION FUSION, 2022, 78 :209-217