An Efficient Spectral Clustering Algorithm Based on Granular-Ball

被引:48
作者
Xie, Jiang [1 ]
Kong, Weiyu [1 ]
Xia, Shuyin [1 ]
Wang, Guoyin [1 ]
Gao, Xinbo [1 ]
机构
[1] Chongqing Univ Telecommun & Posts, Chongqing Key Lab Computat Intelligence, Chongqing 400065, Peoples R China
基金
中国国家自然科学基金;
关键词
Clustering; granular computing; granular-ball; spectral clustering;
D O I
10.1109/TKDE.2023.3249475
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In order to solve the problem that the traditional spectral clustering algorithm is time-consuming and resource consuming when applied to large-scale data, resulting in poor clustering effect or even unable to cluster, this paper proposes a spectral clustering algorithm based on granular-ball(GBSC). The algorithm changes the construction method of the similarity matrix. Based on granular-ball, the size of the similarity matrix is greatly reduced, and the construction of the similarity matrix is more reasonable. Experimental results show that the proposed algorithm achieves better speedup ratio, less memory consumption and stronger anti noise performance while achieving similar clustering results to the traditional spectral clustering algorithm. Suppose the number of granular-balls is m, n is the number of points in the dataset, and m << n, the time complexity of GBSC is O(m(3)). It is proved that GBSC has good adaptability to large-scale datasets. All codes have been released at https://github.com/xjnine/GBSC.
引用
收藏
页码:9743 / 9753
页数:11
相关论文
共 46 条
[1]  
Asuncion A., 2007, UCI Machine Learning Repository
[2]  
Bai Hao, 2022, 2022 7th International Conference on Cloud Computing and Big Data Analytics (ICCCBDA)., P117, DOI 10.1109/ICCCBDA55098.2022.9778902
[3]  
Baraldi A, 1999, IEEE T SYST MAN CY B, V29, P778, DOI 10.1109/3477.809032
[4]  
Berkhin P, 2006, GROUPING MULTIDIMENSIONAL DATA: RECENT ADVANCES IN CLUSTERING, P25
[5]  
Borra Surekha., 2019, Satellite Image Analysis: Clustering and Classification. SpringerBriefs in Applied Sciences and Technology
[6]   Large Scale Spectral Clustering Via Landmark-Based Sparse Representation [J].
Cai, Deng ;
Chen, Xinlei .
IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (08) :1669-1680
[7]   RSMOTE: A self-adaptive robust SMOTE for imbalanced problems with label noise [J].
Chen, Baiyun ;
Xia, Shuyin ;
Chen, Zizhong ;
Wang, Binggui ;
Wang, Guoyin .
INFORMATION SCIENCES, 2021, 553 :397-428
[8]   A new Nystrom approximation based efficient coherent DOA estimator for radar applications [J].
Dakulagi, Veerendra .
AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2020, 124
[9]  
Ding C., 2001, SPECTRAL MIN MAX CUT
[10]  
McDaid AF, 2013, Arxiv, DOI [arXiv:1110.2515, DOI 10.48550/ARXIV.1110.2515]