Spectral clustering with linear embedding: A discrete clustering method for large-scale data

被引:7
作者
Gao, Chenhui [1 ]
Chen, Wenzhi [1 ]
Nie, Feiping [2 ]
Yu, Weizhong [2 ]
Wang, Zonghui [1 ]
机构
[1] Zhejiang Univ, Hangzhou 310027, Peoples R China
[2] Northwestern Polytech Univ, Xian 710072, Peoples R China
关键词
Spectral clustering; Graph embedding; Unsupervised learning;
D O I
10.1016/j.patcog.2024.110396
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent decades, spectral clustering has found widespread applications in various real -world scenarios, showcasing its effectiveness. Traditional spectral clustering typically follows a two-step procedure to address the optimization problem. However, this approach may result in substantial information loss and performance decline. Furthermore, the eigenvalue decomposition, a key step in spectral clustering, entails cubic computational complexity. This paper incorporates linear embedding into the objective function of spectral clustering and proposes a direct method to solve the indicator matrix. Moreover, our method achieves a linear time complexity with respect to the input data size. Our method, referred to as Spectral Clustering with Linear Embedding (SCLE), achieves a direct and efficient solution and naturally handles out -of -sample data. SCLE initiates the process with balanced and hierarchical K -means, effectively partitioning the input data into balanced clusters. After generating anchors, we compute a similarity matrix based on the distances between the input data points and the generated anchors. In contrast to the conventional two-step spectral clustering approach, we directly solve the cluster indicator matrix at a linear time complexity. Extensive experiments across multiple datasets underscore the efficiency and effectiveness of our proposed SCLE method.
引用
收藏
页数:11
相关论文
共 34 条
[1]   Spectral clustering via ensemble deep autoencoder learning (SC-EDAE) [J].
Affeldt, Severine ;
Labiod, Lazhar ;
Nadif, Mohamed .
PATTERN RECOGNITION, 2020, 108
[2]  
Akkem Yaganteeswarudu, 2023, International Conference on Innovative Computing and Communications: Proceedings of ICICC 2023. Lecture Notes in Networks and Systems (703), P665, DOI 10.1007/978-981-99-3315-0_51
[3]   Smart farming using artificial intelligence: A review [J].
Akkem, Yaganteeswarudu ;
Biswas, Saroj Kumar ;
Varanasi, Aruna .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 120
[4]   Refining a k-nearest neighbor graph for a computationally efficient spectral clustering [J].
Alshammari, Mashaan ;
Stavrakakis, John ;
Takatsuka, Masahiro .
PATTERN RECOGNITION, 2021, 114
[5]  
[Anonymous], 2006, P 23 INT C MACHINE L, DOI DOI 10.1145/1143844.1143875
[6]  
[Anonymous], 2016, P INT JOINT C ART IN
[8]   Document clustering using locality preserving indexing [J].
Cai, D ;
He, XF ;
Han, JW .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (12) :1624-1637
[9]  
Ding C., 2007, P INT C MACH LEARN
[10]   A min-max cut algorithm for graph partitioning and data clustering [J].
Ding, CHQ ;
He, XF ;
Zha, HY ;
Gu, M ;
Simon, HD .
2001 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2001, :107-114