Fast Approximated Multiple Kernel K-Means

Cited by: 14
Authors
Wang, Jun [1]
Tang, Chang [2, 3]
Zheng, Xiao [1]
Liu, Xinwang [1]
Zhang, Wei [4]
Zhu, En [1]
Zhu, Xinzhong [5]
Affiliations
[1] Natl Univ Def Technol, Sch Comp, Changsha 410073, Peoples R China
[2] China Univ Geosci, Sch Comp Sci, Wuhan 430074, Peoples R China
[3] Xidian Univ, State Key Lab Integrated Serv Networks, Xian 710071, Peoples R China
[4] Qilu Univ Technol, Shandong Comp Sci Ctr, Natl Supercomp Ctr Jinan, Shandong Prov Key Lab Comp Networks, Shandong Acad, Jinan 250000, Peoples R China
[5] Zhejiang Normal Univ, Sch Comp Sci & Technol, Sch Artificial Intelligence, Jinhua 321004, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Kernel; Clustering algorithms; Partitioning algorithms; Computational complexity; Optimization; Linear programming; Matrix decomposition; Multi-view clustering; partition learning; multiple kernel k-means; data fusion;
DOI
10.1109/TKDE.2023.3340743
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Multiple Kernel Clustering (MKC) has emerged as a prominent research domain in recent decades due to its capacity to exploit diverse information from multiple views by learning an optimal kernel. Despite the successes achieved by various MKC methods, a significant challenge lies in the computational complexity associated with generating a consensus partition from the optimal kernel matrix, typically of size n × n, where n represents the number of samples. This computational bottleneck restricts the practical applicability of these methods when confronted with large-scale datasets. Furthermore, certain existing MKC algorithms derive the consensus partition matrix by fusing all base partitions. However, this fusion process may inadvertently overlook critical information embedded in individual base kernels, potentially leading to inferior clustering performance. In light of these challenges, we introduce an innovative and efficient multiple kernel k-means approach, denoted as FAMKKM. Notably, FAMKKM incorporates two approximated partition matrices instead of the original individual partition matrix for each base kernel. This strategic substitution significantly reduces computational complexity. Additionally, FAMKKM leverages the original kernel information to guide the fusion of all base partitions, thereby enhancing the quality of the resulting consensus partition matrix. Finally, we substantiate the efficacy and efficiency of the proposed FAMKKM through extensive experiments conducted on six benchmark datasets. Our results demonstrate its superiority over state-of-the-art methods.
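To make the n × n bottleneck in the abstract concrete, the following is a minimal sketch of a generic uniform-weight multiple kernel k-means baseline, not the FAMKKM algorithm itself. It averages the base kernels and takes the top-k eigenvectors of the combined n × n matrix as a relaxed partition matrix; the full eigendecomposition is what makes this step costly for large n. The function names (`rbf_kernel`, `relaxed_partition`, `uniform_mkkm_partition`), the RBF kernels, the uniform weights, and the toy data are all assumptions made for illustration.

```python
import numpy as np

def rbf_kernel(X, gamma):
    """Dense n x n RBF kernel matrix (hypothetical helper for this sketch)."""
    sq = np.sum(X ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * X @ X.T
    return np.exp(-gamma * np.maximum(d2, 0.0))

def relaxed_partition(K, k):
    """Top-k eigenvectors of a symmetric kernel matrix.

    This is the standard spectral relaxation of kernel k-means.
    A full eigendecomposition of the n x n matrix costs O(n^3).
    """
    _, vecs = np.linalg.eigh(K)  # eigenvalues in ascending order
    return vecs[:, -k:]          # columns for the k largest eigenvalues

def uniform_mkkm_partition(kernels, k):
    """Assumption: equal kernel weights, a simple baseline only."""
    K = sum(kernels) / len(kernels)
    return relaxed_partition(K, k)

# Toy data: two well-separated 2-D blobs, 20 points each (illustrative only).
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0.0, 0.3, (20, 2)), rng.normal(5.0, 0.3, (20, 2))])
kernels = [rbf_kernel(X, g) for g in (0.1, 1.0)]
H = uniform_mkkm_partition(kernels, 2)
```

Here `H` is an n × k matrix with orthonormal columns; discrete cluster labels would typically be obtained by running ordinary k-means on its rows. Methods such as the one described above aim to avoid operating on the full n × n matrix at this step.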
Pages: 6171-6180
Number of pages: 10