Random Feature-Based Collaborative Kernel Fuzzy Clustering for Distributed Peer-to-Peer Networks

被引:6
作者
Wang, Yingxu [1 ]
Han, Shiyuan [1 ]
Zhou, Jin [1 ]
Chen, Long [2 ]
Chen, C. L. Philip [3 ]
Zhang, Tong [3 ]
Liu, Zhulin [3 ]
Wang, Lin [1 ]
Chen, Yuehui [1 ]
机构
[1] Univ Jinan, Shandong Prov Key Lab Network Based Intelligent Co, Jinan 250022, Peoples R China
[2] Univ Macau, Fac Sci & Technol, Dept Comp & Informat Sci, Macau 999078, Peoples R China
[3] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510641, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Kernel; Distributed databases; Peer-to-peer computing; Collaboration; Clustering algorithms; Clustering methods; Prototypes; Collaborative distributed clustering; feature weights; kernel fuzzy clustering; random Fourier feature; C-MEANS; ALGORITHM;
D O I
10.1109/TFUZZ.2022.3188363
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Kernel clustering has the ability to get the inherent nonlinear structure of the data. But the high computational complexity and the unknown representation of the kernel space make it unavailable for the data clustering in distributed peer-to-peer (P2P) networks. To solve this issue, we propose a new series of random feature-based collaborative kernel clustering algorithms in this article. In the most basic algorithm, each node in a distributed P2P network first maps its data into a low-dimensional random feature space with the approximation of the given kernel by using the random Fourier feature mapping method. Then, each node independently searches the clusters with its local data and the collaborative knowledge from its neighbor nodes, and the distributed clustering is performed among all network nodes until reaching the global consensus result, i.e., all nodes have the same cluster centers. In addition, an improved version is designed with assignment of feature weights, which is optimized by the maximum-entropy technique to extract important features for the cluster identification. What's more, to relief the impact of different kernel functions and related parameters on clustering results, the combination of multiple kernels rather than a single kernel is adopted for the low-dimensional approximation, and the optimized weights are assigned to provide the guidance on the choice of the kernels and their parameters and discover significant features at the same time. Experiments on synthetic and real-world datasets show that the proposed methods achieve similar and even better results than the traditional kernel clustering methods on various performance metrics, including the average classification rate, the average normalized mutual information, and the average adjusted rand index. More importantly, the low-dimensional random features approximated to kernels and the distributed clustering mechanism adopted in these methods bring the greatly lower temporal complexity.
引用
收藏
页码:692 / 706
页数:15
相关论文
共 48 条
[1]  
Achlioptas D, 2002, ADV NEUR IN, V14, P335
[2]   Smart Identification of Topographically Variant Anomalies in Brain Magnetic Resonance Imaging Using a Fish School-Based Fuzzy Clustering Approach [J].
Alagarsamy, Saravanan ;
Zhang, Yu-Dong ;
Govindaraj, Vishnuvarthanan ;
Rajasekaran, Murugan Pallikonda ;
Sankaran, Sakthivel .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2021, 29 (10) :3165-3177
[3]  
[Anonymous], 2020, SOFT COMPUT J, V96
[4]  
[Anonymous], 2019, INT J FUZZY SYST, V21, P2132
[5]   A Space Efficient Minimum Spanning Tree Approach to the Fuzzy Joint Points Clustering Algorithm [J].
Atilgan, Can ;
Nasibov, Efendi N. .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2019, 27 (06) :1317-1322
[6]  
Avron H, 2016, J MACH LEARN RES, V17
[7]  
Bache K, 2013, UCI machine learning repository
[8]  
Bezdek J. C., 1981, Pattern recognition with fuzzy objective function algorithms
[9]   Online Distributed Learning Over Networks in RKH Spaces Using Random Fourier Features [J].
Bouboulis, Pantelis ;
Chouvardas, Symeon ;
Theodoridis, Sergios .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2018, 66 (07) :1920-1932
[10]  
Camastra F, 2006, INT C PATT RECOG, P913