Distributed Clustering in Wireless Sensor Network with Kernel Based Weighted Fuzzy C-Means Algorithm

Cited by: 0
Authors
Anita Panwar [1]
Satyasai Jagannath Nanda [1]
Affiliation
[1] Department of Electronics and Communication Engineering, Malaviya National Institute of Technology, Rajasthan, Jaipur
Keywords
Cluster weight; Distributed clustering; Feature weight; Fuzzy C-means; Gaussian kernel
DOI
10.1007/s42979-024-03446-4
Abstract
The main limitation of the Fuzzy C-Means (FCM) technique is its sensitivity to noise and outliers, which limits its use in adverse clustering scenarios. The framework reported in this manuscript is based on a kernel-induced distance metric using the Gaussian Radial Basis Function (GRBF), and the proposed algorithm is named the Distributed Kernel-based Weighted Fuzzy C-Means (DKWFCM) algorithm. In wireless sensor networks (WSNs), distributed approaches perform the clustering task across network nodes, mitigating privacy risks, reducing communication overhead, and adapting to network dynamics. Feature-weight and cluster-weight learning are incorporated for more effective cluster analysis in distributed environments. Additionally, DKWFCM leverages diffusion-based learning to enable information processing across multiple wireless sensor nodes. The performance of the proposed DKWFCM algorithm is evaluated on synthetic and real-world datasets distributed over six wireless sensor nodes. Five datasets are used for validation: two synthetic datasets (Circle_3_2, Mixed_3_2) and three real-world datasets (the Cook Agricultural land dataset, the Thames River water quality dataset, and the Canada Weather station dataset). Performance is assessed using the Silhouette Index (SI) and the Dunn Index (DI) as validation measures. Simulation results demonstrate the superior performance of DKWFCM over the distributed FCM baselines, namely the Distributed FCM (DFCM) algorithm and the Distributed Weighted FCM (DWFCM) algorithm. The superiority is evident in the visual clusters obtained at each node, the SI and DI value plots at the sensor nodes, and the average convergence plots. For the two synthetic datasets, the minimum value of the average Euclidean deviation of the proposed DKWFCM algorithm is reduced by 27.73% and 31.86% compared to DFCM, and by 10.11% and 19.40% compared to DWFCM. Similarly, for the three real-world datasets, it is reduced by 99.12%, 97.13%, and 54.29% compared to DFCM, and by 5.69%, 91.67%, and 30.41% compared to DWFCM. These findings suggest that the proposed DKWFCM algorithm improves cluster analysis in distributed processing environments of WSNs. © The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd. 2024.
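To illustrate the kernel-induced distance mentioned in the abstract, the following is a minimal single-node sketch of the standard kernel fuzzy C-means update with a Gaussian kernel, where the squared distance in feature space is 2(1 - K(x, v)). It is not the authors' DKWFCM: the feature-weight and cluster-weight learning and the diffusion step across sensor nodes are omitted, and the function names (gaussian_kernel, kfcm) and parameters (m, sigma, n_iter, tol, seed) are assumptions chosen for illustration.

```python
import numpy as np


def gaussian_kernel(X, V, sigma):
    """Gaussian RBF kernel between points X (n, d) and centers V (c, d)."""
    # Pairwise squared Euclidean distances, shape (n, c).
    sq = ((X[:, None, :] - V[None, :, :]) ** 2).sum(axis=2)
    return np.exp(-sq / (2.0 * sigma ** 2))


def kfcm(X, n_clusters, m=2.0, sigma=1.0, n_iter=100, tol=1e-6, seed=0):
    """Plain kernel FCM on one node (no feature/cluster weights, no diffusion)."""
    rng = np.random.default_rng(seed)
    # Assumption: initialise centers from randomly chosen data points.
    V = X[rng.choice(len(X), size=n_clusters, replace=False)].copy()
    for _ in range(n_iter):
        K = gaussian_kernel(X, V, sigma)
        # Kernel-induced squared distance; clipped to avoid division by zero.
        d2 = np.maximum(2.0 * (1.0 - K), 1e-12)
        # Fuzzy memberships: each row sums to 1.
        U = d2 ** (-1.0 / (m - 1.0))
        U /= U.sum(axis=1, keepdims=True)
        # Center update weighted by membership and kernel value.
        W = (U ** m) * K
        V_new = (W.T @ X) / W.sum(axis=0)[:, None]
        shift = np.abs(V_new - V).max()
        V = V_new
        if shift < tol:
            break
    return U, V
```

A call such as kfcm(X, n_clusters=3, sigma=1.0) returns the membership matrix and the cluster centers. Because the kernel value K shrinks toward zero for points far from a center, distant outliers contribute little to the center update, which is the usual motivation for a kernel-induced distance in noisy settings.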