MapReduce FCM clustering set algorithm

被引:0
作者
Mesmin J Mbyamm Kiki
Jianbiao Zhang
Bonzou Adolphe Kouassi
机构
[1] Beijing University of Technology,College of Computer Science and Technology
[2] Beijing University of Technology,College of Electronic Information and Control Engineering
来源
Cluster Computing | 2021年 / 24卷
关键词
MapReduce; Cluster integration; Fuzzy C-means; Parallel; Clustering algorithm;
D O I
暂无
中图分类号
学科分类号
摘要
Fuzzy C-means clustering integration algorithm is a method to improve clustering quality by using integration ideas, but as the amount of data increases, its time complexity increases. A parallel FCM clustering integration algorithm based on MapReduce is proposed. The algorithm uses a random initial clustering centre to obtain differentiated cluster members. By establishing an overlapping matrix between clusters, the clustering labels are unified to find logical equivalence clusters. The cluster members share the classification information of the data objects by voting to obtain the final clustering result. The experimental results show that the parallel FCM clustering integration algorithm has good performance, and has high speedup and good scalability.
引用
收藏
页码:489 / 500
页数:11
相关论文
共 31 条
  • [1] Couceiro M(2016)Novel fractional-order particle swarm optimization Appl. Math. Comput. 283 36-54
  • [2] Sivasundaram S(2018)Evaluation of the genetic algorithm performance for the optimization of the grand potential in the cluster variation method CALPHAD 61 157-164
  • [3] Tamerabet Y(2017)Research on data stream clustering based on fcm algorithm1 Procedia Comput. Sci. 122 595-602
  • [4] Adjadj F(2017)Multigranulation information fusion: a Dempster–Shafer evidence theory-based clustering ensemble method Inf. Sci. 378 389-409
  • [5] Bentrcia T(2016)Handling big data: research challenges and future directions J. Supercomput. 72 1494-1516
  • [6] Gao T(2015)A comprehensive survey of clustering algorithms Ann. Data Sci. 2 165-193
  • [7] Li A(2016)Fuzzy c-means clustering algorithm for directional data (FCM4DD) Expert Syst. Appl. 58 76-82
  • [8] Meng F(2019)Fuzzy clustering: a historical perspective IEEE Comput. Intell. Mag. 14 45-55
  • [9] Li F(2015)MapReduce: simplified data analysis of big data Procedia Comput. Sci, 57 563-571
  • [10] Qian Y(2014)Clustering performance comparison using K-means and expectation-maximization algorithms Biotechnol. Biotechnol. Equip. 28 S44-S48