MapReduce FCM clustering set algorithm

被引:12
作者
Kiki, Mesmin J. Mbyamm [1 ]
Zhang, Jianbiao [2 ]
Kouassi, Bonzou Adolphe [2 ]
机构
[1] Beijing Univ Technol, Coll Comp Sci & Technol, Beijing 100124, Peoples R China
[2] Beijing Univ Technol, Coll Elect Informat & Control Engn, Beijing 100124, Peoples R China
来源
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS | 2021年 / 24卷 / 01期
关键词
MapReduce; Cluster integration; Fuzzy C-means; Parallel; Clustering algorithm; PERFORMANCE;
D O I
10.1007/s10586-020-03131-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fuzzy C-means clustering integration algorithm is a method to improve clustering quality by using integration ideas, but as the amount of data increases, its time complexity increases. A parallel FCM clustering integration algorithm based on MapReduce is proposed. The algorithm uses a random initial clustering centre to obtain differentiated cluster members. By establishing an overlapping matrix between clusters, the clustering labels are unified to find logical equivalence clusters. The cluster members share the classification information of the data objects by voting to obtain the final clustering result. The experimental results show that the parallel FCM clustering integration algorithm has good performance, and has high speedup and good scalability.
引用
收藏
页码:489 / 500
页数:12
相关论文
共 17 条
[1]   Handling big data: research challenges and future directions [J].
Anagnostopoulos, I. ;
Zeadally, S. ;
Exposito, E. .
JOURNAL OF SUPERCOMPUTING, 2016, 72 (04) :1494-1516
[2]  
Bhavani R., 2018, MED BIG DATA INTERNE, P189
[3]   Novel fractional order particle swarm optimization [J].
Couceiro, Micael ;
Sivasundaram, Seenith .
APPLIED MATHEMATICS AND COMPUTATION, 2016, 283 :36-54
[4]   Research on Data Stream Clustering Based on FCM Algorithm [J].
Gao, Tiancheng ;
Li, Aihua ;
Meng, Fan .
5TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT, ITQM 2017, 2017, 122 :595-602
[5]   Mapreduce performance model for Hadoop 2.x [J].
Glushkova, Dada ;
Jovanovic, Petar ;
Abello, Alberto .
INFORMATION SYSTEMS, 2019, 79 :32-43
[6]  
Jin S., 2016, ARXIV160806347
[7]   Clustering performance comparison using K-means and expectation maximization algorithms [J].
Jung, Yong Gyu ;
Kang, Min Soo ;
Heo, Jun .
BIOTECHNOLOGY & BIOTECHNOLOGICAL EQUIPMENT, 2014, 28 :S44-S48
[8]   Fuzzy c-means clustering algorithm for directional data (FCM4DD) [J].
Kesemen, Orhan ;
Tezel, Ozge ;
Ozkul, Eda .
EXPERT SYSTEMS WITH APPLICATIONS, 2016, 58 :76-82
[9]   Multigranulation information fusion: A Dempster-Shafer evidence theory-based clustering ensemble method [J].
Li, Feijiang ;
Qian, Yuhua ;
Wang, Jieting ;
Liang, Jiye .
INFORMATION SCIENCES, 2017, 378 :389-409
[10]   MapReduce-based fuzzy c-means clustering algorithm: implementation and scalability [J].
Ludwig, Simone A. .
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2015, 6 (06) :923-934