Distributed Multi-Exemplar Affinity Propagation Based on MapReduce

被引:1
|
作者
Yang, Yu-Bo
Wang, Chang-Dong [1 ]
Lai, Jian-Huang
机构
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou, Guangdong, Peoples R China
来源
2017 THIRD IEEE INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (IEEE BIGDATASERVICE 2017) | 2017年
关键词
Clustering; Multi-exemplar; Affinity propagation; Parallel system; MapReduce; PARALLEL ALGORITHMS;
D O I
10.1109/BigDataService.2017.33
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering algorithm is one of the fundamental techniques in data mining, which plays a crucial role in various applications, such as pattern recognition, document retrieval, and computer vision. As so far, many effective algorithms have been proposed. Affinity Propagation is an algorithm requires no parameter indicating the number of clusters, which is the most distinguishing advantage compared to the k-means clustering algorithm. Multi-Exemplar Affinity Propagation (MEAP) extends the single-exemplar model to the multi-exemplar model, which could describe the dataset with more complex structure. With the amount of data increasing rapidly, the growing size of dataset makes the clustering problem become more and more challenging. To solve this problem, the parallel computing framework is widely used, such as MapReduce. However, for the MEAP algorithm, it is not a straightforward task to implement the updating of MEAP messages in MapReduce, which without proper design would be time-consuming. In this paper, we propose to utilize the stability of data distribution to apply the MEAP algorithm on the MapReduce platform and develop an efficient Distributed Multi-Exemplar Affinity Propagation (DisMEAP) clustering algorithm by using three MapReduce stages. The experiment results demonstrate that our algorithm can perform well in processing large-scale datasets and could achieve the same accuracy as the original MEAP algorithm.
引用
收藏
页码:191 / 197
页数:7
相关论文
共 50 条
  • [31] Distributed Whale Optimization Algorithm based on MapReduce
    Khalil, Yasser
    Alshayeji, Mohammad
    Ahmad, Imtiaz
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (01):
  • [32] A Distributed SVM Method based on the Iterative MapReduce
    Ke, Xijiang
    Jin, Hai
    Xie, Xia
    Cao, Jie
    2015 IEEE 9TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2015, : 116 - 119
  • [33] A temporal based approach for MapReduce distributed testing
    Hsaini, Sara
    Azzouzi, Salma
    Charaf, My El Hassan
    INTERNATIONAL JOURNAL OF PARALLEL EMERGENT AND DISTRIBUTED SYSTEMS, 2021, 36 (04) : 293 - 311
  • [34] Distributed media indexing based on MPI and MapReduce
    Hisham Mohamed
    Stéphane Marchand-Maillet
    Multimedia Tools and Applications, 2014, 69 : 513 - 537
  • [35] Band Selection Algorithm Based on Multi-Feature and Affinity Propagation Clustering
    Zhuang, Junbin
    Chen, Wenying
    Huang, Xunan
    Yan, Yunyi
    REMOTE SENSING, 2025, 17 (02)
  • [36] Multi-objective Differential Evolution Algorithm Based on Affinity Propagation Clustering
    Qu, Dan
    Li, Hongyi
    Chen, Huafei
    IAENG International Journal of Applied Mathematics, 2023, 53 (04)
  • [37] Disaster Image Filtering and Summarization Based on Multi-layered Affinity Propagation
    Yang, Yimin
    Chen, Shu-Ching
    2012 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2012, : 100 - 103
  • [38] A New Distributed Name Disambiguation System Based on MapReduce
    Liu Pengfei
    Ge Sheng
    PROCEEDINGS OF 2012 IEEE 14TH INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY, 2012, : 550 - 554
  • [39] A Distributed Abnormal Packet Generation Engine Based on MapReduce
    Zhang Qi-fei
    Lv Hong-bin
    Pan Xue-zeng
    Wang Chao
    Li Wen-juan
    2012 19TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2012,
  • [40] RECONSTRUCTION OF IMAGES WITH EXEMPLAR BASED IMAGE INPAINTING AND PATCH PROPAGATION
    Ishi, Manoj S.
    Singh, Lokesh
    Agrawal, Manish
    2014 INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS (ICICES), 2014,