Distributed Multi-Exemplar Affinity Propagation Based on MapReduce

被引:1
|
作者
Yang, Yu-Bo
Wang, Chang-Dong [1 ]
Lai, Jian-Huang
机构
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou, Guangdong, Peoples R China
来源
2017 THIRD IEEE INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (IEEE BIGDATASERVICE 2017) | 2017年
关键词
Clustering; Multi-exemplar; Affinity propagation; Parallel system; MapReduce; PARALLEL ALGORITHMS;
D O I
10.1109/BigDataService.2017.33
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering algorithm is one of the fundamental techniques in data mining, which plays a crucial role in various applications, such as pattern recognition, document retrieval, and computer vision. As so far, many effective algorithms have been proposed. Affinity Propagation is an algorithm requires no parameter indicating the number of clusters, which is the most distinguishing advantage compared to the k-means clustering algorithm. Multi-Exemplar Affinity Propagation (MEAP) extends the single-exemplar model to the multi-exemplar model, which could describe the dataset with more complex structure. With the amount of data increasing rapidly, the growing size of dataset makes the clustering problem become more and more challenging. To solve this problem, the parallel computing framework is widely used, such as MapReduce. However, for the MEAP algorithm, it is not a straightforward task to implement the updating of MEAP messages in MapReduce, which without proper design would be time-consuming. In this paper, we propose to utilize the stability of data distribution to apply the MEAP algorithm on the MapReduce platform and develop an efficient Distributed Multi-Exemplar Affinity Propagation (DisMEAP) clustering algorithm by using three MapReduce stages. The experiment results demonstrate that our algorithm can perform well in processing large-scale datasets and could achieve the same accuracy as the original MEAP algorithm.
引用
收藏
页码:191 / 197
页数:7
相关论文
共 50 条
  • [1] Multi-Exemplar Affinity Propagation
    Wang, Chang-Dong
    Lai, Jian-Huang
    Suen, Ching Y.
    Zhu, Jun-Yong
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (09) : 2223 - 2237
  • [2] Multi-exemplar affinity propagation clustering based on local density peak
    Shibing Zhou
    Zhewei Chen
    Rao Duan
    Wei Song
    Applied Intelligence, 2024, 54 : 2915 - 2939
  • [3] Multi-exemplar affinity propagation clustering based on local density peak
    Zhou, Shibing
    Chen, Zhewei
    Duan, Rao
    Song, Wei
    APPLIED INTELLIGENCE, 2024, 54 (03) : 2915 - 2939
  • [4] A new color image segmentation algorithm based on mean shift and multi-exemplar affinity propagation
    Xu, Xiaoli
    Yang, Jiagui
    Yin, Hongcai
    ICIC Express Letters, 2015, 9 (09): : 2375 - 2380
  • [5] RETRACTED ARTICLE: Multi-layered multi-exemplar affinity propagation for temporal clustering of human motion
    Shao-Mei Li
    Hui Yu
    Ya-Wen Wang
    Chao Gao
    Multimedia Tools and Applications, 2016, 75 : 16713 - 16713
  • [6] Multi-Exemplar based Clustering for Imbalanced Data
    Wang, Yangtao
    Chen, Lihui
    2014 13TH INTERNATIONAL CONFERENCE ON CONTROL AUTOMATION ROBOTICS & VISION (ICARCV), 2014, : 1068 - 1073
  • [7] Multi-exemplar inhomogeneous texture synthesis
    Park, Hanwook
    Byun, Haewon
    Kim, Changhun
    COMPUTERS & GRAPHICS-UK, 2013, 37 (1-2): : 54 - 64
  • [8] Multi-Exemplar Particle Swarm Optimization
    Song, Wei
    Hua, Ziyu
    IEEE ACCESS, 2020, 8 : 176363 - 176374
  • [9] Visual Object Tracking Based on Multi-exemplar Regression Model
    Zhang Yuanqiang
    Zha Yufei
    Ku Tao
    Wu Min
    Bi Duyan
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2018, 40 (05) : 1202 - 1209
  • [10] RETRACTION: Multi-layered multi-exemplar affinity propagation for temporal clustering of human motion (Retraction of Vol 75, Pg 3422, 2013)
    Li, Shao-Mei
    Yu, Hui
    Wang, Ya-Wen
    Gao, Chao
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (23) : 16713 - 16713