Distributed Multi-Exemplar Affinity Propagation Based on MapReduce

Cited by: 1
Authors
Yang, Yu-Bo
Wang, Chang-Dong [1]
Lai, Jian-Huang
Affiliations
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou, Guangdong, Peoples R China
Source
2017 THIRD IEEE INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (IEEE BIGDATASERVICE 2017) | 2017
Keywords
Clustering; Multi-exemplar; Affinity propagation; Parallel system; MapReduce; Parallel algorithms
DOI
10.1109/BigDataService.2017.33
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Clustering is one of the fundamental techniques in data mining and plays a crucial role in applications such as pattern recognition, document retrieval, and computer vision. Many effective clustering algorithms have been proposed to date. Affinity Propagation requires no parameter specifying the number of clusters, which is its most distinguishing advantage over the k-means clustering algorithm. Multi-Exemplar Affinity Propagation (MEAP) extends the single-exemplar model to a multi-exemplar model, which can describe datasets with more complex structure. As the amount of data grows rapidly, the increasing size of datasets makes the clustering problem more and more challenging. Parallel computing frameworks such as MapReduce are widely used to address this problem. However, implementing the MEAP message updates in MapReduce is not straightforward, and without proper design it would be time-consuming. In this paper, we propose to exploit the stability of the data distribution to apply the MEAP algorithm on the MapReduce platform, and we develop an efficient Distributed Multi-Exemplar Affinity Propagation (DisMEAP) clustering algorithm that uses three MapReduce stages. The experimental results demonstrate that our algorithm performs well on large-scale datasets and can achieve the same accuracy as the original MEAP algorithm.
Pages: 191-197
Number of pages: 7
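The abstract notes that Affinity Propagation needs no cluster count and that its message updates are awkward to express in MapReduce. As a rough illustration only, the sketch below implements the standard single-exemplar Affinity Propagation updates in plain Python. It is not MEAP and not the three-stage DisMEAP design, neither of which the abstract specifies. The function names, the 1-D squared-distance similarity, and the `preference` value are all assumptions made for this example. The point it shows is that the responsibility update reads one row at a time and the availability update reads one column at a time, so a row-keyed pass followed by a column-keyed pass is one plausible way to split the work across workers.

```python
from typing import List

Matrix = List[List[float]]


def update_responsibilities(S: Matrix, A: Matrix, R: Matrix, damping: float) -> None:
    """Row-wise step: row i of R depends only on row i of S and A."""
    n = len(S)
    for i in range(n):
        scores = [A[i][k] + S[i][k] for k in range(n)]
        best = max(range(n), key=lambda k: scores[k])
        first = scores[best]
        second = max(scores[k] for k in range(n) if k != best)
        for k in range(n):
            # r(i,k) = s(i,k) - max over k' != k of (a(i,k') + s(i,k'))
            new = S[i][k] - (second if k == best else first)
            R[i][k] = damping * R[i][k] + (1.0 - damping) * new


def update_availabilities(A: Matrix, R: Matrix, damping: float) -> None:
    """Column-wise step: column k of A depends only on column k of R."""
    n = len(R)
    for k in range(n):
        pos = [max(0.0, R[i][k]) for i in range(n)]
        pos[k] = R[k][k]  # the self-responsibility is not clipped at zero
        total = sum(pos)
        for i in range(n):
            if i == k:
                # a(k,k) = sum over i' != k of max(0, r(i',k))
                new = total - pos[k]
            else:
                # a(i,k) = min(0, r(k,k) + sum over i' not in {i,k} of max(0, r(i',k)))
                new = min(0.0, total - pos[i])
            A[i][k] = damping * A[i][k] + (1.0 - damping) * new


def affinity_propagation(points: List[float], preference: float,
                         damping: float = 0.5, iterations: int = 200) -> List[int]:
    """Return, for each point, the index of its exemplar (empty list if none emerge).

    Assumes at least two points; similarity is negative squared distance (an
    assumption of this sketch), with `preference` on the diagonal.
    """
    n = len(points)
    S = [[-(points[i] - points[k]) ** 2 for k in range(n)] for i in range(n)]
    for i in range(n):
        S[i][i] = preference
    A = [[0.0] * n for _ in range(n)]
    R = [[0.0] * n for _ in range(n)]
    for _ in range(iterations):
        update_responsibilities(S, A, R, damping)  # could be keyed by row index
        update_availabilities(A, R, damping)       # could be keyed by column index
    exemplars = [k for k in range(n) if A[k][k] + R[k][k] > 0]
    if not exemplars:
        return []
    labels = []
    for i in range(n):
        if i in exemplars:
            labels.append(i)
        else:
            labels.append(max(exemplars, key=lambda k: S[i][k]))
    return labels
```

Calling `affinity_propagation([0.0, 0.1, 0.25, 5.0, 5.1, 5.25], preference=-1.0)` groups the two well-separated runs of points without being told how many clusters to find; a lower `preference` tends to yield fewer exemplars.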