An efficient parallel clustering algorithm for large scale database

被引:0
|
作者
School of Electronic Information, Wuhan University, Wuhan, Hubei, China [1 ]
不详 [2 ]
不详 [3 ]
机构
[1] School of Electronic Information, Wuhan University, Wuhan, Hubei
[2] Hubei Bureau of Surveying and Mapping, Wuhan, Hubei
[3] PRC Education, Intel China Ltd., Shanghai
来源
J. Softw. | 2009年 / 10卷 / 1119-1126期
关键词
Clustering; Parallel pattern; Parallel processing; Performance analysis; SLPP; SLPPCA;
D O I
10.4304/jsw.4.10.1119-1126
中图分类号
学科分类号
摘要
In this paper, we propose a new parallel clustering algorithm, named Stem-Leaf-Point Plot Clustering Algorithm (SLPPCA). SLPPCA tends to produce clusters of different shapes and sizes, and according to our experiments, it can produces clusters more efficiently than traditional methods. SLPPCA can fully exploits the data-parallelism of data objects, and adopts a task decomposition design step to balance the workloads of multi-core processors to achieve a high speedup. We implemented SLPPCA to large scale data base on duo-core processor and quad-core processor based computer separately and analyzed its performance. The experimental results show that the clusters it produced were particularly good either in different density or shapes, furthermore, with the parallel pattern used in SLPPCA on multi-core platform, the speedup was almost linear with the numbers of cores in processor and the number of data points. Moreover, SLPPCA can generate satisfactory cluster number automatically in clustering process. © 2009 Academy Publisher.
引用
收藏
页码:1119 / 1126
页数:7
相关论文
共 50 条
  • [21] A PARALLEL DOMAIN DECOMPOSITION ALGORITHM FOR LARGE SCALE IMAGE DENOISING
    Chen, Rongliang
    Huang, Jizu
    Cai, Xiao-Chuan
    INVERSE PROBLEMS AND IMAGING, 2019, 13 (06) : 1259 - 1282
  • [22] Efficient Classifying and Indexing for Large Iris Database Based on Enhanced Clustering Method
    Khalaf, Emad Taha
    Mohammad, Muamer N.
    Moorthy, Kohbalan
    Khalaf, Ahmad Taha
    STUDIES IN INFORMATICS AND CONTROL, 2018, 27 (02): : 191 - 200
  • [23] A Parallel Clustering Algorithm for Placement
    Momeni, Amir
    Mistry, Perhaad
    Kaeli, David
    PROCEEDINGS OF THE FIFTEENTH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED 2014), 2015, : 349 - 356
  • [24] Rapid Trend Prediction for Large-Scale Cloud Database KPIs by Clustering
    Wang, Xiaoling
    Li, Ning
    Zhang, Lijun
    Zhang, Xiaofang
    Zhao, Qiong
    2021 IEEE/ACM INTERNATIONAL WORKSHOP ON CLOUD INTELLIGENCE (CLOUDINTELLIGENCE 2021), 2021, : 1 - 6
  • [25] An efficient clustering algorithm
    Jiang, SY
    Xu, YM
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1513 - 1518
  • [26] Parallel algorithms for clustering high-dimensional large-scale datasets
    Nagesh, H
    Goil, S
    Choudhary, A
    DATA MINING FOR SCIENTIFIC AND ENGINEERING APPLICATIONS, 2001, 2 : 335 - 356
  • [27] Efficient parallel implementation of a density peaks clustering algorithm on graphics processing unit
    Ge, Ke-shi
    Su, Hua-you
    Li, Dong-sheng
    Lu, Xi-cheng
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2017, 18 (07) : 915 - 927
  • [28] Efficient parallel implementation of a density peaks clustering algorithm on graphics processing unit
    Ke-shi Ge
    Hua-you Su
    Dong-sheng Li
    Xi-cheng Lu
    Frontiers of Information Technology & Electronic Engineering, 2017, 18 : 915 - 927
  • [29] An efficient clustering algorithm
    Zhang, YF
    Mao, JL
    Xiong, ZY
    2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 261 - 265
  • [30] Energy Efficient Clustering Protocol for Large-Scale Sensor Networks
    Lin, Hai
    Wang, Lusheng
    Kong, Ruoshan
    IEEE SENSORS JOURNAL, 2015, 15 (12) : 7150 - 7160