Parallel wavelet-based clustering algorithm on GPUs using CUDA

被引:5
作者
Yildirim, Ahmet Artu [1 ]
Ozdogan, Cem [1 ]
机构
[1] Cankaya Univ, Dept Comp Engn, TR-06530 Ankara, Turkey
来源
WORLD CONFERENCE ON INFORMATION TECHNOLOGY (WCIT-2010) | 2011年 / 3卷
关键词
GPU computing; CUDA; cluster analysis; WaveCluster algorithm; GRAPHICS;
D O I
10.1016/j.procs.2010.12.066
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
There has been a substantial interest in scientific and engineering computing community to speed up the CPU-intensive tasks on graphical processing units (GPUs) with the development of many-core GPUs as having very large memory bandwidth and computational power. Cluster analysis is a widely used technique for grouping a set of objects into classes of "similar" objects and commonly used in many fields such as data mining, bioinformatics and pattern recognition. WaveCluster defines the notion of cluster as a dense region consisting of connected components in the transformed feature space. In this study, we present the implementation of WaveCluster algorithm as a novel clustering approach based on wavelet transform to GPU level parallelization and investigate the parallel performance for very large spatial datasets. The CUDA implementations of two main sub-algorithms of WaveCluster approach; namely extraction of low-frequency component from the signal using wavelet transform and connected component labeling are presented. Then, the corresponding performance evaluations are reported for each sub-algorithm. Divide and conquer approach is followed on the implementation of wavelet transform and multi-pass sliding window approach on the implementation of connected component labeling. The maximum achieved speedup is found in kernel as 107x in the computation of extraction of the low-frequency component and 6x in the computation of connected component labeling with respect to the sequential algorithms running on the CPU. (C) 2010 Published by Elsevier Ltd. Selection and/or peer-review under responsibility of the Guest Editor.
引用
收藏
页数:5
相关论文
共 50 条
[21]   Parallel gravitation field algorithm based on the CUDA platform [J].
Rong, Guang ;
Liu, Guixia ;
Zheng, Ming ;
Sun, An ;
Tian, Yuan ;
Wang, Han .
Journal of Information and Computational Science, 2013, 10 (12) :3635-3644
[22]   Mapping the Simulated Annealing Algorithm onto CUDA GPUs [J].
Wei, Kai-Cheng ;
Wu, Chao-Chin ;
Yu, Hui-Liang .
2015 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND KNOWLEDGE ENGINEERING (ISKE), 2015, :358-365
[23]   Parallel Algorithm of IDCT with GPUs and CUDA for Large-scale Video Quality of 3G [J].
Chen, Qingkui ;
Wang, Haifeng ;
Zhuang, Songlin ;
Liu, Bocheng .
JOURNAL OF COMPUTERS, 2012, 7 (08) :1880-1886
[24]   Accelerating the Red/Black SOR Method Using GPUs with CUDA [J].
Konstantinidis, Elias ;
Cotronis, Yiannis .
PARALLEL PROCESSING AND APPLIED MATHEMATICS, PT I, 2012, 7203 :589-598
[25]   An Efficient Parallel ISODATA Algorithm Based on Kepler GPUs [J].
Yang, Shiquan ;
Dong, Jianqiang ;
Yuan, Bo .
PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, :2444-2449
[26]   Accelerating geospatial analysis on GPUs using CUDA [J].
Ying-jie XIA 1 .
Frontiers of Information Technology & Electronic Engineering, 2011, (12) :990-999
[27]   Accelerating geospatial analysis on GPUs using CUDA [J].
Xia, Ying-jie ;
Kuang, Li ;
Li, Xiu-mei .
JOURNAL OF ZHEJIANG UNIVERSITY-SCIENCE C-COMPUTERS & ELECTRONICS, 2011, 12 (12) :990-999
[28]   Parallel Back-Propagation Neural Network Training Technique Using CUDA on Multiple GPUs [J].
Zhang, Shunlu ;
Gunupudi, Pavan ;
Zhang, Qi-Jun .
2015 IEEE MTT-S INTERNATIONAL CONFERENCE ON NUMERICAL ELECTROMAGNETIC AND MULTIPHYSICS MODELING AND OPTIMIZATION (NEMO), 2015,
[29]   Data Parallel density-based genetic clustering on CUDA Architecture [J].
Kroemer, Pavel ;
Platos, Jan ;
Snasel, Vaclav .
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2014, 26 (05) :1097-1112
[30]   Accelerating geospatial analysis on GPUs using CUDA [J].
Ying-jie Xia ;
Li Kuang ;
Xiu-mei Li .
Journal of Zhejiang University SCIENCE C, 2011, 12 :990-999