A Fast Parallel Selection Algorithm on GPUs

Cited by: 5
Authors
Bakunas-Milanowski, Darius [1]
Rego, Vernon [2]
Sang, Janche [1]
Yu, Chansu [1]
Affiliations
[1] Cleveland State Univ, Dept Elect Engn & Comp Sci, Cleveland, OH 44115 USA
[2] Purdue Univ, Dept Comp Sci, W Lafayette, IN USA
Keywords
Parallel Selection; CUDA Thrust Library; GPU; Optimization Techniques; SIMT
DOI
10.1109/CSCI.2015.132
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Today, parallel selection algorithms that run on Graphics Processing Units (GPUs) hold great promise in providing even more computational power than that of conventional CPUs. To quantify these gains, we examined a new parallel selection algorithm to see exactly what the GPU's vast number of simple, data-parallel, multithreaded cores meant for performance times, using the current generation of NVIDIA GPUs. Specifically, our team tested how we could utilize a GPU to select the elements of a massive array that met specific criteria and store their indices in a target array for additional processing. In this paper, we report the optimization techniques used and the roadblocks encountered. Overall, the experimental results demonstrate that our implementation performs an average of 3.67 times faster than Thrust, an open-source parallel algorithms library.
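The abstract does not describe the authors' implementation, so the following is only a sequential C++ sketch of the task it states: pick the elements of an array that satisfy a predicate and write their indices to a target array. It uses the common flag, exclusive-scan, scatter pattern that parallel stream compaction is usually built on; that this resembles the paper's algorithm is an assumption, and the name `select_indices` is hypothetical.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper (not from the paper): returns, in ascending order,
// the indices i for which pred(data[i]) is true.
template <typename T, typename Pred>
std::vector<std::uint32_t> select_indices(const std::vector<T>& data, Pred pred) {
    const std::size_t n = data.size();

    // Step 1: flag each element (1 if selected, 0 otherwise).
    // On a GPU, each thread would compute one flag independently.
    std::vector<std::uint32_t> flags(n);
    for (std::size_t i = 0; i < n; ++i) {
        flags[i] = pred(data[i]) ? 1u : 0u;
    }

    // Step 2: exclusive prefix sum of the flags. offsets[i] is the output
    // slot of element i if it is selected. Written as a plain loop here;
    // a GPU version would use a parallel scan.
    std::vector<std::uint32_t> offsets(n);
    std::uint32_t running = 0;
    for (std::size_t i = 0; i < n; ++i) {
        offsets[i] = running;
        running += flags[i];
    }

    // Step 3: scatter the indices of selected elements into the target array.
    // After the loop above, `running` equals the number of selected elements.
    std::vector<std::uint32_t> target(running);
    for (std::size_t i = 0; i < n; ++i) {
        if (flags[i] != 0u) {
            target[offsets[i]] = static_cast<std::uint32_t>(i);
        }
    }
    return target;
}
```

For example, calling `select_indices` on `{5, -2, 7, 0, 9, -1}` with the predicate `x > 0` yields the indices `{0, 2, 4}`. Steps 1 and 3 are independent per element, which is why this pattern maps naturally onto data-parallel hardware; the scan in step 2 is the part that needs a dedicated parallel primitive.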
Pages: 609-614
Page count: 6