A Fast Parallel Selection Algorithm on GPUs

Cited by: 5
Authors
Bakunas-Milanowski, Darius [1]
Rego, Vernon [2]
Sang, Janche [1]
Yu, Chansu [1]
Affiliations
[1] Cleveland State Univ, Dept Elect Engn & Comp Sci, Cleveland, OH 44115 USA
[2] Purdue Univ, Dept Comp Sci, W Lafayette, IN USA
Keywords
Parallel Selection; CUDA Thrust Library; GPU; Optimization Techniques; SIMT;
DOI
10.1109/CSCI.2015.132
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Parallel selection algorithms that run on Graphics Processing Units (GPUs) promise more computational power than conventional CPUs can provide. To quantify these gains, we examined a new parallel selection algorithm to see what the GPU's large number of simple, data-parallel, multithreaded cores means for run time, using the current generation of NVIDIA GPUs. Specifically, we tested how a GPU can be used to select the elements of a massive array that meet specific criteria and store their indices in a target array for additional processing. In this paper, we report the optimization techniques we applied and the roadblocks we encountered. Overall, the experimental results demonstrate that our implementation runs an average of 3.67 times faster than Thrust, an open-source parallel algorithms library.
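The abstract does not spell out the kernel design, so the following is only a minimal sketch of the selection task it describes (find the elements that meet a criterion and store their indices in a target array), written as the generic flag, exclusive-scan, scatter pattern often used for this kind of filtering. It is not the authors' algorithm, it runs sequentially on the CPU, and the names `select_indices`, `data`, and `predicate` are hypothetical. In Thrust, comparable filtering is available through `thrust::copy_if`.

```python
from itertools import accumulate
from typing import Callable, List, Sequence


def select_indices(data: Sequence[int],
                   predicate: Callable[[int], bool]) -> List[int]:
    """Return the indices of the elements that satisfy `predicate`, in order.

    Sequential sketch of flag / exclusive-scan / scatter selection.
    On a GPU, each of the three steps would be a data-parallel pass;
    here they are ordinary loops (an assumption made for illustration).
    """
    # Step 1 (map): flag every element, 1 if selected, 0 otherwise.
    flags = [1 if predicate(x) else 0 for x in data]

    # Step 2 (scan): the inclusive prefix sum minus the flag itself gives
    # the exclusive prefix sum, i.e. the output slot for each element.
    inclusive = list(accumulate(flags))
    offsets = [total - flag for total, flag in zip(inclusive, flags)]

    # The last inclusive sum is the number of selected elements.
    count = inclusive[-1] if inclusive else 0

    # Step 3 (scatter): each selected element writes its own index into
    # its slot, so no two writes ever target the same position.
    target = [0] * count
    for i, flag in enumerate(flags):
        if flag:
            target[offsets[i]] = i
    return target


if __name__ == "__main__":
    data = [7, 42, 3, 99, 15, 64]
    # Select the positions of all values greater than 40.
    print(select_indices(data, lambda x: x > 40))  # prints [1, 3, 5]
```

Because every selected element gets a unique output slot from the scan, the scatter step needs no synchronization between writers, which is why this pattern maps naturally onto many simple parallel threads.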
Pages: 609-614
Page count: 6