The BiConjugate gradient method on GPUs

被引:12
作者
Ortega, G. [1 ]
Garzon, E. M. [1 ]
Vazquez, F. [1 ]
Garcia, I. [2 ]
机构
[1] Univ Almeria, Dpt Comput Archit & Electron, Almeria 04120, Spain
[2] Univ Malaga, Dpt Comput Archit, E-29071 Malaga, Spain
关键词
BiConjugate gradient method; GPU computing; Parallel computing; Linear system of equations;
D O I
10.1007/s11227-012-0761-2
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In a wide variety of applications from different scientific and engineering fields, the solution of complex and/or nonsymmetric linear systems of equations is required. To solve this kind of linear systems the BiConjugate Gradient method (BCG) is especially relevant. Nevertheless, BCG has a enormous computational cost. GPU computing is useful for accelerating this kind of algorithms but it is necessary to develop suitable implementations to optimally exploit the GPU architecture. In this paper, we show how BCG can be effectively accelerated when all operations are computed on a GPU. So, BCG has been implemented with two alternative routines of the Sparse Matrix Vector product (SpMV): the CUSPARSE library and the ELLR-T routine. Although our interest is focused on complex matrices, our implementation has been evaluated on a GPU for two sets of test matrices: complex and real, in single and double precision data. Experimental results show that BCG based on ELLR-T routine achieves the best performance, particularly for the set of complex test matrices. Consequently, this method can be useful as a tool to efficiently solve large linear system of equations (complex and/or nonsymmetric) involved in a broad range of applications.
引用
收藏
页码:49 / 58
页数:10
相关论文
共 50 条
  • [1] The BiConjugate gradient method on GPUs
    G. Ortega
    E. M. Garzón
    F. Vázquez
    I. García
    The Journal of Supercomputing, 2013, 64 : 49 - 58
  • [2] Parallel Power Flow Solutions Using a Biconjugate Gradient Algorithm and a Newton Method: a GPU-Based Approach
    Garcia, Norberto
    IEEE POWER AND ENERGY SOCIETY GENERAL MEETING 2010, 2010,
  • [3] Performance analysis of the high-performance conjugate gradient benchmark on GPUs
    Phillips, Everett
    Fatica, Massimiliano
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2016, 30 (01) : 28 - 38
  • [4] CuMF_SGD: Parallelized Stochastic Gradient Descent for Matrix Factorization on GPUs
    Xie, Xiaolong
    Tan, Wei
    Fong, Liana L.
    Liang, Yun
    HPDC'17: PROCEEDINGS OF THE 26TH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE PARALLEL AND DISTRIBUTED COMPUTING, 2017, : 79 - 92
  • [5] Accelerating the Red/Black SOR Method Using GPUs with CUDA
    Konstantinidis, Elias
    Cotronis, Yiannis
    PARALLEL PROCESSING AND APPLIED MATHEMATICS, PT I, 2012, 7203 : 589 - 598
  • [6] PORTING AN AGGREGATION-BASED ALGEBRAIC MULTIGRID METHOD TO GPUS
    Abdeselam A.E.H.
    Napov A.
    Notay Y.
    Electronic Transactions on Numerical Analysis, 2022, 55 : 687 - 705
  • [7] Large scale parallelisation of the material point method with multiple GPUs
    Dong, Youkou
    Grabe, Juergen
    COMPUTERS AND GEOTECHNICS, 2018, 101 : 149 - 158
  • [8] PROJECTION Algorithm for Motif Finding on GPUs
    Clemente, Jhoirene B.
    Cabarle, Francis George C.
    Adorna, Henry N.
    THEORY AND PRACTICE OF COMPUTATION, 2012, 5 : 101 - 115
  • [9] A parallel lattice Boltzmann method for large eddy simulation on multiple GPUs
    Qinjian Li
    Chengwen Zhong
    Kai Li
    Guangyong Zhang
    Xiaowei Lu
    Qing Zhang
    Kaiyong Zhao
    Xiaowen Chu
    Computing, 2014, 96 : 479 - 501
  • [10] FMM-based vortex method for simulation of isotropic turbulence on GPUs, compared with a spectral method
    Yokota, Rio
    Barba, L. A.
    COMPUTERS & FLUIDS, 2013, 80 : 17 - 27