The BiConjugate gradient method on GPUs

被引:12
作者
Ortega, G. [1 ]
Garzon, E. M. [1 ]
Vazquez, F. [1 ]
Garcia, I. [2 ]
机构
[1] Univ Almeria, Dpt Comput Archit & Electron, Almeria 04120, Spain
[2] Univ Malaga, Dpt Comput Archit, E-29071 Malaga, Spain
关键词
BiConjugate gradient method; GPU computing; Parallel computing; Linear system of equations;
D O I
10.1007/s11227-012-0761-2
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In a wide variety of applications from different scientific and engineering fields, the solution of complex and/or nonsymmetric linear systems of equations is required. To solve this kind of linear systems the BiConjugate Gradient method (BCG) is especially relevant. Nevertheless, BCG has a enormous computational cost. GPU computing is useful for accelerating this kind of algorithms but it is necessary to develop suitable implementations to optimally exploit the GPU architecture. In this paper, we show how BCG can be effectively accelerated when all operations are computed on a GPU. So, BCG has been implemented with two alternative routines of the Sparse Matrix Vector product (SpMV): the CUSPARSE library and the ELLR-T routine. Although our interest is focused on complex matrices, our implementation has been evaluated on a GPU for two sets of test matrices: complex and real, in single and double precision data. Experimental results show that BCG based on ELLR-T routine achieves the best performance, particularly for the set of complex test matrices. Consequently, this method can be useful as a tool to efficiently solve large linear system of equations (complex and/or nonsymmetric) involved in a broad range of applications.
引用
收藏
页码:49 / 58
页数:10
相关论文
共 50 条
  • [31] Scalable Prototype Learning Using GPUs
    Su, Tonghua
    Li, Songze
    Ma, Peijun
    Deng, Shengchun
    Liang, Guangsheng
    IMAGE ANALYSIS AND RECOGNITION, ICIAR 2014, PT I, 2014, 8814 : 309 - 319
  • [32] Solving the Examination Timetabling Problem in GPUs
    Kolonias, Vasileios
    Goulas, George
    Gogos, Christos
    Alefragis, Panayiotis
    Housos, Efthymios
    ALGORITHMS, 2014, 7 (03) : 295 - 327
  • [33] Evaluation of Autoparallelization Toolkits for Commodity GPUs
    Williams, David
    Codreanu, Valeriu
    Yang, Po
    Liu, Baoquan
    Dong, Feng
    Yasar, Burhan
    Mahdian, Babak
    Chiarini, Alessandro
    Zhao, Xia
    Roerdink, Jos B. T. M.
    PARALLEL PROCESSING AND APPLIED MATHEMATICS (PPAM 2013), PT I, 2014, 8384 : 447 - 457
  • [34] Accelerating advanced MRI reconstructions on GPUs
    Stone, S. S.
    Haldar, J. P.
    Tsao, S. C.
    Hwu, W. -m. W.
    Sutton, B. P.
    Liang, Z. -P.
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2008, 68 (10) : 1307 - 1318
  • [35] Massively Parallel Network Coding on GPUs
    Chu, Xiaowen
    Zhao, Kaiyong
    Wang, Mea
    2008 IEEE INTERNATIONAL PERFORMANCE, COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC 2008), 2008, : 144 - 151
  • [36] Scalable Energy Games Solvers on GPUs
    Formisano, Andrea
    Gentilini, Raffaella
    Vella, Flavio
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 32 (12) : 2970 - 2982
  • [37] Automatic code generation for GPUs in llc
    Ruyman Reyes
    Francisco de Sande
    The Journal of Supercomputing, 2011, 58 : 349 - 356
  • [38] Global magnetohydrodynamic simulations on multiple GPUs
    Wong, Un-Hong
    Wong, Hon-Cheng
    Ma, Yonghui
    COMPUTER PHYSICS COMMUNICATIONS, 2014, 185 (01) : 144 - 152
  • [39] A delayed weighted gradient method for strictly convex quadratic minimization
    Harry Fernando Oviedo Leon
    Computational Optimization and Applications, 2019, 74 : 729 - 746
  • [40] A delayed weighted gradient method for strictly convex quadratic minimization
    Oviedo Leon, Harry Fernando
    COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2019, 74 (03) : 729 - 746