The BiConjugate gradient method on GPUs

被引：12

作者：

Ortega, G. ^{[1
]}

Garzon, E. M. ^{[1
]}

Vazquez, F. ^{[1
]}

Garcia, I. ^{[2
]}

机构：

[1] Univ Almeria, Dpt Comput Archit & Electron, Almeria 04120, Spain

[2] Univ Malaga, Dpt Comput Archit, E-29071 Malaga, Spain

来源：

JOURNAL OF SUPERCOMPUTING | 2013年 / 64卷 / 01期

关键词：

BiConjugate gradient method; GPU computing; Parallel computing; Linear system of equations;

D O I：

10.1007/s11227-012-0761-2

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

In a wide variety of applications from different scientific and engineering fields, the solution of complex and/or nonsymmetric linear systems of equations is required. To solve this kind of linear systems the BiConjugate Gradient method (BCG) is especially relevant. Nevertheless, BCG has a enormous computational cost. GPU computing is useful for accelerating this kind of algorithms but it is necessary to develop suitable implementations to optimally exploit the GPU architecture. In this paper, we show how BCG can be effectively accelerated when all operations are computed on a GPU. So, BCG has been implemented with two alternative routines of the Sparse Matrix Vector product (SpMV): the CUSPARSE library and the ELLR-T routine. Although our interest is focused on complex matrices, our implementation has been evaluated on a GPU for two sets of test matrices: complex and real, in single and double precision data. Experimental results show that BCG based on ELLR-T routine achieves the best performance, particularly for the set of complex test matrices. Consequently, this method can be useful as a tool to efficiently solve large linear system of equations (complex and/or nonsymmetric) involved in a broad range of applications.

引用

页码：49 / 58

页数：10

共 50 条

[31] Scalable Prototype Learning Using GPUs
Su, Tonghua
Li, Songze
Ma, Peijun
Deng, Shengchun
Liang, Guangsheng
IMAGE ANALYSIS AND RECOGNITION, ICIAR 2014, PT I, 2014, 8814 : 309 - 319
[32] Solving the Examination Timetabling Problem in GPUs
Kolonias, Vasileios
Goulas, George
Gogos, Christos
Alefragis, Panayiotis
Housos, Efthymios
ALGORITHMS, 2014, 7 (03) : 295 - 327
[33] Evaluation of Autoparallelization Toolkits for Commodity GPUs
Williams, David
Codreanu, Valeriu
Yang, Po
Liu, Baoquan
Dong, Feng
Yasar, Burhan
Mahdian, Babak
Chiarini, Alessandro
Zhao, Xia
Roerdink, Jos B. T. M.
PARALLEL PROCESSING AND APPLIED MATHEMATICS (PPAM 2013), PT I, 2014, 8384 : 447 - 457
[34] Accelerating advanced MRI reconstructions on GPUs
Stone, S. S.
Haldar, J. P.
Tsao, S. C.
Hwu, W. -m. W.
Sutton, B. P.
Liang, Z. -P.
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2008, 68 (10) : 1307 - 1318
[35] Massively Parallel Network Coding on GPUs
Chu, Xiaowen
Zhao, Kaiyong
Wang, Mea
2008 IEEE INTERNATIONAL PERFORMANCE, COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC 2008), 2008, : 144 - 151
[36] Scalable Energy Games Solvers on GPUs
Formisano, Andrea
Gentilini, Raffaella
Vella, Flavio
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 32 (12) : 2970 - 2982
[37] Automatic code generation for GPUs in llc
Ruyman Reyes
Francisco de Sande
The Journal of Supercomputing, 2011, 58 : 349 - 356
[38] Global magnetohydrodynamic simulations on multiple GPUs
Wong, Un-Hong
Wong, Hon-Cheng
Ma, Yonghui
COMPUTER PHYSICS COMMUNICATIONS, 2014, 185 (01) : 144 - 152
[39] A delayed weighted gradient method for strictly convex quadratic minimization
Harry Fernando Oviedo Leon
Computational Optimization and Applications, 2019, 74 : 729 - 746
[40] A delayed weighted gradient method for strictly convex quadratic minimization
Oviedo Leon, Harry Fernando
COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2019, 74 (03) : 729 - 746

← 1 2 3 4 5 →