Iterative Methods for Sparse Linear Systems on Graphics Processing Unit

被引：15

作者：

Ahamed, Abal-Kassim Cheik ^{[1
]}

Magoules, Frederic ^{[1
]}

机构：

[1] Ecole Cent Paris, Appl Math & Syst Lab, Chatenay Malabry, France

来源：

2012 IEEE 14TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2012 IEEE 9TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (HPCC-ICESS) | 2012年

关键词：

Krylov methods; linear algebra; sparse matrix-vector multiplication; graphics processing unit; CUDA; CUBLAS; CUSPARSE; Cusp; DOMAIN DECOMPOSITION METHODS;

D O I：

10.1109/HPCC.2012.118

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Many engineering and science problems require a computational effort to solve large sparse linear systems. Krylov subspace based iterative solvers have been widely used in that direction. Iterative Krylov methods involve linear algebra operations such as summation of vectors, dot product, norm, and matrix-vector multiplication. Since these operations could be very costly in computation time on Central Processing Unit (CPU), we propose in this paper to focus on the design of iterative solvers to take advantage of massive parallelism of Graphics Processing Unit (GPU). We consider Stabilized BiConjugate Gradient (BiCGStab), Stabilized BiConjugate Gradient (L) (BiCGStabl), Generalized Conjugate Residual (P-GCR), Bi-Conjugate Gradient Conjugate Residual (P-BiCGCR), transpose-free Quasi Minimal Residual (P-tfQMR) for the solution of sparse linear systems with non symmetric matrices and Conjugate Gradient (CG) for symmetric positive definite matrices. We discuss data format and data structure for sparse matrices, and how to efficiently implement these solvers on the Nvidia's CUDA platform. The scalability and performance of the methods are tested on several engineering problems, together with numerous numerical experiments which clearly illustrate the robustness, competitiveness and efficiency of our own proper implementation compared to the existing libraries.

引用

页码：836 / 842

页数：7

共 50 条

[21] Alinea: An Advanced Linear Algebra Library for Massively Parallel Computations on Graphics Processing Units
Magoules, Frederic
Ahamed, Abal-Kassim Cheik
INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2015, 29 (03) : 284 - 310
[22] Optimized Schwarz method without overlap for the gravitational potential equation on cluster of graphics processing unit
Magoules, Frederic
Ahamed, Abal-Kassim Cheik
Putanowicz, Roman
INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 2016, 93 (06) : 955 - 980
[23] Distribution System Optimization on Graphics Processing Unit
Roberge, Vincent
Tarbouchi, Mohammed
Okou, Francis A.
IEEE TRANSACTIONS ON SMART GRID, 2017, 8 (04) : 1689 - 1699
[24] Triangular Matrix Inversion on Graphics Processing Unit
Ries, Florian
De Marco, Tommaso
Zivieri, Matteo
Guerrieri, Roberto
PROCEEDINGS OF THE CONFERENCE ON HIGH PERFORMANCE COMPUTING NETWORKING, STORAGE AND ANALYSIS, 2009,
[25] Malware Detection via a Graphics Processing Unit
Kovach, Nicholas
Mullins, Barry
PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON INFORMATION WARFARE AND SECURITY, 2010, : 212 - 215
[26] Implementation of Multilayer Perceptron in Graphics Processing Unit
Nikolov, Ventsislav
ARTIFICIAL INTELLIGENCE: METHODOLOGY, SYSTEMS, AND APPLICATIONS, AIMSA 2018, 2018, 11089 : 270 - 274
[27] Firewall Engine based on Graphics Processing Unit
Sahoo, Abhaya Kumar
Das, Amardeep
Tiwary, Mayank
2014 INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION CONTROL AND COMPUTING TECHNOLOGIES (ICACCCT), 2014, : 758 - 763
[28] Accelerating the RTTOV-7 IASI and AMSU-A radiative transfer models on graphics processing units: evaluating central processing unit/graphics processing unit-hybrid and pure-graphics processing unit approaches
Mielikainen, Jarno
Huang, Bormin
Huang, Hung-Lung Allen
Saunders, Roger
JOURNAL OF APPLIED REMOTE SENSING, 2011, 5
[29] Parallel and distributed computing models on a graphics processing unit to accelerate simulation of membrane systems
Maroosi, Ali
Muniyandi, Ravie Chandren
Sundararajan, Elankovan
Zin, Abdullah Mohd
SIMULATION MODELLING PRACTICE AND THEORY, 2014, 47 : 60 - 78
[30] Pipelined Iterative Solvers with Kernel Fusion for Graphics Processing Units
Rupp, Karl
Weinbub, Josef
Juengel, Ansgar
Grasser, Tibor
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2016, 43 (02):

← 1 2 3 4 5 →