Iterative Methods for Sparse Linear Systems on Graphics Processing Unit

被引:15
|
作者
Ahamed, Abal-Kassim Cheik [1 ]
Magoules, Frederic [1 ]
机构
[1] Ecole Cent Paris, Appl Math & Syst Lab, Chatenay Malabry, France
来源
2012 IEEE 14TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2012 IEEE 9TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (HPCC-ICESS) | 2012年
关键词
Krylov methods; linear algebra; sparse matrix-vector multiplication; graphics processing unit; CUDA; CUBLAS; CUSPARSE; Cusp; DOMAIN DECOMPOSITION METHODS;
D O I
10.1109/HPCC.2012.118
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Many engineering and science problems require a computational effort to solve large sparse linear systems. Krylov subspace based iterative solvers have been widely used in that direction. Iterative Krylov methods involve linear algebra operations such as summation of vectors, dot product, norm, and matrix-vector multiplication. Since these operations could be very costly in computation time on Central Processing Unit (CPU), we propose in this paper to focus on the design of iterative solvers to take advantage of massive parallelism of Graphics Processing Unit (GPU). We consider Stabilized BiConjugate Gradient (BiCGStab), Stabilized BiConjugate Gradient (L) (BiCGStabl), Generalized Conjugate Residual (P-GCR), Bi-Conjugate Gradient Conjugate Residual (P-BiCGCR), transpose-free Quasi Minimal Residual (P-tfQMR) for the solution of sparse linear systems with non symmetric matrices and Conjugate Gradient (CG) for symmetric positive definite matrices. We discuss data format and data structure for sparse matrices, and how to efficiently implement these solvers on the Nvidia's CUDA platform. The scalability and performance of the methods are tested on several engineering problems, together with numerous numerical experiments which clearly illustrate the robustness, competitiveness and efficiency of our own proper implementation compared to the existing libraries.
引用
收藏
页码:836 / 842
页数:7
相关论文
共 50 条
  • [21] Alinea: An Advanced Linear Algebra Library for Massively Parallel Computations on Graphics Processing Units
    Magoules, Frederic
    Ahamed, Abal-Kassim Cheik
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2015, 29 (03) : 284 - 310
  • [22] Optimized Schwarz method without overlap for the gravitational potential equation on cluster of graphics processing unit
    Magoules, Frederic
    Ahamed, Abal-Kassim Cheik
    Putanowicz, Roman
    INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 2016, 93 (06) : 955 - 980
  • [23] Distribution System Optimization on Graphics Processing Unit
    Roberge, Vincent
    Tarbouchi, Mohammed
    Okou, Francis A.
    IEEE TRANSACTIONS ON SMART GRID, 2017, 8 (04) : 1689 - 1699
  • [24] Triangular Matrix Inversion on Graphics Processing Unit
    Ries, Florian
    De Marco, Tommaso
    Zivieri, Matteo
    Guerrieri, Roberto
    PROCEEDINGS OF THE CONFERENCE ON HIGH PERFORMANCE COMPUTING NETWORKING, STORAGE AND ANALYSIS, 2009,
  • [25] Malware Detection via a Graphics Processing Unit
    Kovach, Nicholas
    Mullins, Barry
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON INFORMATION WARFARE AND SECURITY, 2010, : 212 - 215
  • [26] Implementation of Multilayer Perceptron in Graphics Processing Unit
    Nikolov, Ventsislav
    ARTIFICIAL INTELLIGENCE: METHODOLOGY, SYSTEMS, AND APPLICATIONS, AIMSA 2018, 2018, 11089 : 270 - 274
  • [27] Firewall Engine based on Graphics Processing Unit
    Sahoo, Abhaya Kumar
    Das, Amardeep
    Tiwary, Mayank
    2014 INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION CONTROL AND COMPUTING TECHNOLOGIES (ICACCCT), 2014, : 758 - 763
  • [28] Accelerating the RTTOV-7 IASI and AMSU-A radiative transfer models on graphics processing units: evaluating central processing unit/graphics processing unit-hybrid and pure-graphics processing unit approaches
    Mielikainen, Jarno
    Huang, Bormin
    Huang, Hung-Lung Allen
    Saunders, Roger
    JOURNAL OF APPLIED REMOTE SENSING, 2011, 5
  • [29] Parallel and distributed computing models on a graphics processing unit to accelerate simulation of membrane systems
    Maroosi, Ali
    Muniyandi, Ravie Chandren
    Sundararajan, Elankovan
    Zin, Abdullah Mohd
    SIMULATION MODELLING PRACTICE AND THEORY, 2014, 47 : 60 - 78
  • [30] Pipelined Iterative Solvers with Kernel Fusion for Graphics Processing Units
    Rupp, Karl
    Weinbub, Josef
    Juengel, Ansgar
    Grasser, Tibor
    ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2016, 43 (02):