Iterative Methods for Sparse Linear Systems on Graphics Processing Unit

被引:15
|
作者
Ahamed, Abal-Kassim Cheik [1 ]
Magoules, Frederic [1 ]
机构
[1] Ecole Cent Paris, Appl Math & Syst Lab, Chatenay Malabry, France
来源
2012 IEEE 14TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2012 IEEE 9TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (HPCC-ICESS) | 2012年
关键词
Krylov methods; linear algebra; sparse matrix-vector multiplication; graphics processing unit; CUDA; CUBLAS; CUSPARSE; Cusp; DOMAIN DECOMPOSITION METHODS;
D O I
10.1109/HPCC.2012.118
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Many engineering and science problems require a computational effort to solve large sparse linear systems. Krylov subspace based iterative solvers have been widely used in that direction. Iterative Krylov methods involve linear algebra operations such as summation of vectors, dot product, norm, and matrix-vector multiplication. Since these operations could be very costly in computation time on Central Processing Unit (CPU), we propose in this paper to focus on the design of iterative solvers to take advantage of massive parallelism of Graphics Processing Unit (GPU). We consider Stabilized BiConjugate Gradient (BiCGStab), Stabilized BiConjugate Gradient (L) (BiCGStabl), Generalized Conjugate Residual (P-GCR), Bi-Conjugate Gradient Conjugate Residual (P-BiCGCR), transpose-free Quasi Minimal Residual (P-tfQMR) for the solution of sparse linear systems with non symmetric matrices and Conjugate Gradient (CG) for symmetric positive definite matrices. We discuss data format and data structure for sparse matrices, and how to efficiently implement these solvers on the Nvidia's CUDA platform. The scalability and performance of the methods are tested on several engineering problems, together with numerous numerical experiments which clearly illustrate the robustness, competitiveness and efficiency of our own proper implementation compared to the existing libraries.
引用
收藏
页码:836 / 842
页数:7
相关论文
共 50 条
  • [1] Auto-tuned Krylov methods on cluster of graphics processing unit
    Magoules, Frederic
    Ahamed, Abal-Kassim Cheik
    Putanowicz, Roman
    INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 2015, 92 (06) : 1222 - 1250
  • [2] Iterative Krylov Methods for Acoustic Problems on Graphics Processing Unit
    Ahamed, Abal-Kassim Cheik
    Magoules, Frederic
    PROCEEDINGS OF THIRTEENTH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS TO BUSINESS, ENGINEERING AND SCIENCE, (DCABES 2014), 2014, : 19 - 23
  • [3] Iterative Krylov Methods for Gravity problems on Graphics Processing Unit
    Ahamed, Abal-Kassim Cheik
    Magoules, Frederic
    2013 12TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS TO BUSINESS, ENGINEERING & SCIENCE (DCABES), 2013, : 16 - 20
  • [4] Fast and Green Computing with Graphics Processing Units for solving Sparse Linear Systems
    Ahamed, Abal-Kassim Cheik
    Desmaison, Alban
    Magoules, Frederic
    2014 IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2014 IEEE 6TH INTL SYMP ON CYBERSPACE SAFETY AND SECURITY, 2014 IEEE 11TH INTL CONF ON EMBEDDED SOFTWARE AND SYST (HPCC,CSS,ICESS), 2014, : 129 - 136
  • [5] Fast Sparse Matrix-Vector Multiplication on Graphics Processing Unit for Finite Element Analysis
    Ahamed, Abal-Kassim Cheik
    Magoules, Frederic
    2012 IEEE 14TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2012 IEEE 9TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (HPCC-ICESS), 2012, : 1307 - 1314
  • [6] Accelerating Sparse Linear Algebra Using Graphics Processing Units
    Spagnoli, Kyle E.
    Humphrey, John R.
    Price, Daniel K.
    Kelmelis, Eric J.
    MODELING AND SIMULATION FOR DEFENSE SYSTEMS AND APPLICATIONS VI, 2011, 8060
  • [7] Parallel Sub-Structuring Methods for solving Sparse Linear Systems on a cluster of GPU
    Ahamed, Abal-Kassim Cheik
    Magoules, Frederic
    2014 IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2014 IEEE 6TH INTL SYMP ON CYBERSPACE SAFETY AND SECURITY, 2014 IEEE 11TH INTL CONF ON EMBEDDED SOFTWARE AND SYST (HPCC,CSS,ICESS), 2014, : 121 - 128
  • [8] Iterative sparse matrix-vector multiplication for accelerating the block Wiedemann algorithm over GF(2) on multi-graphics processing unit systems
    Schmidt, Bertil
    Aribowo, Hans
    Hoang-Vu Dang
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2013, 25 (04) : 586 - 603
  • [9] Probabilistic Iterative Methods for Linear Systems
    Cockayne, Jon
    Ipsen, Ilse C. F.
    Oates, Chris J.
    Reid, Tim W.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 22
  • [10] Towards using the Graphics Processing Unit (GPU) for Embedded Systems
    Hallmans, Daniel
    Asberg, Mikael
    Nolte, Thomas
    2012 IEEE 17TH CONFERENCE ON EMERGING TECHNOLOGIES & FACTORY AUTOMATION (ETFA), 2012,