Iterative Methods for Sparse Linear Systems on Graphics Processing Unit

被引:15
作者
Ahamed, Abal-Kassim Cheik [1 ]
Magoules, Frederic [1 ]
机构
[1] Ecole Cent Paris, Appl Math & Syst Lab, Chatenay Malabry, France
来源
2012 IEEE 14TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2012 IEEE 9TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (HPCC-ICESS) | 2012年
关键词
Krylov methods; linear algebra; sparse matrix-vector multiplication; graphics processing unit; CUDA; CUBLAS; CUSPARSE; Cusp; DOMAIN DECOMPOSITION METHODS;
D O I
10.1109/HPCC.2012.118
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Many engineering and science problems require a computational effort to solve large sparse linear systems. Krylov subspace based iterative solvers have been widely used in that direction. Iterative Krylov methods involve linear algebra operations such as summation of vectors, dot product, norm, and matrix-vector multiplication. Since these operations could be very costly in computation time on Central Processing Unit (CPU), we propose in this paper to focus on the design of iterative solvers to take advantage of massive parallelism of Graphics Processing Unit (GPU). We consider Stabilized BiConjugate Gradient (BiCGStab), Stabilized BiConjugate Gradient (L) (BiCGStabl), Generalized Conjugate Residual (P-GCR), Bi-Conjugate Gradient Conjugate Residual (P-BiCGCR), transpose-free Quasi Minimal Residual (P-tfQMR) for the solution of sparse linear systems with non symmetric matrices and Conjugate Gradient (CG) for symmetric positive definite matrices. We discuss data format and data structure for sparse matrices, and how to efficiently implement these solvers on the Nvidia's CUDA platform. The scalability and performance of the methods are tested on several engineering problems, together with numerous numerical experiments which clearly illustrate the robustness, competitiveness and efficiency of our own proper implementation compared to the existing libraries.
引用
收藏
页码:836 / 842
页数:7
相关论文
共 50 条
  • [31] Pipelined Iterative Solvers with Kernel Fusion for Graphics Processing Units
    Rupp, Karl
    Weinbub, Josef
    Juengel, Ansgar
    Grasser, Tibor
    ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2016, 43 (02):
  • [32] Parallelization of heterogeneous reactor calculations on a graphics processing unit
    Malofeev, V. M.
    Pal'shin, V. A.
    PHYSICS OF ATOMIC NUCLEI, 2016, 79 (08) : 1246 - 1251
  • [33] General purpose computing of graphics processing unit: A survey
    Wang, Hai-Feng
    Chen, Qing-Kui
    Jisuanji Xuebao/Chinese Journal of Computers, 2013, 36 (04): : 757 - 772
  • [34] PSIM: A Modular Particle System on Graphics Processing Unit
    Aquotte, F. A.
    da Silva, A. F.
    IEEE LATIN AMERICA TRANSACTIONS, 2014, 12 (02) : 321 - 329
  • [35] GRAPHICS PROCESSING UNIT BASED COMPUTATION FOR NDE APPLICATIONS
    Nahas, C. A.
    Rajagopal, Prabhu
    Balasubramaniam, Krishnan
    Krishnamurthy, C. V.
    REVIEW OF PROGRESS IN QUANTITATIVE NONDESTRUCTIVE EVALUATION, VOLS 31A AND 31B, 2012, 1430 : 1998 - 2005
  • [36] Collaborative Parallel Hybrid Metaheuristics on Graphics Processing Unit
    Roberge, Vincent
    Tarbouchi, Mohammed
    Okou, Francis
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2015, 14 (01)
  • [37] Parallelization of heterogeneous reactor calculations on a graphics processing unit
    V. M. Malofeev
    V. A. Pal’shin
    Physics of Atomic Nuclei, 2016, 79 : 1246 - 1251
  • [38] A grand spread estimator using a graphics processing unit
    Shin, Seon-Ho
    Im, Eun-Jin
    Yoon, MyungKeun
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2014, 74 (02) : 2039 - 2047
  • [39] Parallel Implementation of Segmentation Algorithms on Graphics Processing Unit
    Yenialp, Erdal
    Kalkan, Habil
    2013 21ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2013,
  • [40] Efficient Optical Character Recognition on Graphics Processing Unit
    Arianyan, Ehsan
    Motamedi, Seyed Ahmad
    Arianyan, Iman
    2012 SIXTH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), 2012, : 789 - 793