Conjugate gradient method with graphics processing unit acceleration: CUDA vs OpenCL

被引:12
作者
Ahamed, Abal-Kassim Cheik [1 ,2 ]
Magoules, Frederic [1 ]
机构
[1] Univ Paris Saclay, Cent Supelec, F-92290 Chatenay Malabry, France
[2] Univ Paris 13, Sorbonne Paris Cite, F-93430 Villetaneuse, France
关键词
Conjugate gradient method; Iterative method; Linear algebra; Sparse matrix-vector product; Parallel computing; GPU; CUDA; OpenCL; DOMAIN DECOMPOSITION METHODS;
D O I
10.1016/j.advengsoft.2016.10.002
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Performance computations depend on the machine architecture, the operating system, the problem studied and obviously on the programming implementation. Solving partial differential equations by numerical methods such as the finite element method requires the solution of large sparse linear systems. Graphics processing unit (GPU) is now commonly used to accelerate numerical simulations and most supercomputers provide large number of GPUs to their users. This paper proposes a comparison of both CUDA and OpenCL GPU languages to take the highest performance of multi-GPUs clusters. We analyse, evaluate and compare their respective performances for computing linear algebra operations and for solving large sparse linear systems with the conjugate gradient iterative method on multi-GPUs clusters. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:32 / 42
页数:11
相关论文
共 40 条
[1]   Energy Consumption Analysis on Graphics Processing Units [J].
Ahamed, Abal-Kassim Cheik ;
Magoules, Frederic .
PROCEEDINGS OF THIRTEENTH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS TO BUSINESS, ENGINEERING AND SCIENCE, (DCABES 2014), 2014, :46-50
[2]   Iterative Krylov Methods for Acoustic Problems on Graphics Processing Unit [J].
Ahamed, Abal-Kassim Cheik ;
Magoules, Frederic .
PROCEEDINGS OF THIRTEENTH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS TO BUSINESS, ENGINEERING AND SCIENCE, (DCABES 2014), 2014, :19-23
[3]   Iterative Krylov Methods for Gravity problems on Graphics Processing Unit [J].
Ahamed, Abal-Kassim Cheik ;
Magoules, Frederic .
2013 12TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS TO BUSINESS, ENGINEERING & SCIENCE (DCABES), 2013, :16-20
[4]   Schwarz Method with Two-sided Transmission Conditions for the Gravity Equations on Graphics Processing Unit [J].
Ahamed, Abal-Kassim Cheik ;
Magoules, Frederic .
2013 12TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS TO BUSINESS, ENGINEERING & SCIENCE (DCABES), 2013, :105-109
[5]   Iterative Methods for Sparse Linear Systems on Graphics Processing Unit [J].
Ahamed, Abal-Kassim Cheik ;
Magoules, Frederic .
2012 IEEE 14TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2012 IEEE 9TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (HPCC-ICESS), 2012, :836-842
[6]  
[Anonymous], IEEE754
[7]  
[Anonymous], 2008, NVIDIA Technical Report NVR-2008-004
[8]  
[Anonymous], DIFUSION CIENTIFICA
[9]  
[Anonymous], 2003, ITERATIVE METHODS SP, DOI DOI 10.1137/1.9780898718003
[10]  
Bell N, 2009, STUDENTS GUIDE TO THE MA TESOL, P1