Accelerating COBAYA3 on multi-core CPU and GPU systems using PARALUTION

被引:5
|
作者
Trost, Nico [1 ]
Jimenez, Javier [1 ]
Lukarski, Dimitar [2 ]
Sanchez, Victor [1 ]
机构
[1] Karlsruhe Inst Technol, Inst Neutron Phys & Reactor Technol, D-76344 Eggenstein Leopoldshafen, Germany
[2] Uppsala Univ, Dept Informat Technol, Div Comp Sci, S-75237 Uppsala, Sweden
关键词
COBAYA3; PARALUTION; Acceleration; Parallelization; Multi-core; GPUs;
D O I
10.1016/j.anucene.2014.08.005
中图分类号
TL [原子能技术]; O571 [原子核物理学];
学科分类号
0827 ; 082701 ;
摘要
COBAYA3 is a multi-physics system of codes which includes two 3D multi-group neutron diffusion codes, ANDES and COBAYA3-PBP, coupled with COBRA-TF, COBRA-IIIc and SUBCHANFLOW sub-channel thermal-hydraulic codes, for the simulation of LWR core transients. The 3D multi-group neutron diffusion equations are expressed in terms of a sparse linear system which can be solved using different iterative Krylov subspace solvers. The mathematical SPARSKIT library has been used for these purposes as it implements among others, external GMRES, PGMRES and BiCGStab solvers. Multi-core CPUs and graphical processing units (GPUs) provide high performance capabilities which are able to accelerate many scientific computations. To take advantage of these new hardware features in daily use computer codes, the integration of the PARALUTION library to solve sparse systems of linear equations is a good choice. It features several types of iterative solvers and preconditioners which can run on both multi-core CPUs and GPU devices without any modification from the interface point of view. This feature is due to the great portability obtained by the modular and flexible design of the library. By exploring this technology, namely the implementation of the PARALUTION library in COBAYA3, we are able to decrease the solution time of the sparse linear systems by a factor 5.15x on GPU and 2.56x on multi-core CPU using standard hardware. These obtained speedup factors in addition to the implementation details are discussed in this paper. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:252 / 259
页数:8
相关论文
共 50 条
  • [21] Fine-grain parallelism using multi-core, Cell/BE, and GPU Systems
    不详
    PARALLEL COMPUTING, 2012, 38 (08) : 365 - 390
  • [22] Stochastic Gradient Descent on Modem Hardware: Multi-core CPU or GPU? Synchronous or Asynchronous?
    Ma, Yujing
    Rusu, Florin
    Torres, Martin
    2019 IEEE 33RD INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2019), 2019, : 1063 - 1072
  • [23] A Fast Parallel GPS Acquisition Algorithm Based on Hybrid GPU and Multi-core CPU
    Kakooei, Mohammad
    Tabatabaei, Amir
    WIRELESS PERSONAL COMMUNICATIONS, 2019, 104 (04) : 1355 - 1366
  • [24] A Fast Parallel GPS Acquisition Algorithm Based on Hybrid GPU and Multi-core CPU
    Mohammad Kakooei
    Amir Tabatabaei
    Wireless Personal Communications, 2019, 104 : 1355 - 1366
  • [25] A Method to Accelerate K-means and GMM Computation with GPU and Multi-core CPU
    Gao, Boyang
    Liu, Ningning
    Wang, Xiangting
    Lan, Mengfei
    Zhao, Zixuan
    Dellandrea, Emmanuel
    Chen, Liming
    2018 IEEE FOURTH INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM), 2018,
  • [26] pyPaSWAS: Python']Python-based multi-core CPU and GPU sequence alignment
    Warris, Sven
    Timal, N. Roshan N.
    Kempenaar, Marcel
    Poortinga, Arne M.
    van de Geest, Henri
    Varbanescu, Ana L.
    Nap, Jan-Peter
    PLOS ONE, 2018, 13 (01):
  • [27] Algorithmic skeletons for multi-core, multi-GPU systems and clusters
    Ernsting, Steffen
    Kuchen, Herbert
    International Journal of High Performance Computing and Networking, 2012, 7 (02) : 129 - 138
  • [28] Performance Analysis of GPU compared to Single-core and Multi-core CPU for Natural Language Applications
    Gupta, Shubham
    Babu, M. Rajasekhara
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2011, 2 (05) : 50 - 53
  • [29] Accelerating the MMD algorithm using Multi-core Environments
    Schloesser, Michael
    Herpers, Rainer
    Kent, Kenneth B.
    2011 IEEE PACIFIC RIM CONFERENCE ON COMMUNICATIONS, COMPUTERS AND SIGNAL PROCESSING (PACRIM), 2011, : 340 - 345
  • [30] Investigations of boron transport in a PWR core with COBAYA3/SUBCHANFLOW inside the NURESIM platform
    Calleja, M.
    Jimenez, J.
    Sanchez, V.
    Imke, U.
    Stieglitz, R.
    Macian, R.
    ANNALS OF NUCLEAR ENERGY, 2014, 66 : 74 - 84