Accelerating COBAYA3 on multi-core CPU and GPU systems using PARALUTION

被引:5
|
作者
Trost, Nico [1 ]
Jimenez, Javier [1 ]
Lukarski, Dimitar [2 ]
Sanchez, Victor [1 ]
机构
[1] Karlsruhe Inst Technol, Inst Neutron Phys & Reactor Technol, D-76344 Eggenstein Leopoldshafen, Germany
[2] Uppsala Univ, Dept Informat Technol, Div Comp Sci, S-75237 Uppsala, Sweden
关键词
COBAYA3; PARALUTION; Acceleration; Parallelization; Multi-core; GPUs;
D O I
10.1016/j.anucene.2014.08.005
中图分类号
TL [原子能技术]; O571 [原子核物理学];
学科分类号
0827 ; 082701 ;
摘要
COBAYA3 is a multi-physics system of codes which includes two 3D multi-group neutron diffusion codes, ANDES and COBAYA3-PBP, coupled with COBRA-TF, COBRA-IIIc and SUBCHANFLOW sub-channel thermal-hydraulic codes, for the simulation of LWR core transients. The 3D multi-group neutron diffusion equations are expressed in terms of a sparse linear system which can be solved using different iterative Krylov subspace solvers. The mathematical SPARSKIT library has been used for these purposes as it implements among others, external GMRES, PGMRES and BiCGStab solvers. Multi-core CPUs and graphical processing units (GPUs) provide high performance capabilities which are able to accelerate many scientific computations. To take advantage of these new hardware features in daily use computer codes, the integration of the PARALUTION library to solve sparse systems of linear equations is a good choice. It features several types of iterative solvers and preconditioners which can run on both multi-core CPUs and GPU devices without any modification from the interface point of view. This feature is due to the great portability obtained by the modular and flexible design of the library. By exploring this technology, namely the implementation of the PARALUTION library in COBAYA3, we are able to decrease the solution time of the sparse linear systems by a factor 5.15x on GPU and 2.56x on multi-core CPU using standard hardware. These obtained speedup factors in addition to the implementation details are discussed in this paper. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:252 / 259
页数:8
相关论文
共 50 条
  • [1] Accelerating COBAYA3 on multi-core CPU and GPU systems using PARALUTION
    Trost, Nico
    Jimenez, Javier
    Lukarski, Dimitar
    Sanchez, Victor
    SNA + MC 2013 - JOINT INTERNATIONAL CONFERENCE ON SUPERCOMPUTING IN NUCLEAR APPLICATIONS + MONTE CARLO, 2014,
  • [2] ACCELERATING SAR IMAGING USING VECTOR EXTENSION ON MULTI-CORE SIMD CPU
    Li, Guojun
    Zhang, Fan
    Ma, Lixiang
    Hu, Wei
    Li, Wei
    2015 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2015, : 537 - 540
  • [3] Utilization of a Web Browser for Complex Heterogeneous Parallel Computing Using Multi-core CPU/GPU Systems
    Woda, Marek
    Hajduga, Adam
    COMPUTER AIDED SYSTEMS THEORY - EUROCAST 2017, PT I, 2018, 10671 : 93 - 100
  • [4] Accelerating the SCE-UA Global Optimization Method Based on Multi-Core CPU and Many-Core GPU
    Kan, Guangyuan
    Liang, Ke
    Li, Jiren
    Ding, Liuqian
    He, Xiaoyan
    Hu, Youbing
    Amo-Boateng, Mark
    ADVANCES IN METEOROLOGY, 2016, 2016
  • [5] Acceleration of Stereo-Matching on Multi-core CPU and GPU
    Xu, Tian
    Cockshott, Paul
    Oehler, Susanne
    2014 IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2014 IEEE 6TH INTL SYMP ON CYBERSPACE SAFETY AND SECURITY, 2014 IEEE 11TH INTL CONF ON EMBEDDED SOFTWARE AND SYST (HPCC,CSS,ICESS), 2014, : 108 - 115
  • [6] Multi-Core (CPU and GPU) for Permutation-Based Indexing
    Mohamed, Hisham
    Osipyan, Hasmik
    Marchand-Maillet, Stephane
    SIMILARITY SEARCH AND APPLICATIONS, 2014, 8821 : 277 - 288
  • [7] Optimized HPL for AMD GPU and multi-core CPU usage
    Bach, Matthias
    Kretz, Matthias
    Lindenstruth, Volker
    Rohr, David
    COMPUTER SCIENCE-RESEARCH AND DEVELOPMENT, 2011, 26 (3-4): : 153 - 164
  • [8] Using Criticality of GPU Accesses in Memory Management for CPU-GPU Heterogeneous Multi-Core Processors
    Rai, Siddharth
    Chaudhuri, Mainak
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2017, 16
  • [9] Adaptively accelerating FWM2DA seismic modelling program on multi-core CPU and GPU architectures
    Londhe, Ashutosh
    Rastogi, Richa
    Srivastava, Abhishek
    Khonde, Kiran
    Sirasala, Kirannmayi M.
    Kharche, Komal
    COMPUTERS & GEOSCIENCES, 2021, 146
  • [10] Survey on Key Technologies of Graph Processing Systems Based on Multi-core CPU and GPU Platforms
    Zhang, Yuan
    Cao, Huawei
    Zhang, Jie
    Shen, Yue
    Sun, Yiming
    Dun, Ming
    An, Xuejun
    Ye, Xiaochun
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2024, 61 (06): : 1401 - 1428