Evaluation of performance portability frameworks for the implementation of a particle-in-cell code

被引:16
作者
Artigues, Victor [1 ,2 ]
Kormann, Katharina [2 ]
Rampp, Markus [1 ]
Reuter, Klaus [1 ]
机构
[1] Max Planck Comp & Data Facil, D-85748 Garching, Germany
[2] Max Planck Inst Plasma Phys, Garching, Germany
关键词
CUDA; Kokkos; OpenACC; OpenMP; particle in cell; performance portability; RAJA;
D O I
10.1002/cpe.5640
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper reports on an in-depth evaluation of the performance portability frameworks Kokkos and RAJA with respect to their suitability for the implementation of complex particle-in-cell (PIC) simulation codes, extending previous studies based on codes from other domains. At the example of a particle-in-cell model, we implemented the hotspot of the code in C++ and parallelized it using OpenMP, OpenACC, CUDA, Kokkos, and RAJA, targeting multi-core (CPU) and graphics (GPU) processors. Both Kokkos and RAJA appear mature, are usable for complex codes, and keep their promise to provide performance portability across different architectures. Comparing the obtainable performance on state-of-the art hardware, but also considering aspects such as code complexity, feature availability, and overall productivity, we finally draw the conclusion that the Kokkos framework would be suited best to tackle the massively parallel implementation of the full PIC model.
引用
收藏
页数:23
相关论文
共 31 条
[21]  
NVIDIA, 2017, Nvidia dgx-1 with tesla v100 system architecture, P1
[22]   Towards the optimization of a gyrokinetic Particle-In-Cell (PIC) code on large-scale hybrid architectures [J].
Ohana, N. ;
Jocksch, A. ;
Lanti, E. ;
Tran, T. M. ;
Brunner, S. ;
Gheller, C. ;
Hariri, F. ;
Villard, L. .
JOINT VARENNA-LAUSANNE INTERNATIONAL WORKSHOP ON THE THEORY OF FUSION PLASMAS 2016, 2016, 775
[23]  
OpenMP Architecture Review Board, 2018, OPENMP APPL PROGR IN
[24]  
Strohmaier E., 2019, TOP 500 LIST
[25]  
Sunderland D, 2016, 2016 2 INT WORKSH EX
[26]  
T. M. Forum, 1993, MPI MESS PASS INT
[27]  
Treibig J, 2010, 2010 39 INT C PAR PR
[28]   Warp-X: A new exascale computing platform for beam-plasma simulations [J].
Vay, J. -L. ;
Almgren, A. ;
Bell, J. ;
Ge, L. ;
Grote, D. P. ;
Hogan, M. ;
Kononenko, O. ;
Lehe, R. ;
Myers, A. ;
Ng, C. ;
Park, J. ;
Ryne, R. ;
Shapoval, O. ;
Thevenet, M. ;
Zhang, W. .
NUCLEAR INSTRUMENTS & METHODS IN PHYSICS RESEARCH SECTION A-ACCELERATORS SPECTROMETERS DETECTORS AND ASSOCIATED EQUIPMENT, 2018, 909 :476-479
[29]  
Wolfe M, 2014, COMPILERS MORE MPI X
[30]  
Zenker E., 2016, 2016 IEEE INT PAR DI