Performance of a Second Order Electrostatic Particle-in-Cell Algorithm on Modern Many-Core Architectures

被引:5
作者
Brown, Dominic A. S. [1 ]
Wright, Steven A. [1 ]
Jarvis, Stephen A. [1 ]
机构
[1] Univ Warwick, Dept Comp Sci, Coventry, W Midlands, England
关键词
Particle-in-Cell; PIC; Second Order Algorithms; Many-Core; P100; KNL; K20; GPU; Broadwell;
D O I
10.1016/j.entcs.2018.09.006
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper we present the outline of a novel electrostatic, second order Particle-in-Cell (PIC) algorithm, that makes use of 'ghost particle& located around true particle positions in order to represent a charge distribution. We implement our algorithm within EMPIRE-PIC, a PIC code developed at Sandia National Laboratories. We test the performance of our algorithm on a variety of many-core architectures including NVIDIA GPUs, conventional CPUs, and Intel's Knights Landing. Our preliminary results show the viability of second order methods for PIC applications on these architectures when compared to previous generations of many-core hardware. Specifically, we see an order of magnitude improvement in performance for second order methods between the Tesla K20 and Tesla P100 GPU devices, despite only a 4x improvement in the theoretical peak performance between the devices. Although these initial results show a large increase in runtime over first order methods, we hope to be able to show improved scaling behaviour and increased simulation accuracy in the future.
引用
收藏
页码:67 / 84
页数:18
相关论文
共 18 条
[1]   Contemporary particle-in-cell approach to laser-plasma modelling [J].
Arber, T. D. ;
Bennett, K. ;
Brady, C. S. ;
Lawrence-Douglas, A. ;
Ramsay, M. G. ;
Sircombe, N. J. ;
Gillies, P. ;
Evans, R. G. ;
Schmitz, H. ;
Bell, A. R. ;
Ridgers, C. P. .
PLASMA PHYSICS AND CONTROLLED FUSION, 2015, 57 (11)
[2]  
Boris J. P, 1971, P 4 C NUM SIM PLASM, P3
[3]   Adaptable Particle-in-Cell algorithms for graphical processing units [J].
Decyk, Viktor K. ;
Singh, Tajendra V. .
COMPUTER PHYSICS COMMUNICATIONS, 2011, 182 (03) :641-648
[4]   Kokkos: Enabling manycore performance portability through polymorphic memory access patterns [J].
Edwards, H. Carter ;
Trott, Christian R. ;
Sunderland, Daniel .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2014, 74 (12) :3202-3216
[5]  
Grote DP, 2005, AIP CONF PROC, V749, P55, DOI 10.1063/1.1893366
[6]   An overview of the Trilinos Project [J].
Heroux, MA ;
Bartlett, RA ;
Howle, VE ;
Hoekstra, RJ ;
Hu, JJ ;
Kolda, TG ;
Lehoucq, RB ;
Long, KR ;
Pawlowski, RP ;
Phipps, ET ;
Salinger, AG ;
Thornquist, HK ;
Tuminaro, RS ;
Willenbring, JM ;
Williams, A ;
Stanley, KS .
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2005, 31 (03) :397-423
[7]   Nodal high-order methods on unstructured grids - I. Time-domain solution of Maxwell's equations [J].
Hesthaven, JS ;
Warburton, T .
JOURNAL OF COMPUTATIONAL PHYSICS, 2002, 181 (01) :186-221
[8]   High-order nodal discontinuous Galerkin particle-in-cell method on unstructured grids [J].
Jacobs, GB ;
Hesthaven, JS .
JOURNAL OF COMPUTATIONAL PHYSICS, 2006, 214 (01) :96-121
[9]  
Jarvis S. A., 2016, INT J HIGH PERFORMAN
[10]   Particle-in-cell simulations with charge-conserving current deposition on graphic processing units [J].
Kong, Xianglong ;
Huang, Michael C. ;
Ren, Chuang ;
Decyk, Viktor K. .
JOURNAL OF COMPUTATIONAL PHYSICS, 2011, 230 (04) :1676-1685