Leveraging HPC Profiling and Tracing Tools to Understand the Performance of Particle-in-Cell Monte Carlo Simulations

被引:0
作者
Williams, Jeremy J. [1 ]
Tskhakaya, David [2 ]
Costea, Stefan [3 ]
Peng, Ivy B. [1 ]
Garcia-Gasulla, Marta [4 ]
Markidis, Stefano [1 ]
机构
[1] KTH Royal Inst Technol, Stockholm, Sweden
[2] CAS, Inst Plasma Phys, Prague, Czech Republic
[3] Univ Ljubljana, LeCAD, Ljubljana, Slovenia
[4] Barcelona Supercomp Ctr, Barcelona, Spain
来源
EURO-PAR 2023: PARALLEL PROCESSING WORKSHOPS, PT I, EURO-PAR 2023 | 2024年 / 14351卷
关键词
Performance Monitoring and Analysis; PIC Performance Bottleneck; Large-Scale PIC Simulations;
D O I
10.1007/978-3-031-50684-0_10
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Large-scale plasma simulations are critical for designing and developing next-generation fusion energy devices and modeling industrial plasmas. BIT1 is a massively parallel Particle-in-Cell code designed for specifically studying plasma material interaction in fusion devices. Its most salient characteristic is the inclusion of collision Monte Carlo models for different plasma species. In this work, we characterize single node, multiple nodes, and I/O performances of the BIT1 code in two realistic cases by using several HPC profilers, such as perf, IPM, Extrae/Paraver, and Darshan tools. We find that the BIT1 sorting function on-node performance is the main performance bottleneck. Strong scaling tests show a parallel performance of 77% and 96% on 2,560 MPI ranks for the two test cases. We demonstrate that communication, load imbalance and selfsynchronization are important factors impacting the performance of the BIT1 on large-scale runs.
引用
收藏
页码:123 / 134
页数:12
相关论文
共 17 条
[1]   Analytic Modeling of Idle Waves in Parallel Programs: Communication, Cluster Topology, and Noise Impact [J].
Afzal, Ayesha ;
Hager, Georg ;
Wellein, Gerhard .
HIGH PERFORMANCE COMPUTING, ISC HIGH PERFORMANCE 2021, 2021, 12728 :351-371
[2]  
Fuerlinger K., 2010, Proceedings 2010 IEEE 16th International Conference on Parallel and Distributed Systems (ICPADS 2010), P373, DOI 10.1109/ICPADS.2010.16
[3]  
KTH, 2023, Cachetest
[4]   Idle waves in high-performance computing [J].
Markidis, Stefano ;
Vencels, Juris ;
Peng, Ivy Bo ;
Akhmetova, Dana ;
Laure, Erwin ;
Henri, Pierre .
PHYSICAL REVIEW E, 2015, 91 (01)
[5]  
Peng IB, 2016, PROCEEDINGS OF 2016 IEEE 18TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS
[6]  
IEEE 14TH INTERNATIONAL CONFERENCE ON SMART CITY
[7]  
IEEE 2ND INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), P937, DOI [10.1109/HPCC-SmartCity-DSS.2016.116, 10.1109/HPCC-SmartCity-DSS.2016.0134]
[8]  
Peng IB, 2016, PROCEEDINGS OF 2016 IEEE 18TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS
[9]  
IEEE 14TH INTERNATIONAL CONFERENCE ON SMART CITY
[10]  
IEEE 2ND INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), P473, DOI [10.1109/HPCC-SmartCity-DSS.2016.102, 10.1109/HPCC-SmartCity-DSS.2016.0074]