Performance Analysis of CP2K Code for Ab Initio Molecular Dynamics on CPUs and GPUs

被引:8
作者
Yokelson, Dewi [1 ]
Tkachenko, Nikolay, V [2 ]
Robey, Robert [3 ]
Li, Ying Wai [1 ]
Dub, Pavel A. [2 ]
机构
[1] Los Alamos Natl Lab, Appl Comp Sci Div CCS 7, Los Alamos, NM 87545 USA
[2] Los Alamos Natl Lab, Chem Div C IIAC, Los Alamos, NM 87545 USA
[3] Los Alamos Natl Lab, Computat Phys Div XCP 2, Los Alamos, NM 87545 USA
关键词
EXCHANGE; ACCURATE;
D O I
10.1021/acs.jcim.1c01538
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Using a realistic molecular catalyst system, we conduct scaling studies of ab initio molecular dynamics simulations using the popular CP2K code on both Intel Xeon CPU and NVIDIA V100 GPU architectures. Additional performance improvements were gained by finding more optimal process placement and affinity settings. Statistical methods were employed to understand performance changes in spite of the variability in runtime for each molecular dynamics timestep. Ideal conditions for CPU runs were found when running at least four MPI ranks per node, bound evenly across each socket. This study also showed that fully utilizing processing cores, with one OpenMP thread per core, performed better than when reserving cores for the system. The CPU-only simulations scaled at 70% or more of the ideal scaling up to 10 compute nodes, after which the returns began to diminish more quickly. Simulations on a single 40-core node with two NVIDIA V100 GPUs for acceleration achieved over 3.7x speedup compared to the fastest single 36-core node CPU-only version. These same GPU runs showed a 13% speedup over the fastest time achieved across five CPU-only nodes.
引用
收藏
页码:2378 / 2386
页数:9
相关论文
共 52 条
[31]   The ELPA library: scalable parallel eigenvalue solutions for electronic structure theory and computational science [J].
Marek, A. ;
Blum, V. ;
Johanni, R. ;
Havu, V. ;
Lang, B. ;
Auckenthaler, T. ;
Heinecke, A. ;
Bungartz, H-J ;
Lederer, H. .
JOURNAL OF PHYSICS-CONDENSED MATTER, 2014, 26 (21)
[32]   LIBXC: A library of exchange and correlation functionals for density functional theory [J].
Marques, Miguel A. L. ;
Oliveira, Micael J. T. ;
Burnus, Tobias .
COMPUTER PHYSICS COMMUNICATIONS, 2012, 183 (10) :2272-2281
[33]   PACKMOL: A Package for Building Initial Configurations for Molecular Dynamics Simulations [J].
Martinez, L. ;
Andrade, R. ;
Birgin, E. G. ;
Martinez, J. M. .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 2009, 30 (13) :2157-2164
[34]   Accurate and efficient linear scaling DFT calculations with universal applicability [J].
Mohr, Stephan ;
Ratcliff, Laura E. ;
Genovese, Luigi ;
Caliste, Damien ;
Boulanger, Paul ;
Goedecker, Stefan ;
Deutsch, Thierry .
PHYSICAL CHEMISTRY CHEMICAL PHYSICS, 2015, 17 (47) :31360-31370
[35]  
N VIDIA, NVIDIA A100 TENSOR C
[36]   A UNIFIED FORMULATION OF THE CONSTANT TEMPERATURE MOLECULAR-DYNAMICS METHODS [J].
NOSE, S .
JOURNAL OF CHEMICAL PHYSICS, 1984, 81 (01) :511-519
[37]   Efficient Ab-Initio Molecular Dynamic Simulations by Offloading Fast Fourier Transformations to FPGAs [J].
Ramaswami, Arjun ;
Kenter, Tobias ;
Kuehne, Thomas D. ;
Plessl, Christian .
2020 30TH INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2020, :353-354
[38]  
Reid F., 2013, OPTIMISING CP2K INTE
[39]  
Robey R., 2021, Parallel and High Performance Computing
[40]  
Seewald P., LIBINT CP2K