Graphics Processing Unit Acceleration and Parallelization of GENESIS for Large-Scale Molecular Dynamics Simulations
Cited by: 22
Authors:
Jung, Jaewoon [1,2]
Naruse, Akira [3]
Kobayashi, Chigusa [2]
Sugita, Yuji [1,2,4,5]
Affiliations:
[1] RIKEN Theoret Mol Sci Lab, 2-1 Hirosawa, Wako, Saitama 3510198, Japan
[2] RIKEN Adv Inst Computat Sci, Chuo Ku, 7-1-26 Minatojima Minamimachi, Kobe, Hyogo 6500047, Japan
[3] NVIDIA, Minato Ku, 2-11-7 Akasaka, Tokyo 1070052, Japan
[4] RIKEN iTHES, 2-1 Hirosawa, Wako, Saitama 3510198, Japan
[5] RIKEN Quantitat Biol Ctr QBiC, Lab Biomol Funct Simulat, Chuo Ku, 6-7-1 Minatojima Minamimachi, Kobe, Hyogo 6500047, Japan
Keywords:
PARTICLE MESH EWALD;
FORCE-FIELD;
ENERGETICS;
EFFICIENT;
AMBER;
IMPLEMENTATION;
ALGORITHMS;
SYSTEMS;
SCHEMES;
CHANNEL;
DOI:
10.1021/acs.jctc.6b00241
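Chinese Library Classification number:
O64 [Physical chemistry (theoretical chemistry), chemical physics];
Discipline classification codes: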
070304 ;
081704 ;
Abstract:
The graphics processing unit (GPU) has become a popular computational platform for molecular dynamics (MD) simulations of biomolecules. A significant speedup in the simulations of small- or medium-size systems using only a few computer nodes with a single or multiple GPUs has been reported. Because of GPU memory limitations and slow communication between GPUs on different computer nodes, it is not straightforward to accelerate MD simulations of large biological systems that contain a few million or more atoms on massively parallel supercomputers with GPUs. In this study, we develop a new scheme in our MD software, GENESIS, to reduce the total computational time on such computers. Computationally intensive real-space nonbonded interactions are computed mainly on GPUs in the scheme, while less intensive bonded interactions and communication-intensive reciprocal-space interactions are performed on CPUs. On the basis of the midpoint cell method as a domain decomposition scheme, we invent the single particle interaction list for reducing the GPU memory usage. Since total computational time is limited by the reciprocal-space computation, we utilize the RESPA multiple time-step integration and reduce the CPU resting time by assigning a subset of nonbonded interactions to CPUs as well as to GPUs when the reciprocal-space computation is skipped. We validated our GPU implementations in GENESIS on BPTI and a membrane protein, porin, by MD simulations and an alanine-tripeptide by REMD simulations. Benchmark calculations on the TSUBAME supercomputer showed that an MD simulation of a one-million-atom system was scalable up to 256 computer nodes with GPUs.
Pages: 4947-4958
Number of pages: 12