Performance modeling of microsecond scale biological molecular dynamics simulations on heterogeneous architectures

被引:10
|
作者
Agarwal, Pratul K. [1 ]
Hampton, Scott
Poznanovic, Jeffrey [2 ]
Ramanthan, Arvind [1 ]
Alam, Sadaf R. [2 ]
Crozier, Paul S. [3 ]
机构
[1] Oak Ridge Natl Lab, Oak Ridge, TN 37831 USA
[2] Swiss Natl Supercomp Ctr, Manno, Switzerland
[3] Sandia Natl Labs, Albuquerque, NM 87185 USA
基金
美国能源部;
关键词
performance modeling; GPUs; molecular dynamics; GRAPHICS;
D O I
10.1002/cpe.2943
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Performance improvements in biomolecular simulations based on molecular dynamics (MD) codes are widely desired. Unfortunately, the factors, which allowed past performance improvements, particularly the microprocessor clock frequencies, are no longer increasing. Hence, novel software and hardware solutions are being explored for accelerating performance of widely used MD codes. In this paper, we describe our efforts on porting, optimizing and tuning of Large-scale Atomic/Molecular Massively Parallel Simulator, a popular MD framework, on heterogeneous architectures: multi-core processors with graphical processing unit (GPU) accelerators. Our implementation is based on accelerating the most computationally expensive non-bonded interaction terms on the GPUs and overlapping the computation on the CPU and GPUs. This functionality is built on top of message passing interface that allows multi-level parallelism to be extracted even at the workstation level with the multi-core CPUs and allows extension of the implementation on GPU-enabled clusters. We hypothesize that the optimal benefit of heterogeneous architectures for applications will come by utilizing all possible resources (for example, CPU-cores and GPU devices on GPU-enabled clusters). Benchmarks for a range of biomolecular system sizes are provided, and an analysis is performed on four generations of NVIDIA's GPU devices. On GPU-enabled Linux clusters, by overlapping and pipelining computation and communication, we observe up to 10-folds application acceleration in multi-core and multi-GPU environments illustrating significant performance improvements. Detailed analysis of the implementation is presented that allows identification of bottlenecks in algorithm, indicating that code optimization and improvements on GPUs could allow microsecond scale simulation throughput on workstations and inexpensive GPU clusters, putting widely desired biologically relevant simulation time-scales within reach of a large user community. In order to systematically optimize simulation throughput and to enable performance prediction, we have developed a parameterized performance model that will allow developers and users to explore the performance potential of future heterogeneous systems for biological simulations. Copyright (C) 2012 John Wiley & Sons, Ltd.
引用
收藏
页码:1356 / 1375
页数:20
相关论文
共 50 条
  • [1] Load Balancing for Molecular Dynamics Simulations on Heterogeneous Architectures
    Seckler, Steffen
    Tchipev, Nikola
    Bungartz, Hans-Joachim
    Neumann, Philipp
    PROCEEDINGS OF 2016 IEEE 23RD INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2016, : 101 - 110
  • [2] Molecular dynamics insights into protein-glycosaminoglycan systems from microsecond-scale simulations
    Bojarski, Krzysztof K.
    Sieradzan, Adam K.
    Samsonov, Sergey A.
    BIOPOLYMERS, 2019, 110 (07)
  • [3] Triplex intermediates in folding of human telomeric quadruplexes probed by microsecond-scale molecular dynamics simulations
    Stadlbauer, Petr
    Trantirek, Lukas
    Cheatham, Thomas E., III
    Koca, Jaroslav
    Sponer, Jiri
    BIOCHIMIE, 2014, 105 : 22 - 35
  • [4] Modeling Data Movement Performance on Heterogeneous Architectures
    Bienz, Amanda
    Olson, Luke N.
    Gropp, William D.
    Lockhart, Shelby
    2021 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2021,
  • [5] Trajectories of microsecond molecular dynamics simulations of nucleosomes and nucleosome core particles
    Shaytan, Alexey K.
    Armeev, Grigoriy A.
    Goncearenco, Alexander
    Zhurkin, Victor B.
    Landsman, David
    Panchenko, Anna R.
    DATA IN BRIEF, 2016, 7 : 1678 - 1681
  • [6] Accelerating Molecular Dynamics Simulations on Heterogeneous Architecture
    Wang, Yueqing
    Dou, Yong
    Guo, Song
    Lei, Yuanwu
    Li, Baofeng
    Wang, Qiang
    COMPUTER ENGINEERING AND TECHNOLOGY, 2016, 592 : 118 - 132
  • [7] Function portability of molecular dynamics on heterogeneous parallel architectures with OpenCL
    Halver, Rene
    Homberg, Wilhelm
    Sutmann, Godehard
    JOURNAL OF SUPERCOMPUTING, 2018, 74 (04) : 1522 - 1533
  • [8] Simulations of biological ion channels by molecular dynamics
    Beu, TA
    JOURNAL OF OPTOELECTRONICS AND ADVANCED MATERIALS, 2006, 8 (01): : 160 - 163
  • [9] Function portability of molecular dynamics on heterogeneous parallel architectures with OpenCL
    Rene Halver
    Wilhelm Homberg
    Godehard Sutmann
    The Journal of Supercomputing, 2018, 74 : 1522 - 1533
  • [10] Parallel short range molecular dynamics simulations on computer clusters: Performance evaluation and modeling
    Karakasidis, TE
    Cholevas, NS
    Liakopoulos, AB
    MATHEMATICAL AND COMPUTER MODELLING, 2005, 42 (7-8) : 783 - 798