Sassena - X-ray and neutron scattering calculated from molecular dynamics trajectories using massively parallel computers

Cited by: 50
Authors
Lindner, Benjamin [1, 2]
Smith, Jeremy C. [2, 3]
Affiliations
[1] Univ Tennessee, Knoxville, TN 37996 USA
[2] Univ Tennessee, Oak Ridge Natl Lab, Ctr Biophys Mol, Oak Ridge, TN 37830 USA
[3] Univ Tennessee, Dept Biochem & Cellular & Mol Biol, Knoxville, TN 37996 USA
Funding
National Science Foundation (USA)
Keywords
X-ray; Neutron; Scattering; Molecular dynamics; Massively parallel; SIMULATIONS; CELLULOSE; PROGRAM; LYSOZYME; MOTIONS;
DOI
10.1016/j.cpc.2012.02.010
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Massively parallel computers now permit molecular dynamics (MD) simulations of multi-million-atom systems on time scales up to the microsecond. However, the subsequent analysis of the resulting trajectories has itself become a high-performance computing problem. Here, we present software for calculating X-ray and neutron scattering intensities from MD simulation data that scales well on massively parallel supercomputers. The calculation and data staging schemes maximize the degree of parallelism and minimize the IO bandwidth requirements. Strong-scaling tests on the Jaguar petaflop Cray XT5 at Oak Ridge National Laboratory show virtually linear scaling up to 7000 cores for most benchmark systems. Because both MPI and thread parallelism are supported, the software is flexible enough to cover the scaling demands of different types of scattering calculations. The result is a high-performance tool capable of unifying large-scale supercomputing and a wide variety of neutron/synchrotron technology.

Program summary
Program title: Sassena
Catalogue identifier: AELW_v1_0
Program summary URL: http://cpc.cs.qub.ac.uk/summaries/AELW_v1_0.html
Program obtainable from: CPC Program Library, Queen's University, Belfast, N. Ireland
Licensing provisions: GNU General Public License, version 3
No. of lines in distributed program, including test data, etc.: 1 003 742
No. of bytes in distributed program, including test data, etc.: 798
Distribution format: tar.gz
Programming language: C++, OpenMPI
Computer: Distributed Memory, Cluster of Computers with high performance network, Supercomputer
Operating system: UNIX, LINUX, OSX
Has the code been vectorized or parallelized?: Yes, the code has been parallelized using MPI directives. Tested with up to 7000 processors
RAM: Up to 1 Gbytes/core
Classification: 6.5, 8
External routines: Boost Library, FFTW3, CMake, GNU C++ Compiler, OpenMPI, LibXML, LAPACK

Nature of problem: Recent developments in supercomputing allow molecular dynamics simulations to generate large trajectories spanning millions of frames and thousands of atoms. The structural and dynamical analysis of these trajectories requires algorithms that use parallel computation and IO schemes to solve the computational task in a practical amount of time. The computational and IO requirements depend strongly on the particular analysis algorithm. In scattering calculations, a very frequent pattern is that the trajectory data is used multiple times to compute different projections, which are then aggregated into a single scattering function. Thus, for good performance the trajectory data has to be kept in memory, and the parallel computer must have enough RAM to store a volatile copy of the whole trajectory. To achieve high performance and good scalability, the mapping of the physical equations onto a parallel computer needs to consider data locality and reduce the amount of inter-node communication.

Solution method: The physical equations for scattering calculations were analyzed, and two major calculation schemes were developed to support any type of scattering calculation (all/self). Certain hardware aspects were taken into account, e.g. high performance computing clusters and supercomputers usually feature a two-tier network, with Ethernet serving the file storage and InfiniBand carrying the inter-node communication via MPI calls. The time spent loading the trajectory data into memory is minimized by letting each core read only the trajectory data it requires. The performance of inter-node communication is maximized by exclusively using the appropriate MPI calls to exchange the necessary data, resulting in excellent scalability. The partitioning scheme developed to map the calculation onto a parallel computer covers a wide variety of use cases without negatively affecting the achieved performance. This is done through a 2D partitioning scheme in which independent scattering vectors are assigned to independent parallel partitions and all communication is local to the partition.

Additional comments: The distribution file for this program is approximately 36 Mbytes and therefore is not delivered directly when download or E-mail is requested. Instead, an html file giving details of how the program can be obtained is sent.

Running time: Typical run times range from 1 min on 20 nodes to 2 h on 2000 nodes, that is, 0.5-4000 CPU hours per execution.

(C) 2012 Elsevier B.V. All rights reserved.
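The 2D partitioning idea described under "Solution method" can be illustrated with a minimal sketch. This is not Sassena's code: the function names (`split_round_robin`, `partition_2d`), the round-robin assignment, and the choice to split trajectory frames among the ranks of a partition are all assumptions made for illustration. The abstract only states that independent scattering vectors go to independent partitions and that communication stays inside a partition.

```python
from typing import Dict, List, Tuple


def split_round_robin(n_items: int, n_bins: int) -> List[List[int]]:
    """Deal item indices 0..n_items-1 into n_bins lists, round-robin."""
    return [list(range(b, n_items, n_bins)) for b in range(n_bins)]


def partition_2d(
    n_ranks: int, partition_size: int, n_qvectors: int, n_frames: int
) -> Dict[int, Tuple[List[int], List[int]]]:
    """Map each rank to (scattering-vector indices, frame indices).

    Hypothetical layout: ranks are grouped into partitions of
    `partition_size`. Each partition owns a disjoint set of scattering
    vectors; inside a partition, the frames are shared out among its
    members, so any reduction over frames involves only that partition.
    """
    if n_ranks % partition_size != 0:
        raise ValueError("n_ranks must be a multiple of partition_size")
    n_partitions = n_ranks // partition_size

    q_by_partition = split_round_robin(n_qvectors, n_partitions)
    frames_by_member = split_round_robin(n_frames, partition_size)

    layout = {}
    for rank in range(n_ranks):
        # First axis: which partition; second axis: position inside it.
        partition, member = divmod(rank, partition_size)
        layout[rank] = (q_by_partition[partition], frames_by_member[member])
    return layout


if __name__ == "__main__":
    layout = partition_2d(n_ranks=6, partition_size=3, n_qvectors=4, n_frames=7)
    for rank, (qs, frames) in layout.items():
        print(f"rank {rank}: q-vectors {qs}, frames {frames}")
```

In this sketch every (scattering vector, frame) pair is handled by exactly one rank, and ranks in different partitions never share a scattering vector, which is the property that keeps communication local to a partition. In an MPI program, the partitions would typically correspond to sub-communicators.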
Pages: 1491-1501
Number of pages: 11