Hadoop-based replica exchange over heterogeneous distributed cyberinfrastructures

Cited by: 4
Authors
Platania, Richard [1, 2]
Shams, Shayan [1, 2]
Chiu, Chui-Hui [1, 2]
Kim, Nayong [2]
Kim, Joohyun [2]
Park, Seung-Jong [1, 2]
Affiliations
[1] Louisiana State Univ, Sch EECS, Baton Rouge, LA 70803 USA
[2] Louisiana State Univ, Ctr Computat & Technol, Baton Rouge, LA 70803 USA
Funding
National Science Foundation (USA)
Keywords
distributed cyberinfrastructure; replica exchange; enhanced conformational sampling; replica exchange statistical temperature molecular dynamics (RESTMD); Hadoop MapReduce; GENI; MOLECULAR-DYNAMICS SIMULATIONS; MAPREDUCE; TOOLKIT
DOI
10.1002/cpe.3878
Chinese Library Classification (CLC) number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
We present Hadoop-based replica exchange (HaRE), a Hadoop-based implementation of the replica exchange scheme developed primarily for replica exchange statistical temperature molecular dynamics, an example of a large-scale, advanced sampling molecular dynamics simulation. By using Hadoop as a framework and the MapReduce model for driving replica exchange, an efficient task-level parallelism is introduced to replica exchange statistical temperature molecular dynamics simulations. In order to demonstrate this, we investigate the performance of our application over various distributed cyberinfrastructures (DCI), including several high-performance computing systems, our cyberinfrastructure for reconfigurable optical networks testbed, the global environment for network innovations testbed, and the CloudLab testbed. Scalability performance analysis is shown in terms of scale-out and scale-up over a single high-performance computing cluster, EC2, and CloudLab and scale-across with cyberinfrastructure for reconfigurable optical networks and global environment for network innovations. As a result, we demonstrate that HaRE is capable of efficient execution over both homogeneous and heterogeneous DCI of varying size and configuration. Contributing factors to performance are discussed in order to provide insight towards the effects of computing environment on the execution of HaRE. With these contributions, we propose that similar loosely coupled scientific applications can also take advantage of the scalable, task-level parallelism Hadoop MapReduce provides over various DCI. Copyright (C) 2016 John Wiley & Sons, Ltd.
Pages: 14