Scaling up workflow-based applications

被引:33
作者
Callaghan, Scott [2 ]
Deelman, Ewa [1 ]
Gunter, Dan [5 ]
Juve, Gideon [2 ]
Maechling, Philip [2 ]
Brooks, Christopher [6 ]
Vahi, Karan [1 ]
Milner, Kevin [2 ]
Graves, Robert [3 ]
Field, Edward [4 ]
Okaya, David [2 ]
Jordan, Thomas [2 ]
机构
[1] USC Informat Sci Inst, Marina Del Rey, CA 90292 USA
[2] Univ So Calif, Los Angeles, CA 90089 USA
[3] URS Corp, Pasadena, CA 91101 USA
[4] US Geol Survey, Pasadena, CA 91106 USA
[5] Univ Calif Berkeley, Lawrence Berkeley Lab, Berkeley, CA 94720 USA
[6] Univ San Francisco, San Francisco, CA 94117 USA
基金
美国国家科学基金会;
关键词
Scientific workflows; Distributed applications; Workflow scalability;
D O I
10.1016/j.jcss.2009.11.005
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Scientific applications, often expressed as workflows are making use of large-scale national cyberinfrastructure to explore the behavior of systems, search for phenomena in large-scale data, and to conduct many other scientific endeavors As the complexity of the systems being studied grows and as the data set sizes Increase, the scale of the computational workflows increases as well. In some cases, workflows now have hundreds of thousands of individual tasks Managing such scale is difficult from the point of view of workflow description, execution, and analysis In this paper, we describe the challenges faced by workflow management and performance analysis systems when dealing with an earthquake science application. CyberShake, executing on the TeraGrid. The scientific goal of the SCEC CyberShake project is to calculate probabilistic seismic hazard curves for sites in Southern California. For each site of interest, the CyberShake platform includes two large-scale MPI calculations and approximately 840,000 embarrassingly parallel post-processing jobs. In this paper, we show how we approach the scalability challenges in our workflow management and log mining systems. (C) 2009 Elsevier Inc. All rights reserved.
引用
收藏
页码:428 / 446
页数:19
相关论文
共 50 条
[1]  
ANDREWS T, 2003, SPECIFICATION BUSINE
[2]  
[Anonymous], CONDOR GLIDEIN
[3]  
[Anonymous], DAGMAN
[4]  
[Anonymous], 2006, 2006 2 IEEE INT C E
[5]  
[Anonymous], P 8 IFIP IEEE INT S
[6]  
[Anonymous], 2008, 3 WORKSH WORKFL SUPP
[7]  
BERRIMAN B, 2003, ASTRONOMICAL DAT ANA, V13
[8]  
BERRIMAN GB, 2006, WORKFLOWS E SCI
[9]  
BERRIMAN GB, 2004, SPIE C, V5487
[10]  
BROWN DA, 2006, WORKFLOWS E SCI