A MapReduce-Enabled Scientific Workflow Composition Framework

Cited by: 18
Authors
Fei, Xubo [1]
Lu, Shiyong [1]
Lin, Cui [1]
Affiliations
[1] Wayne State Univ, Dept Comp Sci, Detroit, MI 48202 USA
Source
2009 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES, VOLS 1 AND 2 | 2009
Keywords
MANAGEMENT;
DOI
10.1109/ICWS.2009.90
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
MapReduce has recently gained considerable attention as a parallel programming model for scalable data-intensive business and scientific analysis. To benefit from this programming model in a scientific workflow environment, we propose a MapReduce-enabled scientific workflow composition framework consisting of: i) a dataflow-based scientific workflow model that separates the declaration of the workflow interface from the definition of its functional body; ii) a set of dataflow constructs, including Map, Reduce, Loop, and Conditional, and their composition semantics to enable MapReduce-style scientific workflows; and iii) an XML-based scientific workflow specification language, called WSL, in which both Map and Reduce are fully composable with other dataflow constructs in both flat and hierarchical manners. Besides bringing the power of MapReduce to the workflow level, our workflow composition framework is unique in that workflows are the only operands for composition; in this way, our approach solves the two-world problem of existing composition frameworks, in which composition needs to deal with both the world of tasks and the world of workflows. The proposed framework is implemented, and a case study is conducted to validate our techniques.
Pages: 663-670
Page count: 8