A Dataflow-Based Scientific Workflow Composition Framework

Cited by: 19
Authors
Fei, Xubo [1]
Lu, Shiyong [1]
Affiliations
[1] Wayne State Univ, Dept Comp Sci, Detroit, MI 48202 USA
Keywords
Scientific workflow; scientific workflow model; workflow composition; MapReduce; VIEW; NEREIS-SUCCINEA; MANAGEMENT; BEHAVIOR; SYSTEM
DOI
10.1109/TSC.2010.58
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Scientific workflow has recently become an enabling technology to automate and speed up the scientific discovery process. Although several scientific workflow management systems (SWFMSs) have been developed, a formal scientific workflow composition model in which workflow constructs are fully compositional with one another is still missing. In this paper, we propose a dataflow-based scientific workflow composition framework consisting of 1) a dataflow-based scientific workflow model that separates the declaration of the workflow interface from the definition of its functional body; 2) a set of workflow constructs, including Map, Reduce, Tree, Loop, Conditional, and Curry, which are fully compositional with one another; and 3) a dataflow-based exception handling approach that supports hierarchical exception propagation and user-defined exception handling. Our workflow composition framework is unique in that workflows are the only operands for composition; in this way, our approach elegantly solves the two-world problem in existing composition frameworks, in which composition needs to deal with both the world of tasks and the world of workflows. The proposed framework has been implemented, and several case studies have been conducted to validate our techniques.
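
The idea that workflows are the only operands for composition can be illustrated with a small sketch. The Python below is not the authors' model or the API of their implementation; the `Workflow` class and the helper names `map_`, `reduce_`, `curry`, `conditional`, and `loop` are hypothetical, chosen only to mirror the construct names in the abstract. Each helper takes workflows and returns a workflow, so any construct can be nested inside any other. The Tree construct and exception handling are left out.

```python
from dataclasses import dataclass
from functools import reduce as fold
from typing import Any, Callable, Tuple


@dataclass(frozen=True)
class Workflow:
    """Hypothetical workflow: a declared interface kept apart from the body."""
    inputs: Tuple[str, ...]    # interface: names of the input ports
    output: str                # interface: name of the single output port
    body: Callable[..., Any]   # functional body

    def run(self, *args: Any) -> Any:
        if len(args) != len(self.inputs):
            raise TypeError(f"expected inputs {self.inputs}, got {len(args)} values")
        return self.body(*args)


def map_(w: Workflow) -> Workflow:
    """Lift a one-input workflow so that it runs on every element of a list."""
    return Workflow(("items",), w.output, lambda items: [w.run(x) for x in items])


def reduce_(w: Workflow, initial: Any) -> Workflow:
    """Fold a list with a two-input workflow (accumulator, element)."""
    return Workflow(("items",), w.output, lambda items: fold(w.run, items, initial))


def curry(w: Workflow, value: Any) -> Workflow:
    """Fix the first input of a workflow, leaving a workflow over the rest."""
    return Workflow(w.inputs[1:], w.output, lambda *rest: w.run(value, *rest))


def conditional(pred: Workflow, then_w: Workflow, else_w: Workflow) -> Workflow:
    """Run then_w when pred yields a true value, otherwise else_w."""
    return Workflow(
        pred.inputs,
        then_w.output,
        lambda *a: then_w.run(*a) if pred.run(*a) else else_w.run(*a),
    )


def loop(cond: Workflow, step: Workflow) -> Workflow:
    """Apply step repeatedly while cond yields a true value."""
    def body(x: Any) -> Any:
        while cond.run(x):
            x = step.run(x)
        return x
    return Workflow(step.inputs, step.output, body)


# Primitive workflows used as operands (all illustrative).
square = Workflow(("x",), "y", lambda x: x * x)
add = Workflow(("a", "b"), "sum", lambda a, b: a + b)
is_even = Workflow(("x",), "flag", lambda x: x % 2 == 0)
halve = Workflow(("x",), "y", lambda x: x // 2)
over_100 = Workflow(("x",), "flag", lambda x: x > 100)

# Constructs nest freely because every operand and every result is a Workflow.
add_ten_to_each = map_(curry(add, 10))
halve_evens_square_odds = map_(conditional(is_even, halve, square))
shrink_below_100 = loop(over_100, halve)
```

For instance, `add_ten_to_each.run([1, 2, 3])` applies a curried workflow inside a Map, and `reduce_(add, 0).run(map_(square).run([1, 2, 3]))` feeds the output of a Map into a Reduce, without any separate notion of a task appearing in the composition.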
Pages: 45-58
Page count: 14