Taverna: lessons in creating a workflow environment for the life sciences

被引:331
作者
Oinn, Tom
Greenwood, Mark
Addis, Matthew
Alpdemir, M. Nedim
Ferris, Justin
Glover, Kevin
Goble, Carole
Goderis, Antoon
Hull, Duncan
Marvin, Darren
Li, Peter
Lord, Phillip
Pocock, Matthew R.
Senger, Martin
Stevens, Robert
Wipat, Anil
Wroe, Chris
机构
[1] Univ Manchester, Sch Comp Sci, Manchester M13 9PL, Lancs, England
[2] EMBL European Bioinformat Inst, Cambridge CB10 1SD, England
[3] Univ Southampton, IT Innovat Ctr, Southampton SO16 7NP, Hants, England
[4] Univ Nottingham, Sch Comp Sci & Informat Technol, Nottingham NG8 1BB, England
[5] Newcastle Univ, Sch Comp Sci, Newcastle Upon Tyne NE1 7RU, Tyne & Wear, England
关键词
scientific workflow; Semantic Grid environment; life sciences; Web services;
D O I
10.1002/cpe.993
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Life sciences research is based on individuals, often with diverse skills, assembled into research groups. These groups use their specialist expertise to address scientific problems. The in silico experiments undertaken by these research groups can be represented as workflows involving the co-ordinated use of analysis programs and information repositories that may be globally distributed. With regards to Grid computing, the requirements relate to the sharing of analysis and information resources rather than sharing computational power. The (my)Grid project has developed the Taverna Workbench for the composition and execution of workflows for the life sciences community. This experience paper describes lessons learnt during the development of Taverna. A common theme is the importance of understanding how workflows fit into the scientists' experimental context. The lessons reflect an evolving understanding of life scientists' requirements on a workflow environment, which is relevant to other areas of data intensive and exploratory science. Copyright (c) 2005 John Wiley & Sons, Ltd.
引用
收藏
页码:1067 / 1100
页数:34
相关论文
共 61 条
[51]  
THATTE X, 2001, XLANG WEB SERVICES B
[52]  
*UK NAT E SCI CTR, 2003, NESC WORKSH E SCI WO
[53]   Workflow patterns [J].
Van der Aalst, WMP ;
Ter Hofstede, AHM ;
Kiepuszewski, B ;
Barros, AP .
DISTRIBUTED AND PARALLEL DATABASES, 2003, 14 (01) :5-51
[54]   XEMBL: Distributing EMBL data in XML format [J].
Wang, LC ;
Riethoven, JJ ;
Robinson, A .
BIOINFORMATICS, 2002, 18 (08) :1147-1148
[55]  
Wilkinson M., 2003, Proc. of the Virt Conf Genom and Bioinf, V3, P16
[56]   Automating experiments using semantic data on a bioinformatics grid [J].
Wroe, C ;
Goble, C ;
Greenwood, M ;
Lord, P ;
Miles, S ;
Papay, J ;
Payne, T ;
Moreau, L .
IEEE INTELLIGENT SYSTEMS, 2004, 19 (01) :48-55
[57]   A suite of DAML plus OIL ontologies to describe bioinformatics web services and data [J].
Wroe, C ;
Stevens, R ;
Goble, C ;
Roberts, A ;
Greenwood, M .
INTERNATIONAL JOURNAL OF COOPERATIVE INFORMATION SYSTEMS, 2003, 12 (02) :197-224
[58]  
Zhao J, 2004, LECT NOTES COMPUT SC, V3298, P92
[59]  
BIOOPERA PROCESS SUP
[60]  
TURBOWORX