IntelliGEN: A Distributed Workflow System for Discovering Protein-Protein Interactions

被引:0
作者
Krys Kochut
Jonathan Arnold
Amit Sheth
John Miller
Eileen Kraemer
Budak Arpinar
Jorge Cardoso
机构
[1] University of Georgia,Department of Computer Science
[2] University of Georgia,Department of Genetics
来源
Distributed and Parallel Databases | 2003年 / 13卷
关键词
workflow management; biological process; bioinformatics; protein-protein interaction; laboratory information management;
D O I
暂无
中图分类号
学科分类号
摘要
A large genomics project involves a significant number of researchers and technicians performing dozens of tasks, either manual (e.g. performing laboratory experiments), computer assisted (e.g. looking for genes in the GENBANK database), or sometimes performed entirely automatically by the computer (e.g. sequence assembly). It has become apparent that managing such projects poses overwhelming problems and may lead to results of lower or even unacceptable quality, or possibly drastically increased project costs. In this paper, we present a design and an initial implementation of a distributed workflow system created to schedule and support activities in a genomics laboratory. The focus of the activities in the laboratory is the discovery of protein-protein interactions of fungi, specifically Neurospora crassa. We present our approach of developing, adapting and applying workflow technology in the genomics lab and illustrate it using one distinct part of a larger workflow to discover protein-protein interactions. Novel features of our system include the ability to monitor the quality and timeliness of the results and if necessary, suggesting and incorporating changes to the selected tasks and their scheduling.
引用
收藏
页码:43 / 72
页数:29
相关论文
共 171 条
[1]  
Aalst W.(2000)Dealing withworkflowchange: Identification of issues and solutions International Journal of Computer Systems, Science, and Engineering 15 267-276
[2]  
Jablonski S.(1997)Constructing a physical map of the Fungal Genetics and Biology 21 254-257
[3]  
Arnold J.(1997) genome J. Euk. Microbiol. 44 8S-506
[4]  
Arnold J.(1941)Genetic control of biochemical reactions in Neurospora Proceedings of the National Academy of Sciences, USA 27 499-387
[5]  
Cushion M.T.(1999)Emergent properties of networks of biological signaling pathways Science 283 381-1112
[6]  
Beadle G.W.(1994)Parallel simulated annealing on the hypercube for chromosome reconstruction, invited paper Proc 14th IMACS World Congress on Computational and Applied Mathematics, Atlanta, GA 3 1109-1204
[7]  
Tatum E.L.(1998)Parallel computing for chromosome reconstruction via ordering of DNA sequences Parallel Computing 24 1177-1043
[8]  
Bhalla U.S.(2001)Parallel computation of a maximum likelihood estimator of a physical map Genetics 157 1021-478
[9]  
Iyengar R.(1996)LabFlow-1: A Database benchmark for high-throughput workflow management Proceedings, Fifth International Conference on Extending Database Technology (EDBT), Avignon, France, March 1057 463-474
[10]  
Bhandarkar S.M.(1992)CMAP: Contig mapping and analysis package: A relational database for chromosome reconstruction CABIOS 8 467-686