The Ensembl pipeline is an extension to the Ensembl system which allows automated annotation of genomic sequence. The software comprises two parts. First, there is a set of Perl modules ("Runnables" and "RunnableDBs") which are 'wrappers' for a variety of commonly used analysis tools. These retrieve sequence data from a relational database, run the analysis, and write the results back to the database. They inherit from a common interface, which simplifies the writing of new wrapper modules. On top of this sits a job Submission system (the "RuleManager") which allows efficient and reliable submission of large numbers of jobs to a compute farm. Here we describe the fundamental software components of the pipeline, and we also highlight some features of the Sanger installation which were necessary to enable the pipeline to scale to whole-genome analysis.
机构:
Univ Calif Santa Cruz, Dept Mol Cellular & Dev Biol, Santa Cruz, CA 95064 USAUniv Calif Santa Cruz, Dept Mol Cellular & Dev Biol, Santa Cruz, CA 95064 USA
Kent, WJ
;
Sugnet, CW
论文数: 0引用数: 0
h-index: 0
机构:Univ Calif Santa Cruz, Dept Mol Cellular & Dev Biol, Santa Cruz, CA 95064 USA
Sugnet, CW
;
Furey, TS
论文数: 0引用数: 0
h-index: 0
机构:Univ Calif Santa Cruz, Dept Mol Cellular & Dev Biol, Santa Cruz, CA 95064 USA
Furey, TS
;
Roskin, KM
论文数: 0引用数: 0
h-index: 0
机构:Univ Calif Santa Cruz, Dept Mol Cellular & Dev Biol, Santa Cruz, CA 95064 USA
Roskin, KM
;
Pringle, TH
论文数: 0引用数: 0
h-index: 0
机构:Univ Calif Santa Cruz, Dept Mol Cellular & Dev Biol, Santa Cruz, CA 95064 USA
Pringle, TH
;
Zahler, AM
论文数: 0引用数: 0
h-index: 0
机构:Univ Calif Santa Cruz, Dept Mol Cellular & Dev Biol, Santa Cruz, CA 95064 USA
Zahler, AM
;
Haussler, D
论文数: 0引用数: 0
h-index: 0
机构:Univ Calif Santa Cruz, Dept Mol Cellular & Dev Biol, Santa Cruz, CA 95064 USA
机构:
Univ Calif Santa Cruz, Dept Mol Cellular & Dev Biol, Santa Cruz, CA 95064 USAUniv Calif Santa Cruz, Dept Mol Cellular & Dev Biol, Santa Cruz, CA 95064 USA
Kent, WJ
;
Sugnet, CW
论文数: 0引用数: 0
h-index: 0
机构:Univ Calif Santa Cruz, Dept Mol Cellular & Dev Biol, Santa Cruz, CA 95064 USA
Sugnet, CW
;
Furey, TS
论文数: 0引用数: 0
h-index: 0
机构:Univ Calif Santa Cruz, Dept Mol Cellular & Dev Biol, Santa Cruz, CA 95064 USA
Furey, TS
;
Roskin, KM
论文数: 0引用数: 0
h-index: 0
机构:Univ Calif Santa Cruz, Dept Mol Cellular & Dev Biol, Santa Cruz, CA 95064 USA
Roskin, KM
;
Pringle, TH
论文数: 0引用数: 0
h-index: 0
机构:Univ Calif Santa Cruz, Dept Mol Cellular & Dev Biol, Santa Cruz, CA 95064 USA
Pringle, TH
;
Zahler, AM
论文数: 0引用数: 0
h-index: 0
机构:Univ Calif Santa Cruz, Dept Mol Cellular & Dev Biol, Santa Cruz, CA 95064 USA
Zahler, AM
;
Haussler, D
论文数: 0引用数: 0
h-index: 0
机构:Univ Calif Santa Cruz, Dept Mol Cellular & Dev Biol, Santa Cruz, CA 95064 USA