PyCogent: a toolkit for making sense from sequence

Cited by: 144
Authors
Knight, Rob [1]
Maxwell, Peter [2]
Birmingham, Amanda [1]
Carnes, Jason [4]
Caporaso, J. Gregory [5]
Easton, Brett C. [2]
Eaton, Michael [6]
Hamady, Micah [7]
Lindsay, Helen [2]
Liu, Zongzhi
Lozupone, Catherine
McDonald, Daniel [7]
Robeson, Michael [1, 8]
Sammut, Raymond
Smit, Sandra [1]
Wakefield, Matthew J. [2, 3, 9]
Widmann, Jeremy [1]
Wikman, Shandy [1]
Wilson, Stephanie [7]
Ying, Hua
Huttley, Gavin A. [2]
Affiliations
[1] Univ Colorado, Dept Chem & Biochem, Boulder, CO 80309 USA
[2] Australian Natl Univ, John Curtin Sch Med Res, Computat Genom Lab, Canberra, ACT 2601, Australia
[3] Thermo Fisher Sci, Lafayette, CO USA
[4] Seattle Biomed Res Inst, Seattle, WA 98109 USA
[5] Univ Colorado, Hlth Sci Ctr, Dept Biochem & Mol Genet, Aurora, CO USA
[6] Sci Applicat Int Corp, Englewood, CO USA
[7] Univ Colorado, Dept Comp Sci, Boulder, CO 80309 USA
[8] Univ Colorado, Dept Ecol & Evolut Biol, Boulder, CO 80309 USA
[9] Walter & Eliza Hall Inst Med Res, Melbourne, Vic 3050, Australia
DOI
10.1186/gb-2007-8-8-r171
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Discipline classification code
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
We have implemented in Python the COmparative GENomic Toolkit, a fully integrated and thoroughly tested framework for novel probabilistic analyses of biological sequences, devising workflows, and generating publication quality graphics. PyCogent includes connectors to remote databases, built-in generalized probabilistic techniques for working with biological sequences, and controllers for third-party applications. The toolkit takes advantage of parallel architectures and runs on a range of hardware and operating systems, and is available under the general public license from http://sourceforge.net/projects/pycogent.
Pages: 16
Related papers
48 entries in total
[11]   MUSCLE: multiple sequence alignment with high accuracy and high throughput [J].
Edgar, RC .
NUCLEIC ACIDS RESEARCH, 2004, 32 (05) :1792-1797
[12]   Crystal structure of the von Willebrand factor A1 domain and implications for the binding of platelet glycoprotein Ib [J].
Emsley, J ;
Cruz, M ;
Handin, R ;
Liddington, R .
JOURNAL OF BIOLOGICAL CHEMISTRY, 1998, 273 (17) :10396-10401
[13]   A hidden Markov Model approach to variation among sites in rate of evolution [J].
Felsenstein, J ;
Churchill, GA .
MOLECULAR BIOLOGY AND EVOLUTION, 1996, 13 (01) :93-104
[14]  
Felsenstein J., PHYLIP PHYLOGENY INF
[15]   Bioconductor: open software development for computational biology and bioinformatics [J].
Gentleman, RC ;
Carey, VJ ;
Bates, DM ;
Bolstad, B ;
Dettling, M ;
Dudoit, S ;
Ellis, B ;
Gautier, L ;
Ge, YC ;
Gentry, J ;
Hornik, K ;
Hothorn, T ;
Huber, W ;
Iacus, S ;
Irizarry, R ;
Leisch, F ;
Li, C ;
Maechler, M ;
Rossini, AJ ;
Sawitzki, G ;
Smith, C ;
Smyth, G ;
Tierney, L ;
Yang, JYH ;
Zhang, JH .
GENOME BIOLOGY, 2004, 5 (10)
[16]   Mutation hotspots in the p53 gene in tumors of different origin: correlation with evolutionary conservation and signs of positive selection [J].
Glazko, GV ;
Koonin, EV ;
Rogozin, IB .
BIOCHIMICA ET BIOPHYSICA ACTA-GENE STRUCTURE AND EXPRESSION, 2004, 1679 (02) :95-106
[17]  
GOFFE WL, 1994, J ECONOMETRICS, V99, P60
[18]  
GOLDMAN N, 1994, MOL BIOL EVOL, V11, P725
[19]   Predicting the oncogenicity of missense mutations reported in the International Agency for Cancer Research (IARC) mutation database on p53 [J].
Gorlov, IP ;
Gorlova, OY ;
Amos, CI .
HUMAN MUTATION, 2005, 26 (05) :446-454
[20]   Fast-Find: A novel computational approach to analyzing combinatorial motifs [J].
Hamady, M ;
Peden, E ;
Knight, R ;
Singh, R .
BMC BIOINFORMATICS, 2006, 7 (1)