BUSCO Applications from Quality Assessments to Gene Prediction and Phylogenomics

被引:1530
作者
Waterhouse, Robert M. [1 ,2 ,3 ]
Seppey, Mathieu [1 ,2 ]
Simao, Felipe A. [1 ,2 ]
Manni, Mose [1 ,2 ]
Ioannidis, Panagiotis [1 ,2 ]
Klioutchnikov, Guennadi [1 ,2 ]
Kriventseva, Evgenia V. [1 ,2 ]
Zdobnov, Evgeny M. [1 ,2 ]
机构
[1] Univ Geneva, Dept Genet Med & Dev, Med Sch, Geneva, Switzerland
[2] Swiss Inst Bioinformat, Geneva, Switzerland
[3] Univ Lausanne, Dept Ecol & Evolut, Lausanne, Switzerland
基金
瑞士国家科学基金会;
关键词
transcriptomics; metagenomics; bioinformatics; evolution; GENOME; ORTHOLOGS; COMPLETENESS; IMPROVEMENTS; EVOLUTIONARY; BACTERIAL; ORTHODB; FUNGAL; PLANT; TOOL;
D O I
10.1093/molbev/msx319
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Genomics promises comprehensive surveying of genomes and metagenomes, but rapidly changing technologies and expanding data volumes make evaluation of completeness a challenging task. Technical sequencing quality metrics can be complemented by quantifying completeness of genomic data sets in terms of the expected gene content of Benchmarking Universal Single-Copy Orthologs (BUSCO, http://busco.ezlab.org). The latest software release implements a complete refactoring of the code to make it more flexible and extendable to facilitate high-throughput assessments. The original six lineage assessment data sets have been updated with improved species sampling, 34 new subsets have been built for vertebrates, arthropods, fungi, and prokaryotes that greatly enhance resolution, and data sets are now also available for nematodes, protists, and plants. Here, we present BUSCO v3 with example analyses that highlight the wide-ranging utility of BUSCO assessments, which extend beyond quality control of genomics data sets to applications in comparative genomics analyses, gene predictor training, metagenomics, and phylogenomics.
引用
收藏
页码:543 / 548
页数:6
相关论文
共 30 条
[1]   Rodent phylogeny revised: analysis of six nuclear genes from all major rodent clades [J].
Blanga-Kanfi, Shani ;
Miranda, Hector ;
Penn, Osnat ;
Pupko, Tal ;
DeBry, Ronald W. ;
Huchon, Dorothee .
BMC EVOLUTIONARY BIOLOGY, 2009, 9
[2]   BLAST plus : architecture and applications [J].
Camacho, Christiam ;
Coulouris, George ;
Avagyan, Vahram ;
Ma, Ning ;
Papadopoulos, Jason ;
Bealer, Kevin ;
Madden, Thomas L. .
BMC BIOINFORMATICS, 2009, 10
[3]   trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses [J].
Capella-Gutierrez, Salvador ;
Silla-Martinez, Jose M. ;
Gabaldon, Toni .
BIOINFORMATICS, 2009, 25 (15) :1972-1973
[4]   Major Improvements to the Heliconius melpomene Genome Assembly Used to Confirm 10 Chromosome Fusion Events in 6 Million Years of Butterfly Evolution [J].
Davey, John W. ;
Chouteau, Mathieu ;
Barker, Sarah L. ;
Maroja, Luana ;
Baxter, Simon W. ;
Simpson, Fraser ;
Joron, Mathieu ;
Mallet, James ;
Dasmahapatra, Kanchon K. ;
Jiggins, Chris D. .
G3-GENES GENOMES GENETICS, 2016, 6 (03) :695-708
[5]   Accelerated Profile HMM Searches [J].
Eddy, Sean R. .
PLOS COMPUTATIONAL BIOLOGY, 2011, 7 (10)
[6]   Finding the missing honey bee genes: lessons learned from a genome upgrade [J].
Elsik, Christine G. ;
Worley, Kim C. ;
Bennett, Anna K. ;
Beye, Martin ;
Camara, Francisco ;
Childers, Christopher P. ;
de Graaf, Dirk C. ;
Debyser, Griet ;
Deng, Jixin ;
Devreese, Bart ;
Elhaik, Eran ;
Evans, Jay D. ;
Foster, Leonard J. ;
Graur, Dan ;
Guigo, Roderic ;
Hoff, Katharina Jasmin ;
Holder, Michael E. ;
Hudson, Matthew E. ;
Hunt, Greg J. ;
Jiang, Huaiyang ;
Joshi, Vandita ;
Khetani, Radhika S. ;
Kosarev, Peter ;
Kovar, Christie L. ;
Ma, Jian ;
Maleszka, Ryszard ;
Moritz, Robin F. A. ;
Munoz-Torres, Monica C. ;
Murphy, Terence D. ;
Muzny, Donna M. ;
Newsham, Irene F. ;
Reese, Justin T. ;
Robertson, Hugh M. ;
Robinson, Gene E. ;
Rueppell, Olav ;
Solovyev, Victor ;
Stanke, Mario ;
Stolle, Eckart ;
Tsuruda, Jennifer M. ;
Van Vaerenbergh, Matthias ;
Waterhouse, Robert M. ;
Weaver, Daniel B. ;
Whitfield, Charles W. ;
Wu, Yuanqing ;
Zdobnov, Evgeny M. ;
Zhang, Lan ;
Zhu, Dianhui ;
Gibbs, Richard A. .
BMC GENOMICS, 2014, 15
[7]   Phylogenomic Analysis of Spiders Reveals Nonmonophyly of Orb Weavers [J].
Fernandez, Rosa ;
Hormiga, Gustavo ;
Giribet, Gonzalo .
CURRENT BIOLOGY, 2014, 24 (15) :1772-1777
[8]   Extensive introgression in a malaria vector species complex revealed by phylogenomics [J].
Fontaine, Michael C. ;
Pease, James B. ;
Steele, Aaron ;
Waterhouse, Robert M. ;
Neafsey, Daniel E. ;
Sharakhov, Igor V. ;
Jiang, Xiaofang ;
Hall, Andrew B. ;
Catteruccia, Flaminia ;
Kakani, Evdoxia ;
Mitchell, Sara N. ;
Wu, Yi-Chieh ;
Smith, Hilary A. ;
Love, R. Rebecca ;
Lawniczak, Mara K. ;
Slotman, Michel A. ;
Emrich, Scott J. ;
Hahn, Matthew W. ;
Besansky, Nora J. .
SCIENCE, 2015, 347 (6217) :1258524
[9]   Multiple molecular evidences for a living mammalian fossil [J].
Huchon, Dorothee ;
Chevret, Pascale ;
Jordan, Ursula ;
Kilpatrick, C. William ;
Ranwez, Vincent ;
Jenkins, Paulina D. ;
Brosius, Juergen ;
Schmitz, Juergen .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (18) :7495-7499
[10]   Genomic Features of the Damselfly Calopteryx splendens Representing a Sister Clade to Most Insect Orders [J].
Ioannidis, Panagiotis ;
Simao, Felipe A. ;
Waterhouse, Robert M. ;
Manni, Mose ;
Seppey, Mathieu ;
Robertson, Hugh M. ;
Misof, Bernhard ;
Niehuis, Oliver ;
Zdobnov, Evgeny M. .
GENOME BIOLOGY AND EVOLUTION, 2017, 9 (02) :415-430