BUSCO Applications from Quality Assessments to Gene Prediction and Phylogenomics

Cited by: 1461
Authors
Waterhouse, Robert M. [1, 2, 3]
Seppey, Mathieu [1, 2]
Simao, Felipe A. [1, 2]
Manni, Mose [1, 2]
Ioannidis, Panagiotis [1, 2]
Klioutchnikov, Guennadi [1, 2]
Kriventseva, Evgenia V. [1, 2]
Zdobnov, Evgeny M. [1, 2]
Affiliations
[1] Univ Geneva, Dept Genet Med & Dev, Med Sch, Geneva, Switzerland
[2] Swiss Inst Bioinformat, Geneva, Switzerland
[3] Univ Lausanne, Dept Ecol & Evolut, Lausanne, Switzerland
Funding
Swiss National Science Foundation
Keywords
transcriptomics; metagenomics; bioinformatics; evolution; GENOME; ORTHOLOGS; COMPLETENESS; IMPROVEMENTS; EVOLUTIONARY; BACTERIAL; ORTHODB; FUNGAL; PLANT; TOOL;
DOI
10.1093/molbev/msx319
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
Genomics promises comprehensive surveying of genomes and metagenomes, but rapidly changing technologies and expanding data volumes make evaluation of completeness a challenging task. Technical sequencing quality metrics can be complemented by quantifying completeness of genomic data sets in terms of the expected gene content of Benchmarking Universal Single-Copy Orthologs (BUSCO, http://busco.ezlab.org). The latest software release implements a complete refactoring of the code to make it more flexible and extendable to facilitate high-throughput assessments. The original six lineage assessment data sets have been updated with improved species sampling, 34 new subsets have been built for vertebrates, arthropods, fungi, and prokaryotes that greatly enhance resolution, and data sets are now also available for nematodes, protists, and plants. Here, we present BUSCO v3 with example analyses that highlight the wide-ranging utility of BUSCO assessments, which extend beyond quality control of genomics data sets to applications in comparative genomics analyses, gene predictor training, metagenomics, and phylogenomics.
Pages: 543-548
Page count: 6