BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs

Cited by: 8520
Authors
Simao, Felipe A.
Waterhouse, Robert M.
Ioannidis, Panagiotis
Kriventseva, Evgenia V.
Zdobnov, Evgeny M. [1]
Affiliations
[1] Univ Geneva, Sch Med, Dept Genet Med & Dev, CH-1211 Geneva, Switzerland
Funding
Swiss National Science Foundation;
Keywords
QUALITY; TOOL;
DOI
10.1093/bioinformatics/btv351
Chinese Library Classification
Q5 [Biochemistry];
Discipline classification codes
071010 ; 081704 ;
Abstract
Motivation: Genomics has revolutionized biological research, but quality assessment of the resulting assembled sequences is complicated and remains mostly limited to technical measures like N50. Results: We propose a measure for quantitative assessment of genome assembly and annotation completeness based on evolutionarily informed expectations of gene content. We implemented the assessment procedure in open-source software, with sets of Benchmarking Universal Single-Copy Orthologs, named BUSCO.
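For readers unfamiliar with the technical measure named in the abstract, the following is a minimal sketch of how N50 is commonly computed from contig lengths. It is not taken from the BUSCO software; the function name `n50` and the sample lengths are assumptions made for illustration only.

```python
from typing import Iterable


def n50(contig_lengths: Iterable[int]) -> int:
    """Return the N50 of an assembly (illustrative helper, not part of BUSCO).

    N50 is the length L such that contigs of length >= L together
    cover at least half of the total assembly length.
    """
    lengths = sorted(contig_lengths, reverse=True)
    if not lengths:
        raise ValueError("at least one contig length is required")

    half_total = sum(lengths) / 2
    running = 0
    # Walk from the longest contig down until half the assembly is covered.
    for length in lengths:
        running += length
        if running >= half_total:
            return length
    return lengths[-1]  # unreachable for positive lengths; kept for safety


if __name__ == "__main__":
    # Hypothetical contig lengths, chosen only to make the arithmetic easy to follow.
    print(n50([2, 3, 4, 5, 6]))
```

N50 summarizes contiguity only; it says nothing about whether expected genes are present, which is the gap the abstract's gene-content measure is meant to address.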
Pages: 3210-3212
Page count: 3
Related papers
11 in total
[1]   ALE: a generic assembly likelihood evaluation framework for assessing the accuracy of genome and metagenome assemblies [J].
Clark, Scott C. ;
Egan, Rob ;
Frazier, Peter I. ;
Wang, Zhong .
BIOINFORMATICS, 2013, 29 (04) :435-443
[2]   Accelerated Profile HMM Searches [J].
Eddy, Sean R. .
PLOS COMPUTATIONAL BIOLOGY, 2011, 7 (10)
[3]   QUAST: quality assessment tool for genome assemblies [J].
Gurevich, Alexey ;
Saveliev, Vladislav ;
Vyahhi, Nikolay ;
Tesler, Glenn .
BIOINFORMATICS, 2013, 29 (08) :1072-1075
[4]   REAPR: a universal tool for genome assembly evaluation [J].
Hunt, Martin ;
Kikuchi, Taisei ;
Sanders, Mandy ;
Newbold, Chris ;
Berriman, Matthew ;
Otto, Thomas D. .
GENOME BIOLOGY, 2013, 14 (05)
[5]   A novel hybrid gene prediction method employing protein multiple sequence alignments [J].
Keller, Oliver ;
Kollmar, Martin ;
Stanke, Mario ;
Waack, Stephan .
BIOINFORMATICS, 2011, 27 (06) :757-763
[6]   Mende, D. R. NATURE METHODS, 2013, 10: 881. DOI: 10.1038/nmeth.2575
[7]   CEGMA: a pipeline to accurately annotate core genes in eukaryotic genomes [J].
Parra, Genis ;
Bradnam, Keith ;
Korf, Ian .
BIOINFORMATICS, 2007, 23 (09) :1061-1067
[8]   Assessing the gene space in draft genomes [J].
Parra, Genis ;
Bradnam, Keith ;
Ning, Zemin ;
Keane, Thomas ;
Korf, Ian .
NUCLEIC ACIDS RESEARCH, 2009, 37 (01) :289-297
[9]   Exploring genome characteristics and sequence quality without a reference [J].
Simpson, Jared T. .
BIOINFORMATICS, 2014, 30 (09) :1228-1235
[10]   OrthoDB: a hierarchical catalog of animal, fungal and bacterial orthologs [J].
Waterhouse, Robert M. ;
Tegenfeldt, Fredrik ;
Li, Jia ;
Zdobnov, Evgeny M. ;
Kriventseva, Evgenia V. .
NUCLEIC ACIDS RESEARCH, 2013, 41 (D1) :D358-D365