Further Steps in TANGO: improved taxonomic assignment in metagenomics

被引:17
作者
Alonso-Alemany, Daniel [1 ]
Barre, Aurelien [2 ]
Beretta, Stefano [3 ]
Bonizzoni, Paola [3 ]
Nikolski, Macha [2 ,4 ]
Valiente, Gabriel [1 ]
机构
[1] Tech Univ Catalonia, Dept Software, E-08034 Barcelona, Spain
[2] Univ Bordeaux, Bordeaux Bioinformat Ctr CBiB, F-33000 Bordeaux, France
[3] Univ Milano Bicocca, Dipartimento Informat Sistemist & Comunicaz, I-20125 Milan, Italy
[4] Univ Bordeaux, Lab Bordelais Rech Informat CNRS LaBRI, F-33405 Talence, France
关键词
GUT MICROBIOME; CLASSIFICATION; CHALLENGES; DATABASE;
D O I
10.1093/bioinformatics/btt256
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: TANGO is one of the most accurate tools for the taxonomic assignment of sequence reads. However, because of the differences in the taxonomy structures, performing a taxonomic assignment on different reference taxonomies will produce divergent results. Results: We have improved the TANGO pipeline to be able to perform the taxonomic assignment of a metagenomic sample using alternative reference taxonomies, coming from different sources. We highlight the novel pre-processing step, necessary to accomplish this task, and describe the improvements in the assignment process. We present the new TANGO pipeline in details, and, finally, we show its performance on four real metagenomic datasets and also on synthetic datasets.
引用
收藏
页码:17 / 23
页数:7
相关论文
共 19 条
[1]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[2]  
[Anonymous], EMBNET J
[3]   Flexible taxonomic assignment of ambiguous sequencing reads [J].
Clemente, Jose C. ;
Jansson, Jesper ;
Valiente, Gabriel .
BMC BIOINFORMATICS, 2011, 12
[4]   The Ribosomal Database Project: improved alignments and new tools for rRNA analysis [J].
Cole, J. R. ;
Wang, Q. ;
Cardenas, E. ;
Fish, J. ;
Chai, B. ;
Farris, R. J. ;
Kulam-Syed-Mohideen, A. S. ;
McGarrell, D. M. ;
Marsh, T. ;
Garrity, G. M. ;
Tiedje, J. M. .
NUCLEIC ACIDS RESEARCH, 2009, 37 :D141-D145
[5]   Taxonomic binning of metagenome samples generated by next-generation sequencing technologies [J].
Droege, Johannes ;
McHardy, Alice C. .
BRIEFINGS IN BIOINFORMATICS, 2012, 13 (06) :646-655
[6]   The NCBI Taxonomy database [J].
Federhen, Scott .
NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) :D136-D143
[7]   MEGAN analysis of metagenomic data [J].
Huson, Daniel H. ;
Auch, Alexander F. ;
Qi, Ji ;
Schuster, Stephan C. .
GENOME RESEARCH, 2007, 17 (03) :377-386
[8]  
Knuth D.E., 1997, ART COMPUTER PROGRAM, V1, P334
[9]   Ultrafast clustering algorithms for metagenomic sequence analysis [J].
Li, Weizhong ;
Fu, Limin ;
Niu, Beifang ;
Wu, Sitao ;
Wooley, John .
BRIEFINGS IN BIOINFORMATICS, 2012, 13 (06) :656-668
[10]   Classification of metagenomic sequences: methods and challenges [J].
Mande, Sharmila S. ;
Mohammed, Monzoorul Haque ;
Ghosh, Tarini Shankar .
BRIEFINGS IN BIOINFORMATICS, 2012, 13 (06) :669-681