INFRASTRUCTURE FOR METAGENOME DATA MANAGEMENT AND ANALYSIS

Cited by: 0
Author
Tatusova, Tatiana [1]
Affiliation
[1] NIH, Natl Ctr Biotechnol Informat, Natl Lib Med, Bethesda, MD 20892 USA
Source
BIOINFORMATICS 2011 | 2011
Keywords
Database; Sequence analysis; Metagenomics; BLAST;
DOI
Not available
Chinese Library Classification number
R-058
Discipline classification code
Abstract
Metagenome sequencing projects are generating unprecedented amounts of data. Public sequence archive databases face large-scale data management challenges, including data storage and the quick search and retrieval of sequence data for further analysis. The sequence data is linked to a rich set of metadata attributes, such as geochemical and ecological parameters for environmental projects and clinical patient information for human microbiome studies. This complex collection of heterogeneous information has to be integrated, organized and presented to users in the most meaningful and useful way. For the last 20 years, the National Center for Biotechnology Information (NCBI) has been developing infrastructure that allows easy storage and distribution of various types of biomolecular data, as well as data integration and easy navigation in a complex information space. Here we describe the NCBI resources that are used for metagenomics data management.
Pages: 357-362
Page count: 6