Toward richer metadata for microbial sequences: replacing strain-level NCBI taxonomy taxids with BioProject, BioSample and Assembly records

Cited by: 31
Authors
Federhen, Scott [1]
Clark, Karen [1]
Barrett, Tanya [1]
Parkinson, Helen [2]
Ostell, James [1]
Kodama, Yuichi [3]
Mashima, Jun [3]
Nakamura, Yasukazu [3]
Cochrane, Guy [2]
Karsch-Mizrachi, Ilene [1]
Affiliations
[1] NIH, Natl Ctr Biotechnol Informat, Natl Lib Med, Bethesda, MD 20892 USA
[2] European Bioinformat Inst, European Mol Biol Lab, Hinxton, England
[3] Res Org Informat & Syst, Natl Inst Genet, DDBJ Ctr, Mishima, Shizuoka 4118540, Japan
Keywords
DATABASE; GENOMICS; ARCHIVE; UPDATE
DOI
10.4056/sigs.4851102
Chinese Library Classification (CLC) number
Q3 [Genetics]
Subject classification codes
071007; 090102
Abstract
Microbial genome sequence submissions to the International Nucleotide Sequence Database Collaboration (INSDC) have been annotated with organism names that include the strain identifier. Each of these strain-level names has been assigned a unique 'taxid' in the NCBI Taxonomy Database. With the significant growth in genome sequencing, it is not possible to continue with the curation of strain-level taxids. In January 2014, NCBI will cease assigning strain-level taxids. Instead, submitters are encouraged to provide strain information and rich metadata with their submission to the sequence database, BioProject and BioSample. Copyright (C) retained by original authors.
Pages: 1275-1277
Page count: 5