The Sequence Read Archive

Cited by: 1848
Authors
Leinonen, Rasko [1]
Sugawara, Hideaki [2, 3]
Shumway, Martin [4]
Affiliations
[1] European Bioinformatics Institute, Cambridge CB10 1SD, England
[2] Research Organization of Information and Systems, Center for Information Biology, Mishima, Shizuoka 4118540, Japan
[3] Research Organization of Information and Systems, DNA Data Bank of Japan, National Institute of Genetics, Mishima, Shizuoka 4118540, Japan
[4] National Library of Medicine, National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA
Funding
Wellcome Trust (UK)
Keywords
FORMAT
DOI
10.1093/nar/gkq1019
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
The combination of significantly lower cost and increased speed of sequencing has resulted in an explosive growth of data submitted into the primary next-generation sequence data archive, the Sequence Read Archive (SRA). The preservation of experimental data is an important part of the scientific record, and increasing numbers of journals and funding agencies require that next-generation sequence data are deposited into the SRA. The SRA was established as a public repository for the next-generation sequence data and is operated by the International Nucleotide Sequence Database Collaboration (INSDC). INSDC partners include the National Center for Biotechnology Information (NCBI), the European Bioinformatics Institute (EBI) and the DNA Data Bank of Japan (DDBJ). The SRA is accessible at http://www.ncbi.nlm.nih.gov/Traces/sra from NCBI, at http://www.ebi.ac.uk/ena from EBI and at http://trace.ddbj.nig.ac.jp from DDBJ. In this article, we present the content and structure of the SRA, detail our support for sequencing platforms and provide recommended data submission levels and formats. We also briefly outline our response to the challenge of data growth.
Pages: D19-D21
Page count: 3