Manipulation of FASTQ data with Galaxy

被引:491
作者
Blankenberg, Daniel [3 ]
Gordon, Assaf [4 ]
Von Kuster, Gregory [3 ]
Coraor, Nathan [3 ]
Taylor, James [1 ,2 ]
Nekrutenko, Anton [3 ]
机构
[1] Emory Univ, Dept Biol, Atlanta, GA 30322 USA
[2] Emory Univ, Dept Math & Comp Sci, Atlanta, GA 30322 USA
[3] Penn State Univ, Huck Inst Life Sci, University Pk, PA 16803 USA
[4] Cold Spring Harbor Lab, Howard Hughes Med Inst, Watson Sch Biol Sci, Cold Spring Harbor, NY 11724 USA
基金
美国国家科学基金会;
关键词
D O I
10.1093/bioinformatics/btq281
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Here, we describe a tool suite that functions on all of the commonly known FASTQ format variants and provides a pipeline for manipulating next generation sequencing data taken from a sequencing machine all the way through the quality filtering steps.
引用
收藏
页码:1783 / 1785
页数:3
相关论文
共 4 条
  • [1] A framework for collaborative analysis of ENCODE data: Making large-scale analyses biologist-friendly
    Blankenberg, Daniel
    Taylor, James
    Schenck, Ian
    He, Jianbin
    Zhang, Yi
    Ghent, Matthew
    Veeraraghavan, Narayanan
    Albert, Istvan
    Miller, Webb
    Makova, Kateryna D.
    Hardison, Ross C.
    Nekrutenko, Anton
    [J]. GENOME RESEARCH, 2007, 17 (06) : 960 - 964
  • [2] Blankenberg Daniel, 2010, Curr Protoc Mol Biol, VChapter 19, DOI 10.1002/0471142727.mb1910s89
  • [3] The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants
    Cock, Peter J. A.
    Fields, Christopher J.
    Goto, Naohisa
    Heuer, Michael L.
    Rice, Peter M.
    [J]. NUCLEIC ACIDS RESEARCH, 2010, 38 (06) : 1767 - 1771
  • [4] Taylor James, 2007, Curr Protoc Bioinformatics, VChapter 10, DOI 10.1002/0471250953.bi1005s19