Software for Computing and Annotating Genomic Ranges

被引:2532
作者
Lawrence, Michael [1 ]
Huber, Wolfgang [2 ,3 ]
Pages, Herve [4 ]
Aboyoun, Patrick [4 ]
Carlson, Marc [4 ]
Gentleman, Robert [1 ]
Morgan, Martin T. [4 ]
Carey, Vincent J. [5 ]
机构
[1] Genentech Inc, Bioinformat & Computat Biol, San Francisco, CA 94080 USA
[2] European Mol Biol Lab, Genome Biol Unit, D-69012 Heidelberg, Germany
[3] European Bioinformat Inst, Cambridge, England
[4] Fred Hutchinson Canc Res Ctr, Seattle, WA 98104 USA
[5] Harvard Univ, Brigham & Womens Hosp, Sch Med, Channing Div Network Med, Boston, MA 02115 USA
基金
美国国家卫生研究院;
关键词
BIOCONDUCTOR; PACKAGE;
D O I
10.1371/journal.pcbi.1003118
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
We describe Bioconductor infrastructure for representing and computing on annotated genomic ranges and integrating genomic data with the statistical computing features of R and its extensions. At the core of the infrastructure are three packages: IRanges, GenomicRanges, and GenomicFeatures. These packages provide scalable data structures for representing annotated ranges on the genome, with special support for transcript structures, read alignments and coverage vectors. Computational facilities include efficient algorithms for overlap and nearest neighbor detection, coverage calculation and other range operations. This infrastructure directly supports more than 80 other Bioconductor packages, including those for sequence analysis, differential expression analysis and visualization.
引用
收藏
页数:10
相关论文
共 15 条
  • [11] Morgan M.H. Pages., 2013, Rsamtools: Binary alignment (BAM), variant call (BCF)
  • [12] ShortRead: a bioconductor package for input, quality assessment and exploration of high-throughput sequence data
    Morgan, Martin
    Anders, Simon
    Lawrence, Michael
    Aboyoun, Patrick
    Pages, Herve
    Gentleman, Robert
    [J]. BIOINFORMATICS, 2009, 25 (19) : 2607 - 2608
  • [13] Pages H., 2013, Biostrings: string objects representing biological sequences and matching algorithms
  • [14] BEDTools: a flexible suite of utilities for comparing genomic features
    Quinlan, Aaron R.
    Hall, Ira M.
    [J]. BIOINFORMATICS, 2010, 26 (06) : 841 - 842
  • [15] Fast and SNP-tolerant detection of complex variants and splicing in short reads
    Wu, Thomas D.
    Nacu, Serban
    [J]. BIOINFORMATICS, 2010, 26 (07) : 873 - 881