Gene ontology analysis for RNA-seq: accounting for selection bias

被引:0
作者
Matthew D Young
Matthew J Wakefield
Gordon K Smyth
Alicia Oshlack
机构
[1] The Walter and Eliza Hall Institute of Medical Research,Bioinformatics Division
来源
Genome Biology | / 11卷
关键词
Gene Ontology; Androgen; LNCaP Cell; Differentially Express; Differentially Express Gene;
D O I
暂无
中图分类号
学科分类号
摘要
We present GOseq, an application for performing Gene Ontology (GO) analysis on RNA-seq data. GO analysis is widely used to reduce complexity and highlight biological processes in genome-wide expression studies, but standard methods give biased results on RNA-seq data due to over-detection of differential expression for long and highly expressed transcripts. Application of GOseq to a prostate cancer data set shows that GOseq dramatically changes the results, highlighting categories more consistent with the known biology.
引用
收藏
相关论文
共 145 条
  • [1] Fu X(2009)Estimating accuracy of RNA-Seq and microarrays with proteomics. BMC Genomics 10 161-476
  • [2] Fu N(2008)Alternative isoform regulation in human tissue transcriptomes. Nature 456 470-986
  • [3] Guo S(2008)Transcriptome-wide identification of novel imprinted genes in neonatal mouse brain. PloS One 3 e3839-1465
  • [4] Yan Z(2009)Large-scale mRNA sequencing determines global regulation of RNA editing during brain development. Genome Res 19 978-1607
  • [5] Xu Y(2009)Transcript length bias in RNA-seq data confounds systems biology. Biol Direct 4 14-15550
  • [6] Hu H(2007)EasyGO: Gene Ontology-based annotation and functional enrichment analysis tool for agronomical species. BMC Genomics 8 246-57
  • [7] Menzel C(2003)GoMiner: a resource for biological interpretation of genomic and proteomic data. Genome biology 4 R28-20184
  • [8] Chen W(2004)GOstat: find statistically overrepresented Gene Ontologies within a group of genes. Bioinformatics 20 1464-29
  • [9] Li Y(2004)GOToolBox: functional analysis of gene datasets based on Gene Ontology. Genome Biol 5 R101-2887
  • [10] Zeng R(2006)Improved scoring of functional groups from gene expression data by decorrelating GO graph structure. Bioinformatics 22 1600-332