Gene ontology analysis for RNA-seq: accounting for selection bias

被引:5505
作者
Young, Matthew D. [1 ]
Wakefield, Matthew J. [1 ]
Smyth, Gordon K. [1 ]
Oshlack, Alicia [1 ]
机构
[1] Walter & Eliza Hall Inst Med Res, Bioinformat Div, Parkville, Vic 3052, Australia
关键词
ENRICHMENT ANALYSIS; TOOL;
D O I
10.1186/gb-2010-11-2-r14
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
We present GOseq, an application for performing Gene Ontology (GO) analysis on RNA-seq data. GO analysis is widely used to reduce complexity and highlight biological processes in genome-wide expression studies, but standard methods give biased results on RNA-seq data due to over-detection of differential expression for long and highly expressed transcripts. Application of GOseq to a prostate cancer data set shows that GOseq dramatically changes the results, highlighting categories more consistent with the known biology.
引用
收藏
页数:12
相关论文
共 32 条
[1]   Improved scoring of functional groups from gene expression data by decorrelating GO graph structure [J].
Alexa, Adrian ;
Rahnenfuehrer, Joerg ;
Lengauer, Thomas .
BIOINFORMATICS, 2006, 22 (13) :1600-1607
[2]  
[Anonymous], 1963, J R STAT SOC SER B M
[3]  
[Anonymous], R: The R project for statistical computing
[4]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[5]   GOstat: find statistically overrepresented Gene Ontologies within a group of genes [J].
Beissbarth, T ;
Speed, TP .
BIOINFORMATICS, 2004, 20 (09) :1464-1465
[6]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[7]  
*BIOMART, ENS BIOMART
[8]  
DePrimo SE, 2002, GENOME BIOL, V3
[9]   Weighted random sampling with a reservoir [J].
Efraimidis, PS ;
Spirakis, PG .
INFORMATION PROCESSING LETTERS, 2006, 97 (05) :181-185
[10]   The development of androgen-independent prostate cancer [J].
Feldman, BJ ;
Feldman, D .
NATURE REVIEWS CANCER, 2001, 1 (01) :34-45