ROAST: rotation gene set tests for complex microarray experiments

被引:392
作者
Wu, Di [1 ,2 ]
Lim, Elgene [1 ]
Vaillant, Francois [1 ]
Asselin-Labat, Marie-Liesse [1 ]
Visvader, Jane E. [1 ,2 ]
Smyth, Gordon K. [1 ,2 ]
机构
[1] Walter & Eliza Hall Inst Med Res, Parkville, Vic 3052, Australia
[2] Univ Melbourne, Melbourne, Vic 3010, Australia
基金
英国医学研究理事会;
关键词
ENRICHMENT ANALYSIS; DIFFERENTIAL EXPRESSION; PATHWAY ANALYSIS;
D O I
10.1093/bioinformatics/btq401
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: A gene set test is a differential expression analysis in which a P-value is assigned to a set of genes as a unit. Gene set tests are valuable for increasing statistical power, organizing and interpreting results and for relating expression patterns across different experiments. Existing methods are based on permutation. Methods that rely on permutation of probes unrealistically assume independence of genes, while those that rely on permutation of sample are suitable only for two-group comparisons with a good number of replicates in each group. Results: We present ROAST, a statistically rigorous gene set test that allows for gene-wise correlation while being applicable to almost any experimental design. Instead of permutation, ROAST uses rotation, a Monte Carlo technology for multivariate regression. Since the number of rotations does not depend on sample size, ROAST gives useful results even for experiments with minimal replication. ROAST allows for any experimental design that can be expressed as a linear model, and can also incorporate array weights and correlated samples. ROAST can be tuned for situations in which only a subset of the genes in the set are actively involved in the molecular pathway. ROAST can test for uni- or bi-direction regulation. Probes can also be weighted to allow for prior importance. The power and size of the ROAST procedure is demonstrated in a simulation study, and compared to that of a representative permutation method. Finally, ROAST is used to test the degree of transcriptional conservation between human and mouse mammary stems.
引用
收藏
页码:2176 / 2182
页数:7
相关论文
共 30 条
[1]   A general modular framework for gene set enrichment analysis [J].
Ackermann, Marit ;
Strimmer, Korbinian .
BMC BIOINFORMATICS, 2009, 10
[2]   Pathway analysis of Microarray data via regression [J].
Adewale, A. J. ;
Dinu, I. ;
Potter, J. D. ;
Liu, Q. ;
Yasui, Y. .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2008, 15 (03) :269-277
[3]  
[Anonymous], 1963, J R STAT SOC SER B M
[4]   Microarray analysis after RNA amplification can detect pronounced differences in gene expression using limma [J].
Diboun, Ilhem ;
Wernisch, Lorenz ;
Orengo, Christine Anne ;
Koltzenburg, Martin .
BMC GENOMICS, 2006, 7 (1)
[5]   Improving gene set analysis of microarray data by SAM-GS [J].
Dinu, Irina ;
Potter, John D. ;
Mueller, Thomas ;
Liu, Qi ;
Adewale, Adeniyi J. ;
Jhangri, Gian S. ;
Einecke, Gunilla ;
Famulski, Konrad S. ;
Halloran, Philip ;
Yasui, Yutaka .
BMC BIOINFORMATICS, 2007, 8 (1)
[6]   Rotation Testing in Gene Set Enrichment Analysis for Small Direct Comparison Experiments [J].
Dorum, Guro ;
Snipen, Lars ;
Solheim, Margrete ;
Saebo, Solve .
STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2009, 8 (01)
[7]   ON TESTING THE SIGNIFICANCE OF SETS OF GENES [J].
Efron, Bradley ;
Tibshirani, Robert .
ANNALS OF APPLIED STATISTICS, 2007, 1 (01) :107-129
[8]   Permutation methods: A basis for exact inference [J].
Ernst, MD .
STATISTICAL SCIENCE, 2004, 19 (04) :676-685
[9]   Analyzing gene expression data in terms of gene sets:: methodological issues [J].
Goeman, Jelle J. ;
Buehlmann, Peter .
BIOINFORMATICS, 2007, 23 (08) :980-987
[10]   A global test for groups of genes: testing association with a clinical outcome [J].
Goeman, JJ ;
van de Geer, SA ;
de Kort, F ;
van Houwelingen, HC .
BIOINFORMATICS, 2004, 20 (01) :93-99