Rotation Testing in Gene Set Enrichment Analysis for Small Direct Comparison Experiments

被引:24
作者
Dorum, Guro [1 ]
Snipen, Lars [1 ]
Solheim, Margrete [1 ]
Saebo, Solve [1 ]
机构
[1] Norwegian Univ Life Sci, As, Norway
关键词
gene set analysis; gene expression; microarray data analysis; ENTEROCOCCUS-FAECALIS V583; MICROARRAY DATA; EXPRESSION DATA; BILE;
D O I
10.2202/1544-6115.1418
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Gene Set Enrichment Analysis (GSEA) is a method for analysing gene expression data with a focus on a priori defined gene sets. The permutation test generally used in GSEA for testing the significance of gene set enrichment involves permutation of a phenotype vector and is developed for data from an indirect comparison design, i.e. unpaired data. In some studies the samples representing two phenotypes are paired, e.g. samples taken from a patient before and after treatment, or if samples representing two phenotypes are hybridised to the same two-channel array (direct comparison design). In this paper we will focus on data from direct comparison experiments, but the methods can be applied to paired data in general. For these types of data, a standard permutation test for paired data that randomly re-signs samples can be used. However, if the sample size is very small, which is often the case for a direct comparison design, a permutation test will give very imprecise estimates of the p-values. Here we propose using a rotation test rather than a permutation test for estimation of significance in GSEA of direct comparison data with a limited number of samples. Our proposed rotation test makes GSEA applicable to direct comparison data with few samples, by depending on rotations of the data instead of permutations. The rotation test is a generalisation of the permutation test, and can in addition be used on indirect comparison data and for testing significance of other types of test statistics outside the GSEA framework.
引用
收藏
页数:25
相关论文
共 27 条
[1]   Transcriptional response of Enterococcus faecalis V583 to erythromycin [J].
Aakra, Å ;
Vebo, H ;
Snipen, L ;
Hirt, H ;
Aastveit, A ;
Kapur, V ;
Dunny, G ;
Murray, B ;
Nes, IF .
ANTIMICROBIAL AGENTS AND CHEMOTHERAPY, 2005, 49 (06) :2246-2259
[2]  
[Anonymous], 1985, ENCY STAT SCI
[3]  
[Anonymous], APPL STATIST
[4]   Electrochemistry beyond Mn4+ in LixMn1-yLiyO2 [J].
Armstrong, AR ;
Bruce, PG .
ELECTROCHEMICAL AND SOLID STATE LETTERS, 2004, 7 (01) :A1-A4
[5]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[6]   The interaction between bacteria and bile [J].
Begley, M ;
Gahan, CGM ;
Hill, C .
FEMS MICROBIOLOGY REVIEWS, 2005, 29 (04) :625-651
[7]  
Box G. E., 1978, Statistics for experimenters, V664
[8]   DNA micro-array-based identification of bile-responsive genes in Lactobacillus plantarum [J].
Bron, PA ;
Molenaar, D ;
Vos, WM ;
Kleerebezem, M .
JOURNAL OF APPLIED MICROBIOLOGY, 2006, 100 (04) :728-738
[9]   Empirical Bayes analysis of a microarray experiment [J].
Efron, B ;
Tibshirani, R ;
Storey, JD ;
Tusher, V .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1151-1160
[10]   ON TESTING THE SIGNIFICANCE OF SETS OF GENES [J].
Efron, Bradley ;
Tibshirani, Robert .
ANNALS OF APPLIED STATISTICS, 2007, 1 (01) :107-129