Efficient estimation of the number of false positives in high-throughput screening

Cited by: 1
Authors
Rootzen, Holger [1]
Zholud, Dmitrii
Affiliations
[1] Chalmers Univ Technol, Math Sci, SE-41296 Gothenburg, Sweden
Funding
Swedish Research Council;
Keywords
Correction of p-values; Extreme value statistics; False discovery rate; High-throughput screening; Multiple testing; Positive false discovery rate; SmartTail; DISCOVERY RATE; EMPIRICAL BAYES; NULL;
DOI
10.1093/biomet/asv015
Chinese Library Classification
Q [Biological Sciences];
Subject classification codes
07 ; 0710 ; 09 ;
Abstract
This paper develops tail estimation methods to handle false positives in multiple testing problems where testing is done at extreme significance levels and with low degrees of freedom, and where the true null distribution may differ from the theoretical one. We show that the number of false positives, conditional on the total number of positives, has an approximately binomial distribution, and we find estimators of the distribution parameter. We also develop methods for estimation of the true null distribution, as well as techniques to compare it with the theoretical one. Analysis is based on a simple polynomial model for very small p-values. Asymptotics that motivate the model, properties of the estimators, and model-checking tools are provided. The methods are applied to two large genomic studies and an fMRI brain scan experiment.
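The two modelling ideas in the abstract can be illustrated with a minimal sketch. The abstract says only that the analysis rests on "a simple polynomial model for very small p-values", so the specific form used here, P(p <= x) ≈ c * x**gamma for x at or below a threshold u, is an assumption made for illustration and not necessarily the paper's exact parameterization. The function names (`fit_power_tail`, `tail_prob`, `binomial_pmf`) and the threshold `u` are likewise hypothetical. Under the assumed form, -log(p/u) for p <= u is exponentially distributed with rate gamma, which gives a closed-form maximum likelihood estimate.

```python
import math
import random


def fit_power_tail(p_values, u):
    """Fit the ASSUMED tail model P(p <= x) ~ c * x**gamma for x <= u.

    Returns (gamma_hat, c_hat). Illustrative only; not the paper's estimator.
    """
    tail = [p for p in p_values if 0.0 < p <= u]
    if not tail:
        raise ValueError("no p-values at or below the threshold u")
    # Under the assumed model, -log(p / u) given p <= u is Exp(gamma),
    # so the MLE of gamma is the reciprocal of the sample mean.
    log_sum = sum(-math.log(p / u) for p in tail)
    if log_sum == 0.0:
        raise ValueError("all tail p-values equal u; gamma is not identifiable")
    gamma_hat = len(tail) / log_sum
    # Choose c so the fitted tail matches the empirical fraction at x = u.
    c_hat = (len(tail) / len(p_values)) / u ** gamma_hat
    return gamma_hat, c_hat


def tail_prob(x, gamma_hat, c_hat):
    """Fitted probability that a p-value falls at or below x (x <= u)."""
    return c_hat * x ** gamma_hat


def binomial_pmf(k, n, prob):
    """P(K = k) for K ~ Binomial(n, prob): here, k false positives among
    n positives when each positive is false with probability prob."""
    return math.comb(n, k) * prob ** k * (1.0 - prob) ** (n - k)


if __name__ == "__main__":
    random.seed(0)
    # Simulated p-values from the theoretical (uniform) null: gamma = 1, c = 1.
    null_p = [random.random() for _ in range(200_000)]
    gamma_hat, c_hat = fit_power_tail(null_p, u=0.01)
    print("gamma_hat:", gamma_hat, "c_hat:", c_hat)
    # Distribution of the false-positive count among 20 positives,
    # for a hypothetical binomial parameter of 0.3.
    print([round(binomial_pmf(k, 20, 0.3), 4) for k in range(6)])
```

A fitted exponent close to 1 is what the uniform theoretical null would give under this assumed form, so a clear departure from 1 in real data would be one rough indication that the true null distribution differs from the theoretical one.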
Pages: 695-704
Page count: 10