Optimal control of false discovery criteria in the two-group model

被引:16
作者
Heller, Ruth [1 ]
Rosset, Saharon [1 ]
机构
[1] Tel Aviv Univ, Dept Stat & Operat Res, IL-6997801 Tel Aviv, Israel
关键词
false discovery rate; infinite linear programming; large‐ scale inference; multiple testing; positive FDR; EMPIRICAL BAYES; MICROARRAYS; PROPORTION; NULL;
D O I
10.1111/rssb.12403
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The highly influential two-group model in testing a large number of statistical hypotheses assumes that the test statistics are drawn independently from a mixture of a high probability null distribution and a low probability alternative. Optimal control of the marginal false discovery rate (mFDR), in the sense that it provides maximal power (expected true discoveries) subject to mFDR control, is known to be achieved by thresholding the local false discovery rate (locFDR), the probability of the hypothesis being null given the set of test statistics, with a fixed threshold. We address the challenge of controlling optimally the popular false discovery rate (FDR) or positive FDR (pFDR) in the general two-group model, which also allows for dependence between the test statistics. These criteria are less conservative than the mFDR criterion, so they make more rejections in expectation. We derive their optimal multiple testing (OMT) policies, which turn out to be thresholding the locFDR with a threshold that is a function of the entire set of statistics. We develop an efficient algorithm for finding these policies, and use it for problems with thousands of hypotheses. We illustrate these procedures on gene expression studies.
引用
收藏
页码:133 / 155
页数:23
相关论文
共 25 条
[1]   ADEPTUS: a discovery tool for disease prediction, enrichment and network analysis based on profiles from many diseases [J].
Amar, David ;
Vizel, Amir ;
Levy, Carmit ;
Shamir, Ron .
BIOINFORMATICS, 2018, 34 (11) :1959-1961
[2]  
[Anonymous], 2022, Testing Statistical Hypotheses, DOI DOI 10.1007/978-3-030-70578-7
[3]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[4]   Comment: Microarrays, empirical Bayes and the two-groups model [J].
Benjamini, Yoav ;
Cai, T. Tony ;
Morris, Carl N. ;
Rice, Kenneth ;
Spiegelhalter, David ;
Efron, Bradley .
STATISTICAL SCIENCE, 2008, 23 (01) :23-47
[5]   Adaptive linear step-up procedures that control the false discovery rate [J].
Benjamini, Yoav ;
Krieger, Abba M. ;
Yekutieli, Daniel .
BIOMETRIKA, 2006, 93 (03) :491-507
[6]  
Blanchard G, 2009, J MACH LEARN RES, V10, P2837
[7]   Covariate-assisted ranking and screening for large-scale two-sample inference [J].
Cai, T. Tony ;
Sun, Wenguang ;
Wang, Weinan .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2019, 81 (02) :187-234
[8]   Optimal screening and discovery of sparse signals with applications to multistage high throughput studies [J].
Cai, T. Tony ;
Sun, Wenguang .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2017, 79 (01) :197-223
[9]   Empirical Bayes analysis of a microarray experiment [J].
Efron, B ;
Tibshirani, R ;
Storey, JD ;
Tusher, V .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1151-1160
[10]   Microarrays, empirical Bayes and the two-groups model [J].
Efron, Bradley .
STATISTICAL SCIENCE, 2008, 23 (01) :1-22