OPTIMAL FALSE DISCOVERY RATE CONTROL FOR LARGE SCALE MULTIPLE TESTING WITH AUXILIARY INFORMATION

被引:0
作者
Cao, Hongyuan [1 ]
Chen, Jun [2 ]
Zhang, Xianyang [3 ]
机构
[1] Florida State Univ, Dept Stat, Tallahassee, FL 32306 USA
[2] Mayo Clin, Dept Quantitat Hlth Sci, Rochester, MN USA
[3] Texas A&M Univ, Dept Stat, College Stn, TX 77843 USA
基金
美国国家卫生研究院;
关键词
EM algorithm; false discovery rate; isotonic regression; local false discovery rate; multiple testing; Pool-Adjacent-Violators algorithm; INCREASES DETECTION POWER; EMPIRICAL BAYES; HYPOTHESES; LIKELIHOOD;
D O I
10.1214/21-AOS2128
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Large-scale multiple testing is a fundamental problem in high dimensional statistical inference. It is increasingly common that various types of auxiliary information, reflecting the structural relationship among the hypotheses, are available. Exploiting such auxiliary information can boost statistical power. To this end, we propose a framework based on a two-group mixture model with varying probabilities of being null for different hypotheses a priori, where a shape-constrained relationship is imposed between the auxiliary information and the prior probabilities of being null. An optimal rejection rule is designed to maximize the expected number of true positives when average false discovery rate is controlled. Focusing on the ordered structure, we develop a robust EM algorithm to estimate the prior probabilities of being null and the distribution of p-values under the alternative hypothesis simultaneously. We show that the proposed method has better power than state-of-the-art competitors while controlling the false discovery rate, both empirically and theoretically. Extensive simulations demonstrate the advantage of the proposed method. Datasets from genome-wide association studies are used to illustrate the new methodology.
引用
收藏
页码:807 / 857
页数:51
相关论文
共 46 条
[21]  
IGNATIADIS N., 2017, COVARIATE POWERED WE
[22]  
Ignatiadis N, 2016, NAT METHODS, V13, P577, DOI [10.1038/NMETH.3885, 10.1038/nmeth.3885]
[23]   Estimating the proportion of true null hypotheses, with application to DNA microarray data [J].
Langaas, M ;
Lindqvist, BH ;
Ferkingstad, E .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2005, 67 :555-572
[24]  
LEI L, 2016, POWER ORDERED HYPOTH
[25]   A general interactive framework for false discovery rate control under structural constraints [J].
Lei, Lihua ;
Ramdas, Aaditya ;
Fithian, William .
BIOMETRIKA, 2021, 108 (02) :253-267
[26]   AdaPT: an interactive procedure for multiple testing with side information [J].
Lei, Lihua ;
Fithian, William .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2018, 80 (04) :649-679
[27]   Multiple testing with the structure-adaptive Benjamini-Hochberg algorithm [J].
Li, Ang ;
Barber, Rina Foygel .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2019, 81 (01) :45-74
[28]   Accumulation Tests for FDR Control in Ordered Hypothesis Testing [J].
Li, Ang ;
Barber, Rina Foygel .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (518) :837-849
[29]   Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2 [J].
Love, Michael I. ;
Huber, Wolfgang ;
Anders, Simon .
GENOME BIOLOGY, 2014, 15 (12)
[30]   The control of the false discovery rate in fixed sequence multiple testing [J].
Lynch, Gavin ;
Guo, Wenge ;
Sarkar, Sanat K. ;
Finner, Helmut .
ELECTRONIC JOURNAL OF STATISTICS, 2017, 11 (02) :4649-4673