The control of the false discovery rate in fixed sequence multiple testing

被引:14
作者
Lynch, Gavin [1 ]
Guo, Wenge [2 ]
Sarkar, Sanat K. [3 ]
Finner, Helmut [4 ]
机构
[1] Catchpoint Syst Inc, New York, NY 10003 USA
[2] New Jersey Inst Technol, Dept Math Sci, Newark, NJ 07102 USA
[3] Temple Univ, Dept Stat Sci, Philadelphia, PA 19122 USA
[4] Heinrich Heine Univ Dusseldorf, Inst Biometr & Epidemiol, German Diabet Ctr DDZ, Leibniz Ctr Diabet Res, Hennekamp 65, D-40225 Dusseldorf, Germany
关键词
Arbitrary dependence; false discovery rate; fixed sequence; multiple testing; negative association; PRDS property; p-values; DATA-DRIVEN ORDER; GENE-EXPRESSION; FDR CONTROL; HYPOTHESES; FALLBACK;
D O I
10.1214/17-EJS1359
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Controlling false discovery rate (FDR) is a powerful approach to multiple testing. In many applications, the tested hypotheses have an inherent hierarchical structure. In this paper, we focus on the fixed sequence structure where the testing order of the hypotheses has been strictly specified in advance. We are motivated to study such a structure, since it is the most basic of hierarchical structure, yet it is often seen in real applications such as statistical process control and streaming data analysis. We first consider a conventional fixed sequence method that stops testing once an acceptance occurs, and develop such a method controlling FDR under both arbitrary and negative dependencies. The method under arbitrary dependency is shown to be unimprovable without losing control of FDR and, unlike existing FDR methods; it cannot be improved even by restricting to the usual positive regression dependence on subset (PRDS) condition. To account for any potential mistakes in the ordering of the tests, we extend the conventional fixed sequence method to one that allows more but a given number of acceptances. Simulation studies show that the proposed procedures can be powerful alternatives to existing FDR controlling procedures. The proposed procedures are illustrated through a real data set from a microarray experiment.
引用
收藏
页码:4649 / 4673
页数:25
相关论文
共 43 条
[1]   Generalized α-investing: definitions, optimality results and application to public databases [J].
Aharoni, Ehud ;
Rosset, Saharon .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2014, 76 (04) :771-794
[2]  
[Anonymous], 2022, Testing Statistical Hypotheses, DOI [DOI 10.1007/978-3-030-70578-7, 10.1007/978-3-030-70578-7]
[3]   CONTROLLING THE FALSE DISCOVERY RATE VIA KNOCKOFFS [J].
Barber, Rina Foygel ;
Candes, Emmanuel J. .
ANNALS OF STATISTICS, 2015, 43 (05) :2055-2085
[4]   A step-down multiple hypotheses testing procedure that controls the false discovery rate under independence [J].
Benjamini, Y ;
Liu, W .
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 1999, 82 (1-2) :163-170
[5]  
Benjamini Y, 2001, ANN STAT, V29, P1165
[6]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[7]   False discovery rates for spatial signals [J].
Benjamini, Ybav ;
Heller, Ruth .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2007, 102 (480) :1272-1281
[8]   Key multiplicity issues in clinical drug development [J].
Dmitrienko, Alex ;
D'Agostino, Ralph B., Sr. ;
Huque, Mohammad F. .
STATISTICS IN MEDICINE, 2013, 32 (07) :1079-1111
[9]   Microarrays, empirical Bayes and the two-groups model [J].
Efron, Bradley .
STATISTICAL SCIENCE, 2008, 23 (01) :1-22
[10]   FDR Control with Pseudo-Gatekeeping Based on a Possibly Data Driven Order of the Hypotheses [J].
Farcomeni, A. ;
Finos, L. .
BIOMETRICS, 2013, 69 (03) :606-613