Large-scale multiple testing under dependence

被引:152
作者
Sun, Wenguang [1 ]
Cai, T. Tony [1 ]
机构
[1] Univ Penn, Wharton Sch, Dept Stat, Philadelphia, PA 19104 USA
关键词
Compound decision problem; False discovery rate; Hidden Markov models; Local significance index; Multiple testing under dependence; FALSE DISCOVERY RATE; MAXIMUM-LIKELIHOOD-ESTIMATION; COMPOUND DECISION RULES; HIDDEN MARKOV-MODELS; GENE-EXPRESSION; PROBABILISTIC FUNCTIONS; ORDER ESTIMATION; ESTIMATOR; NUMBER; RATES;
D O I
10.1111/j.1467-9868.2008.00694.x
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The paper considers the problem of multiple testing under dependence in a compound decision theoretic framework. The observed data are assumed to be generated from an underlying two-state hidden Markov model. We propose oracle and asymptotically optimal data-driven procedures that aim to minimize the false non-discovery rate FNR subject to a constraint on the false discovery rate FDR. It is shown that the performance of a multiple-testing procedure can be substantially improved by adaptively exploiting the dependence structure among hypotheses, and hence conventional FDR procedures that ignore this structural information are inefficient. Both theoretical properties and numerical performances of the procedures proposed are investigated. It is shown that the procedures proposed control FDR at the desired level, enjoy certain optimality properties and are especially powerful in identifying clustered non-null cases. The new procedure is applied to an influenza-like illness surveillance study for detecting the timing of epidemic periods.
引用
收藏
页码:393 / 424
页数:32
相关论文
共 47 条
[1]  
[Anonymous], GEOGR ANAL
[2]  
[Anonymous], 1985, Applied Linear Regression, DOI DOI 10.1002/BIMJ.4710300746
[3]  
ATKINSON AC, 1973, J ROY STAT SOC B MET, V35, P473
[4]   STATISTICAL INFERENCE FOR PROBABILISTIC FUNCTIONS OF FINITE STATE MARKOV CHAINS [J].
BAUM, LE ;
PETRIE, T .
ANNALS OF MATHEMATICAL STATISTICS, 1966, 37 (06) :1554-&
[5]   A MAXIMIZATION TECHNIQUE OCCURRING IN STATISTICAL ANALYSIS OF PROBABILISTIC FUNCTIONS OF MARKOV CHAINS [J].
BAUM, LE ;
PETRIE, T ;
SOULES, G ;
WEISS, N .
ANNALS OF MATHEMATICAL STATISTICS, 1970, 41 (01) :164-&
[6]  
Benjamini Y, 2001, ANN STAT, V29, P1165
[7]  
Benjamini Y, 2000, J EDUC BEHAV STAT, V25, P60, DOI 10.2307/1165312
[8]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[9]   False discovery rates for spatial signals [J].
Benjamini, Ybav ;
Heller, Ruth .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2007, 102 (480) :1272-1281
[10]  
Bickel P. J., 1996, Bernoulli, V2, P199