Estimating the null and the proportion of nonnull effects in large-scale multiple comparisons

被引:122
作者
Jin, Jiashun [1 ]
Cai, T. Tony
机构
[1] Purdue Univ, Dept Stat, W Lafayette, IN 47907 USA
[2] Univ Penn, Dept Stat, Wharton Sch, Philadelphia, PA 19104 USA
基金
美国国家科学基金会;
关键词
characteristic functions; empirical characteristic function; Fourier coefficients; multiple testing; null distribution; proportion of nonnull effects;
D O I
10.1198/016214507000000167
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
An important issue raised by Efron in the context of large-scale multiple comparisons is that in many applications, the usual assumption that the null distribution is known is incorrect, and seemingly negligible differences in the null may result in large differences in subsequent studies. This suggests that a careful study of estimation of the null is indispensable. In this article we consider the problem of estimating a null normal distribution, and a closely related problem, estimation of the proportion of nonnull effects. We develop an approach based on the empirical characteristic function and Fourier analysis. The estimators are shown to be uniformly consistent over a wide class of parameters. We investigate the numerical performance of the estimators using both simulated and real data. In particular, we apply our procedure to the analysis of breast cancer and human immunodeficiency virus microarray datasets. The estimators perform favorably compared with existing methods.
引用
收藏
页码:495 / 506
页数:12
相关论文
共 21 条
[1]   Adapting to unknown sparsity by controlling the false discovery rate [J].
Abramovich, Felix ;
Benjamini, Yoav ;
Donoho, David L. ;
Johnstone, Iain M. .
ANNALS OF STATISTICS, 2006, 34 (02) :584-653
[2]  
[Anonymous], 1998, PHYS A
[3]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[4]  
Bosq D., 1998, Nonparametric Statistics for Stochastic Processes: Estimation and Prediction
[5]  
CAI T, 2007, IN PRESS ANN STAT
[6]   Statistical tests for differential expression in cDNA microarray experiments [J].
Cui, XQ ;
Churchill, GA .
GENOME BIOLOGY, 2003, 4 (04)
[7]   Higher criticism for detecting sparse heterogeneous mixtures [J].
Donoho, D ;
Jin, JS .
ANNALS OF STATISTICS, 2004, 32 (03) :962-994
[8]   Large-scale simultaneous hypothesis testing: The choice of a null hypothesis [J].
Efron, B .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2004, 99 (465) :96-104
[9]   Empirical Bayes analysis of a microarray experiment [J].
Efron, B ;
Tibshirani, R ;
Storey, JD ;
Tusher, V .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1151-1160
[10]   ON THE OPTIMAL RATES OF CONVERGENCE FOR NONPARAMETRIC DECONVOLUTION PROBLEMS [J].
FAN, JQ .
ANNALS OF STATISTICS, 1991, 19 (03) :1257-1272