Estimating the null and the proportion of nonnull effects in large-scale multiple comparisons

被引：122

作者：

Jin, Jiashun ^{[1
]}

Cai, T. Tony

机构：

[1] Purdue Univ, Dept Stat, W Lafayette, IN 47907 USA

[2] Univ Penn, Dept Stat, Wharton Sch, Philadelphia, PA 19104 USA

来源：

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION | 2007年 / 102卷 / 478期

基金：

美国国家科学基金会;

关键词：

characteristic functions; empirical characteristic function; Fourier coefficients; multiple testing; null distribution; proportion of nonnull effects;

D O I：

10.1198/016214507000000167

中图分类号：

O21 [概率论与数理统计]; C8 [统计学];

学科分类号：

020208 ; 070103 ; 0714 ;

摘要：

An important issue raised by Efron in the context of large-scale multiple comparisons is that in many applications, the usual assumption that the null distribution is known is incorrect, and seemingly negligible differences in the null may result in large differences in subsequent studies. This suggests that a careful study of estimation of the null is indispensable. In this article we consider the problem of estimating a null normal distribution, and a closely related problem, estimation of the proportion of nonnull effects. We develop an approach based on the empirical characteristic function and Fourier analysis. The estimators are shown to be uniformly consistent over a wide class of parameters. We investigate the numerical performance of the estimators using both simulated and real data. In particular, we apply our procedure to the analysis of breast cancer and human immunodeficiency virus microarray datasets. The estimators perform favorably compared with existing methods.

引用

页码：495 / 506

页数：12

共 21 条

[1] Adapting to unknown sparsity by controlling the false discovery rate [J].