Tweedie's Formula and Selection Bias

被引:228
作者
Efron, Bradley [1 ]
机构
[1] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
基金
美国国家科学基金会;
关键词
Bayesian relevance; Empirical Bayes information; False discovery rates; James-Stein; Regret; Winner's curse; EMPIRICAL BAYES; WINNERS CURSE; ODDS RATIOS; ESTIMATORS; HYPOTHESIS; GENOMEWIDE; INFERENCE; VALUES;
D O I
10.1198/jasa.2011.tm11181
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We suppose that the statistician observes some large number of estimates z(i), each with its own unobserved expectation parameter mu(i). The largest few of the z(i)'s are likely to substantially overestimate their corresponding mu(i)'s, this being an example of selection bias, or regression to the mean. Tweedie's formula, first reported by Robbins in 1956, offers a simple empirical Bayes approach for correcting selection bias. This article investigates its merits and limitations. In addition to the methodology, Tweedie's formula raises more general questions concerning empirical Bayes theory, discussed here as "relevance" and "empirical Bayes information." There is a close connection between applications of the formula and James-Stein estimation.
引用
收藏
页码:1602 / 1614
页数:13
相关论文
共 24 条
[1]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[2]  
BERGER JO, 1987, J AM STAT ASSOC, V82, P112, DOI 10.2307/2289131
[3]   ADMISSIBLE ESTIMATORS, RECURRENT DIFFUSIONS, AND INSOLUBLE BOUNDARY VALUE PROBLEMS [J].
BROWN, LD .
ANNALS OF MATHEMATICAL STATISTICS, 1971, 42 (03) :855-&
[4]  
Dawid AP, 1994, INST MATH S, V24, P211, DOI 10.1214/lnms/1215463797
[5]  
Efron B., 2010, I MATH STAT MONOGRAP, V1
[6]   SIMULTANEOUS INFERENCE: WHEN SHOULD HYPOTHESIS TESTING PROBLEMS BE COMBINED? [J].
Efron, Bradley .
ANNALS OF APPLIED STATISTICS, 2008, 2 (01) :197-223
[7]   Microarrays, empirical Bayes and the two-groups model [J].
Efron, Bradley .
STATISTICAL SCIENCE, 2008, 23 (01) :1-22
[8]   False discovery rates and copy number variation [J].
Efron, Bradley ;
Zhang, Nancy R. .
BIOMETRIKA, 2011, 98 (02) :251-271
[9]  
Efron B, 2010, J AM STAT ASSOC, V105, P1042, DOI 10.1198/jasa.2010.tm09129
[10]   Empirical Bayes Estimates for Large-Scale Prediction Problems [J].
Efron, Bradley .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2009, 104 (487) :1015-1028