Large-scale simultaneous hypothesis testing: The choice of a null hypothesis

被引:660
作者
Efron, B [1 ]
机构
[1] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
关键词
empirical Bayes; empirical null hypothesis; local false discovery rate; microarray analysis; unobserved covariates;
D O I
10.1198/016214504000000089
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Current scientific techniques in genomics and image processing routinely produce hypothesis testing problems with hundreds or thousands of cases to consider simultaneously. This poses new difficulties for the statistician, but also opens new opportunities. In particular, it allows empirical estimation of an appropriate null hypothesis. The empirical null may be considerably more dispersed than the usual theoretical null distribution that would be used for any one case considered separately. An empirical Bayes analysis plan for this situation is developed, using a local version of the false discovery rate to examine the inference issues. Two genomics problems are used as examples to show the importance of correctly choosing the null hypothesis.
引用
收藏
页码:96 / 104
页数:9
相关论文
共 50 条
[31]   Large-scale signal detection: A unified perspective [J].
Mukhopadhyay, Subhadeep .
BIOMETRICS, 2016, 72 (02) :325-334
[32]   Empirical Bayesian Selection of Hypothesis Testing Procedures for Analysis of Sequence Count Expression Data [J].
Pounds, Stanley B. ;
Gao, Cuilan L. ;
Zhang, Hui .
STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2012, 11 (05)
[33]   Posterior Probability of Discovery and Expected Rate of Discovery for Multiple Hypothesis Testing and High Throughput Assays [J].
Zhao, Yihua .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (495) :984-996
[34]   OPTIMAL FALSE DISCOVERY RATE CONTROL FOR LARGE SCALE MULTIPLE TESTING WITH AUXILIARY INFORMATION [J].
Cao, Hongyuan ;
Chen, Jun ;
Zhang, Xianyang .
ANNALS OF STATISTICS, 2022, 50 (02) :807-857
[35]   Asymptotic Properties of Bayes Risk of a General Class of Shrinkage Priors in Multiple Hypothesis Testing Under Sparsity [J].
Ghosh, Prasenjit ;
Tang, Xueying ;
Ghosh, Malay ;
Chakrabarti, Arijit .
BAYESIAN ANALYSIS, 2016, 11 (03) :753-796
[36]   Making the cut: improved ranking and selection for large-scale inference [J].
Henderson, Nicholas C. ;
Newton, Michael A. .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2016, 78 (04) :781-804
[37]   A nonparametric empirical Bayes approach to large-scale multivariate regression [J].
Wang, Yihe ;
Zhao, Sihai Dave .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2021, 156
[38]   Molecular Heterogeneity in Large-Scale Biological Data: Techniques and Applications [J].
Deng, Chao ;
Daley, Timothy ;
Brandine, Guilherme De Sena ;
Smith, Andrew D. .
ANNUAL REVIEW OF BIOMEDICAL DATA SCIENCE, VOL 2, 2019, 2019, 2 :39-67
[39]   covRNA: discovering covariate associations in large-scale gene expression data [J].
Urban, Lara ;
Remmele, Christian W. ;
Dittrich, Marcus ;
Schwarz, Roland F. ;
Mueller, Tobias .
BMC RESEARCH NOTES, 2020, 13 (01)
[40]   covRNA: discovering covariate associations in large-scale gene expression data [J].
Lara Urban ;
Christian W. Remmele ;
Marcus Dittrich ;
Roland F. Schwarz ;
Tobias Müller .
BMC Research Notes, 13