Making the cut: improved ranking and selection for large-scale inference

Cited by: 11
Authors
Henderson, Nicholas C. [1]
Newton, Michael A. [1]
Affiliations
[1] Univ Wisconsin, Madison, WI 53706 USA
Funding
US National Institutes of Health;
Keywords
Empirical Bayes; Posterior expected rank; r-value; 2-STAGE; MODEL;
DOI
10.1111/rssb.12131
Chinese Library Classification (CLC) number
O21 [Probability theory and mathematical statistics]; C8 [Statistics];
Discipline classification codes
020208 ; 070103 ; 0714 ;
Abstract
Identifying leading measurement units from a large collection is a common inference task in various domains of large-scale inference. Testing approaches, which measure evidence against a null hypothesis rather than effect magnitude, tend to overpopulate lists of leading units with those associated with low measurement error. By contrast, local maximum likelihood approaches tend to favour units with high measurement error. Available Bayesian and empirical Bayesian approaches rely on specialized loss functions that result in similar deficiencies. We describe and evaluate a generic empirical Bayesian ranking procedure that populates the list of top units in a way that maximizes the expected overlap between the true and reported top lists for all list sizes. The procedure relates unit-specific posterior upper tail probabilities with their empirical distribution to yield a ranking variable. It discounts high variance units less than popular non-maximum-likelihood methods and thus achieves improved operating characteristics in the models considered.
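The ranking idea described in the abstract (relating each unit's posterior upper tail probability to the empirical distribution of those probabilities) can be illustrated with a small sketch. This is not the authors' implementation. It assumes a normal-normal model with a known prior mean and standard deviation (in practice these would be estimated from the data), and the function name `tail_rank_values`, the grid of list fractions, and the thresholding rule are illustrative choices introduced here.

```python
from math import ceil
from statistics import NormalDist


def tail_rank_values(x, se, prior_mean=0.0, prior_sd=1.0, grid_size=200):
    """Grid-based ranking variable; smaller values mean a higher rank.

    Assumed model (an assumption of this sketch, not taken from the record):
        theta_i ~ N(prior_mean, prior_sd^2)
        x_i | theta_i ~ N(theta_i, se_i^2), with every se_i > 0.
    """
    n = len(x)
    prior = NormalDist(prior_mean, prior_sd)

    # Conjugate normal posterior of theta_i given x_i.
    posteriors = []
    for xi, si in zip(x, se):
        shrink = prior_sd ** 2 / (prior_sd ** 2 + si ** 2)
        post_mean = prior_mean + shrink * (xi - prior_mean)
        post_sd = (shrink * si ** 2) ** 0.5
        posteriors.append(NormalDist(post_mean, post_sd))

    ranks = [None] * n
    for k in range(1, grid_size + 1):
        alpha = k / (grid_size + 1)  # candidate top-list fraction
        # Effect size exceeded by a fraction alpha of units under the prior.
        cutoff = prior.inv_cdf(1.0 - alpha)
        # Posterior upper tail probability of each unit at that cutoff.
        tail = [1.0 - p.cdf(cutoff) for p in posteriors]
        # Empirical threshold: the ceil(alpha * n)-th largest tail probability.
        top_k = max(1, ceil(alpha * n))
        threshold = sorted(tail, reverse=True)[top_k - 1]
        # A unit's value is the smallest fraction at which it makes the list.
        for i in range(n):
            if ranks[i] is None and tail[i] >= threshold:
                ranks[i] = alpha
    return [1.0 if r is None else r for r in ranks]
```

A call such as `tail_rank_values([3.0, 2.5], [0.2, 2.0])` compares a precisely measured unit with a noisy one; sorting units by the returned values in increasing order gives the reported top list for any list size.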
Pages: 781 - 804
Page count: 24
Related papers
50 in total
  • [1] LARGE-SCALE RANKING AND SELECTION USING CLOUD COMPUTING
    Luo, Jun
    Hong, L. Jeff
    PROCEEDINGS OF THE 2011 WINTER SIMULATION CONFERENCE (WSC), 2011, : 4046 - 4056
  • [2] The (Surprising) Sample Optimality of Greedy Procedures for Large-Scale Ranking and Selection
    Li, Zaile
    Fan, Weiwei
    Hong, L. Jeff
    MANAGEMENT SCIENCE, 2024,
  • [3] Solving Large-Scale Fixed-Budget Ranking and Selection Problems
    Hong, L. Jeff
    Jiang, Guangxin
    Zhong, Ying
    INFORMS JOURNAL ON COMPUTING, 2022, 34 (06) : 2930 - 2949
  • [4] Covariate-assisted ranking and screening for large-scale two-sample inference
    Cai, T. Tony
    Sun, Wenguang
    Wang, Weinan
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2019, 81 (02) : 187 - 234
  • [5] RichMind: A Tool for Improved Inference from Large-Scale Neuroimaging Results
    Maron-Katz, Adi
    Amar, David
    Ben Simon, Eti
    Hendler, Talma
    Shamir, Ron
    PLOS ONE, 2016, 11 (07):
  • [6] COMPARING MESSAGE PASSING INTERFACE AND MAPREDUCE FOR LARGE-SCALE PARALLEL RANKING AND SELECTION
    Ni, Eric C.
    Ciocan, Dragos F.
    Henderson, Shane G.
    Hunter, Susan R.
    2015 WINTER SIMULATION CONFERENCE (WSC), 2015, : 3858 - 3867
  • [7] An improved sequential backward selection algorithm for large-scale observation selection problems
    Reeves, SJ
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 1657 - 1660
  • [8] LARGE-SCALE INFERENCE WITH BLOCK STRUCTURE
    Kou, Jiyao
    Walther, Guenther
    ANNALS OF STATISTICS, 2022, 50 (03) : 1541 - 1572
  • [9] Optimal group selection model for large-scale group decision making
    Wu, Peng
    Wu, Qun
    Zhou, Ligang
    Chen, Huayou
    INFORMATION FUSION, 2020, 61 : 1 - 12
  • [10] Knockout-Tournament Procedures for Large-Scale Ranking and Selection in Parallel Computing Environments
    Zhong, Ying
    Hong, L. Jeff
    OPERATIONS RESEARCH, 2022, 70 (01) : 432 - 453