Bayesian Model Selection in High-Dimensional Settings

被引:161
|
作者
Johnson, Valen E. [1 ,2 ]
Rossell, David [3 ]
机构
[1] Div Head Quantitat Sci, Houston, TX 77030 USA
[2] Univ Texas MD Anderson Canc Ctr, Houston, TX 77030 USA
[3] Inst Res Biomed Barcelona, Biostat & Bioinformat Unit, Barcelona, Spain
关键词
Adaptive LASSO; Dantzig selector; Elastic net; g-prior; Intrinsic Bayes factor; Intrinsic prior; Nonlocal prior; Nonnegative garrote; Oracle; NONCONCAVE PENALIZED LIKELIHOOD; VARIABLE-SELECTION; CONSISTENCY; CONVERGENCE; MOMENTS;
D O I
10.1080/01621459.2012.682536
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Standard assumptions incorporated into Bayesian model selection procedures result in procedures that are not competitive with commonly used penalized likelihood methods. We propose modifications of these methods by imposing nonlocal prior densities on model parameters. We show that the resulting model selection procedures are consistent in linear model settings when the number of possible covariates p is bounded by the number of observations n, a property that has not been extended to other model selection procedures. In addition to consistently identifying the true model, the proposed procedures provide accurate estimates of the posterior probability that each identified model is correct. Through simulation studies, we demonstrate that these model selection procedures perform as well or better than commonly used penalized likelihood methods in a range of simulation settings. Proofs of the primary theorems are provided in the Supplementary Material that is available online.
引用
收藏
页码:649 / 660
页数:12
相关论文
共 50 条
  • [1] Bayesian Model Selection in High-Dimensional Settings (vol 107 pg 649, 2012)
    Johnson, Valen E.
    Rossell, David
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2012, 107 (500) : 1656 - 1656
  • [2] Proximal nested sampling for high-dimensional Bayesian model selection
    Xiaohao Cai
    Jason D. McEwen
    Marcelo Pereyra
    Statistics and Computing, 2022, 32
  • [3] High-dimensional Ising model selection with Bayesian information criteria
    Barber, Rina Foygel
    Drton, Mathias
    ELECTRONIC JOURNAL OF STATISTICS, 2015, 9 (01): : 567 - 607
  • [4] Proximal nested sampling for high-dimensional Bayesian model selection
    Cai, Xiaohao
    McEwen, Jason D.
    Pereyra, Marcelo
    STATISTICS AND COMPUTING, 2022, 32 (05)
  • [5] Bayesian model selection for high-dimensional Ising models, with applications to educational data
    Park, Jaewoo
    Jin, Ick Hoon
    Schweinberger, Michael
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2022, 165
  • [6] Bayesian variable selection in multinomial probit model for classifying high-dimensional data
    Aijun Yang
    Yunxian Li
    Niansheng Tang
    Jinguan Lin
    Computational Statistics, 2015, 30 : 399 - 418
  • [7] Bayesian variable selection and model averaging in high-dimensional multinomial nonparametric regression
    Yau, P
    Kohn, R
    Wood, S
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2003, 12 (01) : 23 - 54
  • [8] Posterior model consistency in high-dimensional Bayesian variable selection with arbitrary priors
    Hua, Min
    Goh, Gyuhyeong
    STATISTICS & PROBABILITY LETTERS, 2025, 223
  • [9] Composite Likelihood Bayesian Information Criteria for Model Selection in High-Dimensional Data
    Gao, Xin
    Song, Peter X. -K.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2010, 105 (492) : 1531 - 1540
  • [10] Bayesian variable selection in multinomial probit model for classifying high-dimensional data
    Yang, Aijun
    Li, Yunxian
    Tang, Niansheng
    Lin, Jinguan
    COMPUTATIONAL STATISTICS, 2015, 30 (02) : 399 - 418