Bayesian Model Selection in High-Dimensional Settings

被引:161
|
作者
Johnson, Valen E. [1 ,2 ]
Rossell, David [3 ]
机构
[1] Div Head Quantitat Sci, Houston, TX 77030 USA
[2] Univ Texas MD Anderson Canc Ctr, Houston, TX 77030 USA
[3] Inst Res Biomed Barcelona, Biostat & Bioinformat Unit, Barcelona, Spain
关键词
Adaptive LASSO; Dantzig selector; Elastic net; g-prior; Intrinsic Bayes factor; Intrinsic prior; Nonlocal prior; Nonnegative garrote; Oracle; NONCONCAVE PENALIZED LIKELIHOOD; VARIABLE-SELECTION; CONSISTENCY; CONVERGENCE; MOMENTS;
D O I
10.1080/01621459.2012.682536
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Standard assumptions incorporated into Bayesian model selection procedures result in procedures that are not competitive with commonly used penalized likelihood methods. We propose modifications of these methods by imposing nonlocal prior densities on model parameters. We show that the resulting model selection procedures are consistent in linear model settings when the number of possible covariates p is bounded by the number of observations n, a property that has not been extended to other model selection procedures. In addition to consistently identifying the true model, the proposed procedures provide accurate estimates of the posterior probability that each identified model is correct. Through simulation studies, we demonstrate that these model selection procedures perform as well or better than commonly used penalized likelihood methods in a range of simulation settings. Proofs of the primary theorems are provided in the Supplementary Material that is available online.
引用
收藏
页码:649 / 660
页数:12
相关论文
共 50 条
  • [21] Scalable Bayesian variable selection for structured high-dimensional data
    Chang, Changgee
    Kundu, Suprateek
    Long, Qi
    BIOMETRICS, 2018, 74 (04) : 1372 - 1382
  • [22] Bayesian Regression Trees for High-Dimensional Prediction and Variable Selection
    Linero, Antonio R.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2018, 113 (522) : 626 - 636
  • [23] Bayesian variable selection in clustering high-dimensional data with substructure
    Michael D. Swartz
    Qianxing Mo
    Mary E. Murphy
    Joanne R. Lupton
    Nancy D. Turner
    Mee Young Hong
    Marina Vannucci
    Journal of Agricultural, Biological, and Environmental Statistics, 2008, 13 : 407 - 423
  • [24] Sparse Bayesian variable selection for classifying high-dimensional data
    Yang, Aijun
    Lian, Heng
    Jiang, Xuejun
    Liu, Pengfei
    STATISTICS AND ITS INTERFACE, 2018, 11 (02) : 385 - 395
  • [25] NONPENALIZED VARIABLE SELECTION IN HIGH-DIMENSIONAL LINEAR MODEL SETTINGS VIA GENERALIZED FIDUCIAL INFERENCE
    Williams, Jonathan P.
    Hannig, Jan
    ANNALS OF STATISTICS, 2019, 47 (03): : 1723 - 1753
  • [26] Intrinsic Bayesian model for high-dimensional unsupervised reduction
    Jin, Longcun
    Wan, Wanggen
    Wu, Yongliang
    Cui, Bin
    Yu, Xiaoqing
    NEUROCOMPUTING, 2012, 98 : 143 - 150
  • [27] Joint Diagnosis of High-Dimensional Process Mean and Covariance Matrix based on Bayesian Model Selection
    Xu, Feng
    Shu, Lianjie
    Li, Yanting
    Wang, Binhui
    TECHNOMETRICS, 2023, 65 (04) : 465 - 479
  • [28] Model Based Screening Embedded Bayesian Variable Selection for Ultra-high Dimensional Settings
    Li, Dongjin
    Dutta, Somak
    Roy, Vivekananda
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2023, 32 (01) : 61 - 73
  • [29] High-Dimensional Bayesian Geostatistics
    Banerjee, Sudipto
    BAYESIAN ANALYSIS, 2017, 12 (02): : 583 - 614
  • [30] Feature selection using autoencoders with Bayesian methods to high-dimensional data
    Shu, Lei
    Huang, Kun
    Jiang, Wenhao
    Wu, Wenming
    Liu, Hongling
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (06) : 7397 - 7406