Bayesian Model Selection in High-Dimensional Settings

被引:161
|
作者
Johnson, Valen E. [1 ,2 ]
Rossell, David [3 ]
机构
[1] Div Head Quantitat Sci, Houston, TX 77030 USA
[2] Univ Texas MD Anderson Canc Ctr, Houston, TX 77030 USA
[3] Inst Res Biomed Barcelona, Biostat & Bioinformat Unit, Barcelona, Spain
关键词
Adaptive LASSO; Dantzig selector; Elastic net; g-prior; Intrinsic Bayes factor; Intrinsic prior; Nonlocal prior; Nonnegative garrote; Oracle; NONCONCAVE PENALIZED LIKELIHOOD; VARIABLE-SELECTION; CONSISTENCY; CONVERGENCE; MOMENTS;
D O I
10.1080/01621459.2012.682536
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Standard assumptions incorporated into Bayesian model selection procedures result in procedures that are not competitive with commonly used penalized likelihood methods. We propose modifications of these methods by imposing nonlocal prior densities on model parameters. We show that the resulting model selection procedures are consistent in linear model settings when the number of possible covariates p is bounded by the number of observations n, a property that has not been extended to other model selection procedures. In addition to consistently identifying the true model, the proposed procedures provide accurate estimates of the posterior probability that each identified model is correct. Through simulation studies, we demonstrate that these model selection procedures perform as well or better than commonly used penalized likelihood methods in a range of simulation settings. Proofs of the primary theorems are provided in the Supplementary Material that is available online.
引用
收藏
页码:649 / 660
页数:12
相关论文
共 50 条
  • [41] PCA consistency for the power spiked model in high-dimensional settings
    Yata, Kazuyoshi
    Aoshima, Makoto
    JOURNAL OF MULTIVARIATE ANALYSIS, 2013, 122 : 334 - 354
  • [42] A systematic review on model selection in high-dimensional regression
    Lee, Eun Ryung
    Cho, Jinwoo
    Yu, Kyusang
    JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2019, 48 (01) : 1 - 12
  • [43] Simultaneous Feature and Model Selection for High-Dimensional Data
    Perolini, Alessandro
    Guerif, Sebastien
    2011 23RD IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2011), 2011, : 47 - 50
  • [44] A Model Selection Criterion for High-Dimensional Linear Regression
    Owrang, Arash
    Jansson, Magnus
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2018, 66 (13) : 3436 - 3446
  • [45] A systematic review on model selection in high-dimensional regression
    Eun Ryung Lee
    Jinwoo Cho
    Kyusang Yu
    Journal of the Korean Statistical Society, 2019, 48 : 1 - 12
  • [46] Automatic model selection for high-dimensional survival analysis
    Lang, M.
    Kotthaus, H.
    Marwedel, P.
    Weihs, C.
    Rahnenfuehrer, J.
    Bischl, B.
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2015, 85 (01) : 62 - 76
  • [47] High-dimensional Gaussian model selection on a Gaussian design
    Verzelen, Nicolas
    ANNALES DE L INSTITUT HENRI POINCARE-PROBABILITES ET STATISTIQUES, 2010, 46 (02): : 480 - 524
  • [48] Sparse Bayesian variable selection in multinomial probit regression model with application to high-dimensional data classification
    Yang Aijun
    Jiang Xuejun
    Xiang Liming
    Lin Jinguan
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2017, 46 (12) : 6137 - 6150
  • [49] Adaptive Lasso in high-dimensional settings
    Lin, Zhengyan
    Xiang, Yanbiao
    Zhang, Caiya
    JOURNAL OF NONPARAMETRIC STATISTICS, 2009, 21 (06) : 683 - 696
  • [50] The EAS approach to variable selection for multivariate response data in high-dimensional settings
    Koner, Salil
    Williams, Jonathan P.
    ELECTRONIC JOURNAL OF STATISTICS, 2023, 17 (02): : 1947 - 1995