A systematic review on model selection in high-dimensional regression

被引:7
作者
Lee, Eun Ryung [1 ]
Cho, Jinwoo [1 ]
Yu, Kyusang [2 ]
机构
[1] Sungkyunkwan Univ, Seoul, South Korea
[2] Konkuk Univ, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
model selection; Penalized methods; LASSO; SCAD; High dimensional regression models; General convex loss; Quadratic margin condition; High level conditions; Model selection consistency; Oracle property; NONCONCAVE PENALIZED LIKELIHOOD; VARIABLE SELECTION; DIVERGING NUMBER; SHRINKAGE; CRITERIA; LASSO;
D O I
10.1016/j.jkss.2018.10.001
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
High dimensional models are getting much attention from diverse research fields involving very many parameters with a moderate size of data. Model selection is an important issue in such a high dimensional data analysis. Recent literature on theoretical understanding of high dimensional models covers a wide range of penalized methods including LASSO and SCAD. This paper presents a systematic overview of the recent development in high dimensional statistical models. We provide a brief review on the recent development of theory, methods, and guideline on applications of several penalized methods. The review includes appropriate settings to be implemented and limitations along with potential solution for each of the reviewed method. In particular, we provide a systematic review of statistical theory of the high dimensional methods by considering a unified high-dimensional modeling framework together with high level conditions. This framework includes (generalized) linear regression and quantile regression as its special cases. We hope our review helps researchers in this field to have a better understanding of the area and provides useful information to future study. (C) 2018 The Korean Statistical Society. Published by Elsevier B.V. All rights reserved.
引用
收藏
页码:1 / 12
页数:12
相关论文
共 29 条
  • [1] SIMULTANEOUS ANALYSIS OF LASSO AND DANTZIG SELECTOR
    Bickel, Peter J.
    Ritov, Ya'acov
    Tsybakov, Alexandre B.
    [J]. ANNALS OF STATISTICS, 2009, 37 (04) : 1705 - 1732
  • [2] Bühlmann P, 2011, SPRINGER SER STAT, P1, DOI 10.1007/978-3-642-20192-9
  • [3] Candes E, 2007, ANN STAT, V35, P2313, DOI 10.1214/009053606000001523
  • [4] Stable signal recovery from incomplete and inaccurate measurements
    Candes, Emmanuel J.
    Romberg, Justin K.
    Tao, Terence
    [J]. COMMUNICATIONS ON PURE AND APPLIED MATHEMATICS, 2006, 59 (08) : 1207 - 1223
  • [5] Extended Bayesian information criteria for model selection with large model spaces
    Chen, Jiahua
    Chen, Zehua
    [J]. BIOMETRIKA, 2008, 95 (03) : 759 - 771
  • [6] EXTENDED BIC FOR SMALL-n-LARGE-P SPARSE GLM
    Chen, Jiahua
    Chen, Zehua
    [J]. STATISTICA SINICA, 2012, 22 (02) : 555 - 574
  • [7] Fan JQ, 2010, STAT SINICA, V20, P101
  • [8] Nonconcave penalized likelihood with a diverging number of parameters
    Fan, JQ
    Peng, H
    [J]. ANNALS OF STATISTICS, 2004, 32 (03) : 928 - 961
  • [9] Variable selection via nonconcave penalized likelihood and its oracle properties
    Fan, JQ
    Li, RZ
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) : 1348 - 1360
  • [10] Kim Y., 2012, J AM STAT ASSOC, V103, P1665