VARIABLE SELECTION VIA PARTIAL CORRELATION

被引:12
作者
Li, Runze [1 ,2 ]
Liu, Jingyuan [3 ,4 ]
Lou, Lejia [5 ]
机构
[1] Penn State Univ, Dept Stat, University Pk, PA 16802 USA
[2] Penn State Univ, Methodol Ctr, University Pk, PA 16802 USA
[3] Xiamen Univ, Dept Stat, Sch Econ, Wang Yanan Inst Studies Econ, Xiamen 361005, Peoples R China
[4] Xiamen Univ, Fujian Key Lab Stat Sci, Xiamen 361005, Peoples R China
[5] Ernst & Young, 5 Times Sq, New York, NY 10036 USA
基金
中国国家自然科学基金; 美国国家科学基金会;
关键词
Elliptical distribution; model selection consistency; partial correlation; partial faithfulness; sure screening property; ultrahigh dimensional linear model; variable selection; BAYESIAN-INFERENCE; ADAPTIVE LASSO; LINEAR-MODELS;
D O I
10.5705/ss.202015.0473
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
A partial correlation-based variable selection method was proposed for normal linear regression models by Biihlmann, Kalisch and Maathuis (2010) as an alternative to regularization methods for variable selection. This paper addresses issues related to (a) whether the method is sensitive to the normality assumption, and (b) whether the method is valid when the dimension of predictor increases at an exponential rate in the sample size. To address (a), we study the method for elliptical linear regression models. Our finding indicates that the original proposal can lead to inferior performance when the marginal kurtosis of predictor is not close to that of normal distribution, and simulation results confirm this. To ensure the superior performance of the partial correlation-based variable selection procedure, we propose a thresholded partial correlation (TPC) approach to select significant variables in linear regression models. We establish the selection consistency of the TPC in the presence of ultrahigh dimensional predictors. Since the TPC procedure includes the original proposal as a special case, our results address the issue (b) directly. As a by-product, the sure screening property of the first step of TPC is obtained. Numerical examples illustrate that the TPC is comparable to the commonly-used regularization methods for variable selection.
引用
收藏
页码:983 / 996
页数:14
相关论文
共 50 条
  • [41] Optimized variable selection via repeated data splitting
    Capanu, Marinela
    Giurcanu, Mihai
    Begg, Colin B.
    Gonen, Mithat
    STATISTICS IN MEDICINE, 2020, 39 (16) : 2167 - 2184
  • [42] Variable selection in regression via repeated data splitting
    Thall, PF
    Russell, KE
    Simon, RM
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 1997, 6 (04) : 416 - 434
  • [43] Variable selection in classification model via quadratic programming
    Huang, Jun
    Wang, Haibo
    Wang, Wei
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2018, 47 (07) : 1922 - 1939
  • [44] Nonlinear Variable Selection via Deep Neural Networks
    Chen, Yao
    Gao, Qingyi
    Liang, Faming
    Wang, Xiao
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2021, 30 (02) : 484 - 492
  • [45] Variable selection and coefficient estimation via composite quantile regression with randomly censored data
    Jiang, Rong
    Qian, Weimin
    Zhou, Zhangong
    STATISTICS & PROBABILITY LETTERS, 2012, 82 (02) : 308 - 317
  • [46] A Robust Variable Selection Method for Sparse Online Regression via the Elastic Net Penalty
    Wang, Wentao
    Liang, Jiaxuan
    Liu, Rong
    Song, Yunquan
    Zhang, Min
    MATHEMATICS, 2022, 10 (16)
  • [47] Variable selection approach for zero-inflated count data via adaptive lasso
    Zeng, Ping
    Wei, Yongyue
    Zhao, Yang
    Liu, Jin
    Liu, Liya
    Zhang, Ruyang
    Gou, Jianwei
    Huang, Shuiping
    Chen, Feng
    JOURNAL OF APPLIED STATISTICS, 2014, 41 (04) : 879 - 894
  • [48] Use of Partial Least Squares Regression for Variable Selection and Quality Prediction
    Jun, Chi-Hyuck
    Lee, Sang-Ho
    Park, Hae-Sang
    Lee, Jeong-Hwa
    CIE: 2009 INTERNATIONAL CONFERENCE ON COMPUTERS AND INDUSTRIAL ENGINEERING, VOLS 1-3, 2009, : 1302 - 1307
  • [49] Partial least squares regression with conditional orthogonal projection for variable selection
    Wang, Jiangchuan
    Ma, Haiqiang
    Li, Chuanquan
    Liu, Qing
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2024, 53 (12) : 5752 - 5763
  • [50] Variable selection in partial linear regression using the least angle regression
    Seo, Han Son
    Yoon, Min
    Lee, Hakbae
    KOREAN JOURNAL OF APPLIED STATISTICS, 2021, 34 (06) : 937 - 944