VARIABLE SELECTION VIA PARTIAL CORRELATION

被引:12
作者
Li, Runze [1 ,2 ]
Liu, Jingyuan [3 ,4 ]
Lou, Lejia [5 ]
机构
[1] Penn State Univ, Dept Stat, University Pk, PA 16802 USA
[2] Penn State Univ, Methodol Ctr, University Pk, PA 16802 USA
[3] Xiamen Univ, Dept Stat, Sch Econ, Wang Yanan Inst Studies Econ, Xiamen 361005, Peoples R China
[4] Xiamen Univ, Fujian Key Lab Stat Sci, Xiamen 361005, Peoples R China
[5] Ernst & Young, 5 Times Sq, New York, NY 10036 USA
基金
中国国家自然科学基金; 美国国家科学基金会;
关键词
Elliptical distribution; model selection consistency; partial correlation; partial faithfulness; sure screening property; ultrahigh dimensional linear model; variable selection; BAYESIAN-INFERENCE; ADAPTIVE LASSO; LINEAR-MODELS;
D O I
10.5705/ss.202015.0473
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
A partial correlation-based variable selection method was proposed for normal linear regression models by Biihlmann, Kalisch and Maathuis (2010) as an alternative to regularization methods for variable selection. This paper addresses issues related to (a) whether the method is sensitive to the normality assumption, and (b) whether the method is valid when the dimension of predictor increases at an exponential rate in the sample size. To address (a), we study the method for elliptical linear regression models. Our finding indicates that the original proposal can lead to inferior performance when the marginal kurtosis of predictor is not close to that of normal distribution, and simulation results confirm this. To ensure the superior performance of the partial correlation-based variable selection procedure, we propose a thresholded partial correlation (TPC) approach to select significant variables in linear regression models. We establish the selection consistency of the TPC in the presence of ultrahigh dimensional predictors. Since the TPC procedure includes the original proposal as a special case, our results address the issue (b) directly. As a by-product, the sure screening property of the first step of TPC is obtained. Numerical examples illustrate that the TPC is comparable to the commonly-used regularization methods for variable selection.
引用
收藏
页码:983 / 996
页数:14
相关论文
共 50 条
  • [31] A Partial Least Squares based algorithm for parsimonious variable selection
    Tahir Mehmood
    Harald Martens
    Solve Sæbø
    Jonas Warringer
    Lars Snipen
    Algorithms for Molecular Biology, 6
  • [32] Variable selection for partial least squares modeling by genetic algorithms
    Chu, XL
    Yuan, HF
    Wang, YB
    Lu, WZ
    CHINESE JOURNAL OF ANALYTICAL CHEMISTRY, 2001, 29 (04) : 437 - 442
  • [33] Bayesian variable selection in generalized additive partial linear models
    Banerjee, Sayantan
    Ghosal, Subhashis
    STAT, 2014, 3 (01): : 363 - 378
  • [34] Variable selection for the partial linear single-index model
    Wang, Wu
    Zhu, Zhong-yi
    ACTA MATHEMATICAE APPLICATAE SINICA-ENGLISH SERIES, 2017, 33 (02): : 373 - 388
  • [35] Variable selection in monotone single-index models via the adaptive LASSO
    Foster, Jared C.
    Taylor, Jeremy M. G.
    Nan, Bin
    STATISTICS IN MEDICINE, 2013, 32 (22) : 3944 - 3954
  • [36] Comparison of variable selection methods in partial least squares regression
    Mehmood, Tahir
    Saebo, Solve
    Liland, Kristian Hovde
    JOURNAL OF CHEMOMETRICS, 2020, 34 (06)
  • [37] Robust feature screening for varying coefficient models via quantile partial correlation
    Li, Xiang-Jie
    Ma, Xue-Jun
    Zhang, Jing-Xiao
    METRIKA, 2017, 80 (01) : 17 - 49
  • [38] Correlation pursuit: forward stepwise variable selection for index models
    Zhong, Wenxuan
    Zhang, Tingting
    Zhu, Yu
    Liu, Jun S.
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2012, 74 : 849 - 870
  • [39] Structured Variable Selection for Regularized Generalized Canonical Correlation Analysis
    Lofstedt, Tommy
    Hadj-Selem, Fouad
    Guillemot, Vincent
    Philippe, Cathy
    Duchesnay, Edouard
    Frouin, Vincent
    Tenenhaus, Arthur
    MULTIPLE FACETS OF PARTIAL LEAST SQUARES AND RELATED METHODS, 2016, 173 : 129 - 139
  • [40] DIFFERENTIALLY PRIVATE VARIABLE SELECTION VIA THE KNOCKOFF FILTER
    Pournaderi, Mehrdad
    Xiang, Yu
    2021 IEEE 31ST INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2021,