Interaction screening in high-dimensional multi-response regression via projected distance correlation

被引:0
作者
Liu, Lili [1 ,2 ]
Lin, Lu [3 ]
Liu, Lei [1 ]
机构
[1] Washington Univ St Louis, Div Biostat, St Louis, MO USA
[2] Shandong Univ, Res Ctr Math & Interdisciplinary Sci, Qingdao, Peoples R China
[3] Shandong Univ, Zhongtai Secur Inst Financial Studies, Jinan, Peoples R China
基金
中国博士后科学基金;
关键词
High dimensionality; interaction screening; Multi-response regression; Variable selection; VARIABLE SELECTION; QUANTILE REGRESSION; SNP INTERACTIONS; LINEAR-MODELS; DEPENDENCE; LASSO;
D O I
10.1080/03610918.2024.2393691
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Interaction screening for high-dimensional data is a challenging issue, especially for the strongly correlated predictors. A new two-stage interaction screening procedure based on the projected distance correlation is proposed when the predictors are highly correlated. To remove the confounding effect from the target variable that is induced by its correlated variables, we project the predictors and responses onto a conditional set. Our method can successfully identify important variables when the variables are highly correlated, and it can also identify variables that make a contribution to the response conditionally but not marginally. Moreover, our method is computationally efficient and simple, generally applicable without the requirement of the heredity assumption. Theoretical results show that the proposed method can yield the sure screening property. Simulation studies and real data analysis demonstrate the utility and validity of our method.
引用
收藏
页数:26
相关论文
共 33 条
  • [1] SIMULTANEOUS ANALYSIS OF LASSO AND DANTZIG SELECTOR
    Bickel, Peter J.
    Ritov, Ya'acov
    Tsybakov, Alexandre B.
    [J]. ANNALS OF STATISTICS, 2009, 37 (04) : 1705 - 1732
  • [2] A LASSO FOR HIERARCHICAL INTERACTIONS
    Bien, Jacob
    Taylor, Jonathan
    Tibshirani, Robert
    [J]. ANNALS OF STATISTICS, 2013, 41 (03) : 1111 - 1141
  • [3] Variable selection in high-dimensional linear models: partially faithful distributions and the PC-simple algorithm
    Buehlmann, P.
    Kalisch, M.
    Maathuis, M. H.
    [J]. BIOMETRIKA, 2010, 97 (02) : 261 - 278
  • [4] Variable Selection With the Strong Heredity Constraint and Its Oracle Property
    Choi, Nam Hee
    Li, William
    Zhu, Ji
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2010, 105 (489) : 354 - 364
  • [5] Detecting gene-gene interactions that underlie human diseases
    Cordell, Heather J.
    [J]. NATURE REVIEWS GENETICS, 2009, 10 (06) : 392 - 404
  • [6] Sure independence screening for ultrahigh dimensional feature space
    Fan, Jianqing
    Lv, Jinchi
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 : 849 - 883
  • [7] A projection-based conditional dependence measure with applications to high-dimensional undirected graphical models
    Fan, Jianqing
    Feng, Yang
    Xia, Lucy
    [J]. JOURNAL OF ECONOMETRICS, 2020, 218 (01) : 119 - 139
  • [8] On selecting interacting features from high-dimensional data
    Hall, Peter
    Xue, Jing-Hao
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 71 : 694 - 708
  • [9] Model Selection for High-Dimensional Quadratic Regression via Regularization
    Hao, Ning
    Feng, Yang
    Zhang, Hao Helen
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2018, 113 (522) : 615 - 625
  • [10] Interaction Screening for Ultrahigh-Dimensional Data
    Hao, Ning
    Zhang, Hao Helen
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2014, 109 (507) : 1285 - 1301