Sparse sliced inverse regression for high dimensional data analysis

被引:1
|
作者
Hilafu, Haileab [1 ]
Safo, Sandra E. [2 ]
机构
[1] Univ Tennessee, Dept Business Analyt & Stat, Knoxville, TN 37996 USA
[2] Univ Minnesota, Div Biostat, Minneapolis, MN 55455 USA
关键词
Semiparametric model; Generalized eigenvalue decomposition; Sliced inverse regression; Linear discriminant analysis; High-dimensional data; REDUCTION;
D O I
10.1186/s12859-022-04700-3
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background Dimension reduction and variable selection play a critical role in the analysis of contemporary high-dimensional data. The semi-parametric multi-index model often serves as a reasonable model for analysis of such high-dimensional data. The sliced inverse regression (SIR) method, which can be formulated as a generalized eigenvalue decomposition problem, offers a model-free estimation approach for the indices in the semi-parametric multi-index model. Obtaining sparse estimates of the eigenvectors that constitute the basis matrix that is used to construct the indices is desirable to facilitate variable selection, which in turn facilitates interpretability and model parsimony. Results To this end, we propose a group-Dantzig selector type formulation that induces row-sparsity to the sliced inverse regression dimension reduction vectors. Extensive simulation studies are carried out to assess the performance of the proposed method, and compare it with other state of the art methods in the literature. Conclusion The proposed method is shown to yield competitive estimation, prediction, and variable selection performance. Three real data applications, including a metabolomics depression study, are presented to demonstrate the method's effectiveness in practice.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Sparse sliced inverse regression for high dimensional data analysis
    Haileab Hilafu
    Sandra E. Safo
    BMC Bioinformatics, 23
  • [2] Online sparse sliced inverse regression for high-dimensional streaming data
    Xu, Jianjun
    Cui, Wenquan
    Cheng, Haoyang
    INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2023, 21 (02)
  • [3] A convex formulation for high-dimensional sparse sliced inverse regression
    Tan, Kean Ming
    Wang, Zhaoran
    Zhang, Tong
    Liu, Han
    Cook, R. Dennis
    BIOMETRIKA, 2018, 105 (04) : 769 - 782
  • [4] Sparse sliced inverse regression
    Li, Lexin
    Nachtsheim, Christopher J.
    TECHNOMETRICS, 2006, 48 (04) : 503 - 510
  • [5] Federated Sufficient Dimension Reduction Through High-Dimensional Sparse Sliced Inverse Regression
    Cui, Wenquan
    Zhao, Yue
    Xu, Jianjun
    Cheng, Haoyang
    COMMUNICATIONS IN MATHEMATICS AND STATISTICS, 2023,
  • [6] On sliced inverse regression with high-dimensional covariates
    Zhu, LX
    Miao, BQ
    Peng, H
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2006, 101 (474) : 630 - 643
  • [7] Sparse Sliced Inverse Regression via Lasso
    Lin, Qian
    Zhao, Zhigen
    Liu, Jun S.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2019, 114 (528) : 1726 - 1739
  • [8] Sliced inverse regression for high-dimensional time series
    Becker, C
    Fried, R
    EXPLORATORY DATA ANALYSIS IN EMPIRICAL RESEARCH, PROCEEDINGS, 2003, : 3 - 11
  • [9] Sliced inverse regression for survival data
    Maya Shevlyakova
    Stephan Morgenthaler
    Statistical Papers, 2014, 55 : 209 - 220
  • [10] Active learning with generalized sliced inverse regression for high-dimensional reliability analysis
    Yin, Jianhua
    Du, Xiaoping
    STRUCTURAL SAFETY, 2022, 94