A Variable Selection Method for High-Dimensional Survival Data

被引:1
|
作者
Giordano, Francesco [1 ]
Milito, Sara [1 ]
Restaino, Marialuisa [1 ]
机构
[1] Univ Salerno, Via Giovanni Paolo II 132, I-84084 Salerno, Italy
来源
MATHEMATICAL AND STATISTICAL METHODS FOR ACTUARIAL SCIENCES AND FINANCE, MAF 2022 | 2022年
关键词
Variable selection; High-dimension; Survival data;
D O I
10.1007/978-3-030-99638-3_49
中图分类号
F8 [财政、金融];
学科分类号
0202 ;
摘要
Survival data with high-dimensional predictors are regularly collected in many studies. Models with a very large number of covariates are both infeasible to fit and likely to incur low predictability due to overfitting. The selection of significant variables plays a crucial role in estimating models. Even if several approaches that identify variables in presence of censored data are available in literature, there is not unanimous consensus on which method outperforms the others. Nonetheless, it is possible to exploit the advantages of methods to get the final set of covariates as good as possible. Therefore, we propose a method that combines different variable selection procedures by using the subsampling technique, for identifying as relevant those covariates that are selected most frequently by the different variable selectors on subsampled data. By a simulation study, we evaluate the performance of the proposed procedure and compare it with other techniques.
引用
收藏
页码:303 / 308
页数:6
相关论文
共 50 条
  • [11] Variable selection via combined penalization for high-dimensional data analysis
    Wang, Xiaoming
    Park, Taesung
    Carriere, K. C.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2010, 54 (10) : 2230 - 2243
  • [12] LASSO-type variable selection methods for high-dimensional data
    Fu, Guanghui
    Wang, Pan
    ADVANCES IN COMPUTATIONAL MODELING AND SIMULATION, PTS 1 AND 2, 2014, 444-445 : 604 - 609
  • [13] GIBBS POSTERIOR FOR VARIABLE SELECTION IN HIGH-DIMENSIONAL CLASSIFICATION AND DATA MINING
    Jiang, Wenxin
    Tanner, Martin A.
    ANNALS OF STATISTICS, 2008, 36 (05) : 2207 - 2231
  • [14] Variable Selection in High-Dimensional Partially Linear Models with Longitudinal Data
    Yang Yiping
    Xue Liugen
    RECENT ADVANCE IN STATISTICS APPLICATION AND RELATED AREAS, VOLS I AND II, 2009, : 661 - 667
  • [15] Variable selection techniques after multiple imputation in high-dimensional data
    Zahid, Faisal Maqbool
    Faisal, Shahla
    Heumann, Christian
    STATISTICAL METHODS AND APPLICATIONS, 2020, 29 (03) : 553 - 580
  • [16] An improved variable selection procedure for adaptive Lasso in high-dimensional survival analysis
    He, Kevin
    Wang, Yue
    Zhou, Xiang
    Xu, Han
    Huang, Can
    LIFETIME DATA ANALYSIS, 2019, 25 (03) : 569 - 585
  • [17] An improved variable selection procedure for adaptive Lasso in high-dimensional survival analysis
    Kevin He
    Yue Wang
    Xiang Zhou
    Han Xu
    Can Huang
    Lifetime Data Analysis, 2019, 25 : 569 - 585
  • [18] A consistent variable selection method in high-dimensional canonical discriminant analysis
    Oda, Ryoya
    Suzuki, Yuya
    Yanagihara, Hirokazu
    Fujikoshi, Yasunori
    JOURNAL OF MULTIVARIATE ANALYSIS, 2020, 175
  • [19] Variable selection for high-dimensional incomplete data using horseshoe estimation with data augmentation
    Zhang, Yunxi
    Kim, Soeun
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2024, 53 (12) : 4235 - 4251
  • [20] Quantile forward regression for high-dimensional survival data
    Lee, Eun Ryung
    Park, Seyoung
    Lee, Sang Kyu
    Hong, Hyokyoung G.
    LIFETIME DATA ANALYSIS, 2023, 29 (04) : 769 - 806