A Variable Selection Method for High-Dimensional Survival Data

Cited by: 1
Authors
Giordano, Francesco [1 ]
Milito, Sara [1 ]
Restaino, Marialuisa [1 ]
Affiliations
[1] Univ Salerno, Via Giovanni Paolo II 132, I-84084 Salerno, Italy
Source
MATHEMATICAL AND STATISTICAL METHODS FOR ACTUARIAL SCIENCES AND FINANCE, MAF 2022 | 2022
Keywords
Variable selection; High-dimension; Survival data
DOI
10.1007/978-3-030-99638-3_49
Chinese Library Classification
F8 [Public Finance, Finance]
Discipline classification code
0202
Abstract
Survival data with high-dimensional predictors are routinely collected in many studies. Models with a very large number of covariates are both infeasible to fit and likely to predict poorly because of overfitting. Selecting the significant variables therefore plays a crucial role in model estimation. Although several approaches for identifying variables in the presence of censored data are available in the literature, there is no consensus on which method outperforms the others. Nonetheless, the advantages of the individual methods can be exploited to obtain the best possible final set of covariates. We therefore propose a method that combines different variable selection procedures through subsampling, identifying as relevant those covariates that are selected most frequently by the different variable selectors on the subsampled data. In a simulation study, we evaluate the performance of the proposed procedure and compare it with other techniques.
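The combination step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the subsample fraction, the number of subsamples, and the 0.6 frequency threshold are all assumptions made for illustration, and the two toy selectors are simple placeholders that do not handle censoring properly. In practice they would be replaced by real survival variable selectors.

```python
import math
import random
from collections import Counter


def _abs_corr(xs, ys):
    """Absolute Pearson correlation of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return abs(sxy) / math.sqrt(sxx * syy) if sxx > 0 and syy > 0 else 0.0


def top_k_by_correlation(rows, k=2):
    """Toy selector (assumption): rank by |corr| with log-time among observed events."""
    events = [r for r in rows if r[2] == 1]
    log_t = [math.log(r[1]) for r in events]
    p = len(rows[0][0])
    scores = [_abs_corr([r[0][j] for r in events], log_t) for j in range(p)]
    return sorted(range(p), key=lambda j: -scores[j])[:k]


def top_k_by_median_split(rows, k=2):
    """Toy selector (assumption): rank by mean gap between short and long times."""
    ordered = sorted(rows, key=lambda r: r[1])
    half = len(ordered) // 2
    early, late = ordered[:half], ordered[half:]
    p = len(rows[0][0])

    def gap(j):
        return abs(sum(r[0][j] for r in early) / len(early)
                   - sum(r[0][j] for r in late) / len(late))

    return sorted(range(p), key=lambda j: -gap(j))[:k]


def combine_by_subsampling(rows, selectors, n_subsamples=50, fraction=0.5,
                           threshold=0.6, seed=0):
    """Run every selector on random subsamples and keep frequently chosen covariates.

    rows: list of (covariates, time, event) tuples.
    Returns (selected indices, selection frequency per covariate).
    """
    rng = random.Random(seed)
    m = max(2, int(fraction * len(rows)))
    counts = Counter()
    for _ in range(n_subsamples):
        sub = rng.sample(rows, m)  # subsample without replacement
        for select in selectors:
            counts.update(set(select(sub)))
    total = n_subsamples * len(selectors)
    freq = {j: c / total for j, c in counts.items()}
    selected = sorted(j for j, f in freq.items() if f >= threshold)
    return selected, freq


def simulate(n=200, p=10, seed=1):
    """Synthetic data (assumption): only covariates 0 and 1 affect the time."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        x = [rng.gauss(0, 1) for _ in range(p)]
        time = math.exp(-x[0] - x[1] + 0.1 * rng.gauss(0, 1))
        event = 1 if rng.random() < 0.8 else 0  # roughly 20% flagged as censored
        rows.append((x, time, event))
    return rows


rows = simulate()
selected, freq = combine_by_subsampling(
    rows, [top_k_by_correlation, top_k_by_median_split])
print(selected, freq)
```

Here the selection frequency of a covariate is the share of all (subsample, selector) runs in which it was chosen. Whether frequencies are pooled across selectors or thresholded per selector is a design choice that the abstract does not specify.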
Pages: 303-308
Number of pages: 6
Related papers
50 in total
  • [21] Controlled variable selection in Weibull mixture cure models for high-dimensional data
    Fu, Han
    Nicolet, Deedra
    Mrozek, Krzysztof
    Stone, Richard M.
    Eisfeld, Ann-Kathrin
    Byrd, John C.
    Archer, Kellie J.
    STATISTICS IN MEDICINE, 2022, 41 (22) : 4340 - 4366
  • [22] Robust feature screening for high-dimensional survival data
    Hao, Meiling
    Lin, Yuanyuan
    Liu, Xianhui
    Tang, Wenlu
    JOURNAL OF APPLIED STATISTICS, 2019, 46 (06) : 979 - 994
  • [23] Variable selection in the high-dimensional continuous generalized linear model with current status data
    Tian, Guo-Liang
    Wang, Mingqiu
    Song, Lixin
    JOURNAL OF APPLIED STATISTICS, 2014, 41 (03) : 467 - 483
  • [24] ENNS: Variable Selection, Regression, Classification and Deep Neural Network for High-Dimensional Data
    Yang, Kaixu
    Ganguli, Arkaprabha
    Maiti, Tapabrata
    JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25
  • [25] Bayesian variable selection in clustering high-dimensional data via a mixture of finite mixtures
    Doo, Woojin
    Kim, Heeyoung
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2021, 91 (12) : 2551 - 2568
  • [26] Feature selection for high-dimensional temporal data
    Tsagris, Michail
    Lagani, Vincenzo
    Tsamardinos, Ioannis
    BMC BIOINFORMATICS, 2018, 19
  • [27] Sparse Bayesian variable selection in kernel probit model for analyzing high-dimensional data
    Yang, Aijun
    Tian, Yuzhu
    Li, Yunxian
    Lin, Jinguan
    COMPUTATIONAL STATISTICS, 2020, 35 (01) : 245 - 258
  • [30] The use of random-effect models for high-dimensional variable selection problems
    Kwon, Sunghoon
    Oh, Seungyoung
    Lee, Youngjo
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2016, 103 : 401 - 412