A Variable Selection Method for High-Dimensional Survival Data

被引:1
|
作者
Giordano, Francesco [1 ]
Milito, Sara [1 ]
Restaino, Marialuisa [1 ]
机构
[1] Univ Salerno, Via Giovanni Paolo II 132, I-84084 Salerno, Italy
来源
MATHEMATICAL AND STATISTICAL METHODS FOR ACTUARIAL SCIENCES AND FINANCE, MAF 2022 | 2022年
关键词
Variable selection; High-dimension; Survival data;
D O I
10.1007/978-3-030-99638-3_49
中图分类号
F8 [财政、金融];
学科分类号
0202 ;
摘要
Survival data with high-dimensional predictors are regularly collected in many studies. Models with a very large number of covariates are both infeasible to fit and likely to incur low predictability due to overfitting. The selection of significant variables plays a crucial role in estimating models. Even if several approaches that identify variables in presence of censored data are available in literature, there is not unanimous consensus on which method outperforms the others. Nonetheless, it is possible to exploit the advantages of methods to get the final set of covariates as good as possible. Therefore, we propose a method that combines different variable selection procedures by using the subsampling technique, for identifying as relevant those covariates that are selected most frequently by the different variable selectors on subsampled data. By a simulation study, we evaluate the performance of the proposed procedure and compare it with other techniques.
引用
收藏
页码:303 / 308
页数:6
相关论文
共 50 条
  • [1] Variable selection for high-dimensional incomplete data
    Liang, Lixing
    Zhuang, Yipeng
    Yu, Philip L. H.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2024, 192
  • [2] Variable selection for longitudinal data with high-dimensional covariates and dropouts
    Zheng, Xueying
    Fu, Bo
    Zhang, Jiajia
    Qin, Guoyou
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2018, 88 (04) : 712 - 725
  • [3] Comparison of variable selection methods for high-dimensional survival data with competing events
    Gilhodes, Julia
    Zemmour, Christophe
    Ajana, Soufiane
    Martinez, Alejandra
    Delord, Jean-Pierre
    Leconte, Eve
    Boher, Jean-Marie
    Filleron, Thomas
    COMPUTERS IN BIOLOGY AND MEDICINE, 2017, 91 : 159 - 167
  • [4] VARIABLE SELECTION AND PREDICTION WITH INCOMPLETE HIGH-DIMENSIONAL DATA
    Liu, Ying
    Wang, Yuanjia
    Feng, Yang
    Wall, Melanie M.
    ANNALS OF APPLIED STATISTICS, 2016, 10 (01) : 418 - 450
  • [5] Stochastic variational variable selection for high-dimensional microbiome data
    Dang, Tung
    Kumaishi, Kie
    Usui, Erika
    Kobori, Shungo
    Sato, Takumi
    Toda, Yusuke
    Yamasaki, Yuji
    Tsujimoto, Hisashi
    Ichihashi, Yasunori
    Iwata, Hiroyoshi
    MICROBIOME, 2022, 10 (01)
  • [6] Stochastic variational variable selection for high-dimensional microbiome data
    Tung Dang
    Kie Kumaishi
    Erika Usui
    Shungo Kobori
    Takumi Sato
    Yusuke Toda
    Yuji Yamasaki
    Hisashi Tsujimoto
    Yasunori Ichihashi
    Hiroyoshi Iwata
    Microbiome, 10
  • [7] An ensemble learning method for variable selection: application to high-dimensional data and missing values
    Bar-Hen, Avner
    Audigier, Vincent
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2022, 92 (16) : 3488 - 3510
  • [8] PALLADIO: a parallel framework for robust variable selection in high-dimensional data
    Barbieri, Matteo
    Fiorini, Samuele
    Tomasi, Federico
    Barla, Annalisa
    PROCEEDINGS OF PYHPC2016: 6TH WORKSHOP ON PYTHON FOR HIGH-PERFORMANCE AND SCIENTIFIC COMPUTING, 2016, : 19 - 26
  • [9] Variable selection techniques after multiple imputation in high-dimensional data
    Faisal Maqbool Zahid
    Shahla Faisal
    Christian Heumann
    Statistical Methods & Applications, 2020, 29 : 553 - 580
  • [10] Stable Variable Selection for High-Dimensional Genomic Data with Strong Correlations
    Sarkar R.
    Manage S.
    Gao X.
    Annals of Data Science, 2024, 11 (04) : 1139 - 1164