High-Dimensional Variable Selection With Competing Events Using Cooperative Penalized Regression

被引:0
|
作者
Burk, Lukas [1 ,2 ,3 ,4 ]
Bender, Andreas [2 ,4 ]
Wright, Marvin N. [1 ,2 ,5 ]
机构
[1] Leibniz Inst Prevent Res & Epidemiol BIPS, Bremen, Germany
[2] Ludwig Maximilians Univ Munchen, Dept Stat, Munich, Germany
[3] Univ Bremen, Fac Math & Comp Sci, Bremen, Germany
[4] Munich Ctr Machine Learning MCML, Munich, Germany
[5] Univ Copenhagen, Dept Publ Hlth, Copenhagen, Denmark
关键词
competing risks; Cox regression; high-dimensional data analysis; penalized regression; variable selection; REGULARIZATION PATHS; FORESTS;
D O I
10.1002/bimj.70036
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Variable selection is an important step in the analysis of high-dimensional data, yet there are limited options for survival outcomes in the presence of competing risks. Commonly employed penalized Cox regression considers each event type separately through cause-specific models, neglecting possibly shared information between them. We adapt the feature-weighted elastic net (fwelnet), an elastic net generalization, to survival outcomes and competing risks. For two causes, our proposed algorithm fits two alternating cause-specific models, where each model receives the coefficient vector of the complementary model as prior information. We dub this "cooperative penalized regression," as it enables the modeling of competing risk data with cause-specific models while accounting for shared effects between causes. Coefficients that are shrunken toward zero in the model for the first cause will receive larger penalization weights in the model for the second cause and vice versa. Through multiple iterations, this process ensures stronger penalization of uninformative predictors in both models. We demonstrate our method's variable selection capabilities on simulated genomics data and apply it to bladder cancer microarray data. We evaluate selection performance using the positive predictive value for the correct selection of informative features and the false positive rate for the selection of uninformative variables. The benchmark compares results with cause-specific penalized Cox regression, random survival forests, and likelihood-boosted Cox regression. Results indicate that our approach is more effective at selecting informative features and removing uninformative features. In settings without shared effects, variable selection performance is similar to cause-specific penalized Cox regression.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] High-dimensional macroeconomic forecasting and variable selection via penalized regression
    Uematsu, Yoshimasa
    Tanaka, Shinya
    ECONOMETRICS JOURNAL, 2019, 22 (01): : 34 - +
  • [2] Penalized variable selection in competing risks regression
    Fu, Zhixuan
    Parikh, Chirag R.
    Zhou, Bingqing
    LIFETIME DATA ANALYSIS, 2017, 23 (03) : 353 - 376
  • [3] Penalized variable selection in competing risks regression
    Zhixuan Fu
    Chirag R. Parikh
    Bingqing Zhou
    Lifetime Data Analysis, 2017, 23 : 353 - 376
  • [4] SCAD-penalized quantile regression for high-dimensional data analysis and variable selection
    Amin, Muhammad
    Song, Lixin
    Thorlie, Milton Abdul
    Wang, Xiaoguang
    STATISTICA NEERLANDICA, 2015, 69 (03) : 212 - 235
  • [5] Comparison of variable selection methods for high-dimensional survival data with competing events
    Gilhodes, Julia
    Zemmour, Christophe
    Ajana, Soufiane
    Martinez, Alejandra
    Delord, Jean-Pierre
    Leconte, Eve
    Boher, Jean-Marie
    Filleron, Thomas
    COMPUTERS IN BIOLOGY AND MEDICINE, 2017, 91 : 159 - 167
  • [6] Penalized estimation for competing risks regression with applications to high-dimensional covariates
    Ambrogi, Federico
    Scheike, Thomas H.
    BIOSTATISTICS, 2016, 17 (04) : 708 - 721
  • [7] High-Dimensional Regression and Variable Selection Using CAR Scores
    Zuber, Verena
    Strimmer, Korbinian
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2011, 10 (01)
  • [8] Variable Selection via SCAD-Penalized Quantile Regression for High-Dimensional Count Data
    Khan, Dost Muhammad
    Yaqoob, Anum
    Iqbal, Nadeem
    Wahid, Abdul
    Khalil, Umair
    Khan, Mukhtaj
    Abd Rahman, Mohd Amiruddin
    Mustafa, Mohd Shafie
    Khan, Zardad
    IEEE ACCESS, 2019, 7 : 153205 - 153216
  • [9] A stepwise regression algorithm for high-dimensional variable selection
    Hwang, Jing-Shiang
    Hu, Tsuey-Hwa
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2015, 85 (09) : 1793 - 1806
  • [10] Variable Selection Diagnostics Measures for High-Dimensional Regression
    Nan, Ying
    Yang, Yuhong
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2014, 23 (03) : 636 - 656