High-Dimensional Variable Selection With Competing Events Using Cooperative Penalized Regression

Times cited: 0
Authors
Burk, Lukas [1, 2, 3, 4]
Bender, Andreas [2, 4]
Wright, Marvin N. [1, 2, 5]
Affiliations
[1] Leibniz Inst Prevent Res & Epidemiol BIPS, Bremen, Germany
[2] Ludwig Maximilians Univ Munchen, Dept Stat, Munich, Germany
[3] Univ Bremen, Fac Math & Comp Sci, Bremen, Germany
[4] Munich Ctr Machine Learning MCML, Munich, Germany
[5] Univ Copenhagen, Dept Publ Hlth, Copenhagen, Denmark
Keywords
competing risks; Cox regression; high-dimensional data analysis; penalized regression; variable selection; REGULARIZATION PATHS; FORESTS
DOI
10.1002/bimj.70036
Chinese Library Classification
Q [Biological Sciences]
Subject classification codes
07; 0710; 09
Abstract
Variable selection is an important step in the analysis of high-dimensional data, yet there are limited options for survival outcomes in the presence of competing risks. Commonly employed penalized Cox regression considers each event type separately through cause-specific models, neglecting possibly shared information between them. We adapt the feature-weighted elastic net (fwelnet), an elastic net generalization, to survival outcomes and competing risks. For two causes, our proposed algorithm fits two alternating cause-specific models, where each model receives the coefficient vector of the complementary model as prior information. We dub this "cooperative penalized regression," as it enables the modeling of competing risk data with cause-specific models while accounting for shared effects between causes. Coefficients that are shrunken toward zero in the model for the first cause will receive larger penalization weights in the model for the second cause and vice versa. Through multiple iterations, this process ensures stronger penalization of uninformative predictors in both models. We demonstrate our method's variable selection capabilities on simulated genomics data and apply it to bladder cancer microarray data. We evaluate selection performance using the positive predictive value for the correct selection of informative features and the false positive rate for the selection of uninformative variables. The benchmark compares results with cause-specific penalized Cox regression, random survival forests, and likelihood-boosted Cox regression. Results indicate that our approach is more effective at selecting informative features and removing uninformative features. In settings without shared effects, variable selection performance is similar to cause-specific penalized Cox regression.
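The weighting idea and the two evaluation metrics described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names `penalty_weights` and `selection_metrics`, the parameter `theta`, and the exponential weight form are assumptions chosen only to show that coefficients near zero in one cause-specific model translate into larger penalty weights in the other. The metric definitions follow the standard meaning of positive predictive value and false positive rate.

```python
import math


def penalty_weights(other_coefs, theta=1.0):
    """Map the other cause's coefficients to per-feature penalty weights.

    Assumed illustrative form (not given in the abstract): the weight of
    feature j is proportional to exp(-theta * |beta_j|), rescaled so that
    the weights average to 1. Small |beta_j| gives a larger weight.
    """
    raw = [math.exp(-theta * abs(b)) for b in other_coefs]
    mean_raw = sum(raw) / len(raw)
    return [r / mean_raw for r in raw]


def selection_metrics(selected, informative, n_features):
    """Return (PPV, FPR) for a set of selected feature indices.

    PPV = selected informative features / all selected features.
    FPR = selected uninformative features / all uninformative features.
    """
    selected, informative = set(selected), set(informative)
    tp = len(selected & informative)
    fp = len(selected - informative)
    n_uninformative = n_features - len(informative)
    ppv = tp / len(selected) if selected else float("nan")
    fpr = fp / n_uninformative if n_uninformative else float("nan")
    return ppv, fpr


# Hypothetical coefficients from the model for cause 1 (features 1 and 3
# were shrunken to zero), used as prior information for cause 2.
coefs_cause1 = [0.8, 0.0, -0.5, 0.0]
weights_cause2 = penalty_weights(coefs_cause1)

# Hypothetical selection result: features 0 and 2 are truly informative,
# and the model selected features 0, 2, and 3.
ppv, fpr = selection_metrics(selected=[0, 2, 3], informative=[0, 2], n_features=4)
```

In this toy setting, the features with zero coefficients in the first model receive the largest weights for the second model, and the selection counts two true positives and one false positive among two uninformative features.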
Pages: 12