Identifying Important Risk Factors for Survival in Patient With Systolic Heart Failure Using Random Survival Forests

被引:122
作者
Hsich, Eileen [2 ,4 ]
Gorodeski, Eiran Z. [2 ]
Blackstone, Eugene H. [2 ,3 ,4 ]
Ishwaran, Hemant [3 ]
Lauer, Michael S. [1 ]
机构
[1] NHLBI, Div Cardiovasc Sci, NIH, Rockledge Ctr 2, Bethesda, MD 20892 USA
[2] Inst Heart & Vasc, Cleveland, OH USA
[3] Dept Quantitat Hlth Sci, Cleveland, OH USA
[4] Case Western Reserve Univ, Sch Med, Cleveland, OH USA
来源
CIRCULATION-CARDIOVASCULAR QUALITY AND OUTCOMES | 2011年 / 4卷 / 01期
关键词
heart failure; prognosis; statistics; survival analyses; AMBULATORY PATIENTS; PREDICT SURVIVAL; CLINICAL INDEX; MORTALITY; SCORE; MODEL; CLASSIFICATION; ASSOCIATION; EVENTS;
D O I
10.1161/CIRCOUTCOMES.110.939371
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background-Heart failure survival models typically are constructed using Cox proportional hazards regression. Regression modeling suffers from a number of limitations, including bias introduced by commonly used variable selection methods. We illustrate the value of an intuitive, robust approach to variable selection, random survival forests (RSF), in a large clinical cohort. RSF are a potentially powerful extensions of classification and regression trees, with lower variance and bias. Methods and Results-We studied 2231 adult patients with systolic heart failure who underwent cardiopulmonary stress testing. During a mean follow-up of 5 years, 742 patients died. Thirty-nine demographic, cardiac and noncardiac comorbidity, and stress testing variables were analyzed as potential predictors of all-cause mortality. An RSF of 2000 trees was constructed, with each tree constructed on a bootstrap sample from the original cohort. The most predictive variables were defined as those near the tree trunks (averaged over the forest). The RSF identified peak oxygen consumption, serum urea nitrogen, and treadmill exercise time as the 3 most important predictors of survival. The RSF predicted survival similarly to a conventional Cox proportional hazards model (out-of-bag C-index of 0.705 for RSF versus 0.698 for Cox proportional hazards model). Conclusions-An RSF model in a cohort of patients with heart failure performed as well as a traditional Cox proportional hazard model and may serve as a more intuitive approach for clinicians to identify important risk factors for all-cause mortality. (Circ Cardiovasc Qual Outcomes. 2011;4:39-45.)
引用
收藏
页码:39 / 45
页数:7
相关论文
共 30 条
[1]   Development and prospective validation of a clinical index to predict survival in ambulatory patients referred for cardiac transplant evaluation [J].
Aaronson, KD ;
Schwartz, JS ;
Chen, TM ;
Wong, KL ;
Goin, JE ;
Mancini, DM .
CIRCULATION, 1997, 95 (12) :2660-2667
[2]   Logistic regression had superior performance compared with regression trees for predicting in-hospital mortality in patients hospitalized with heart failure [J].
Austin, Peter C. ;
Tu, Jack V. ;
Lee, Douglas S. .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2010, 63 (10) :1145-1155
[3]   SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation [J].
Blewitt, Marnie E. ;
Gendrel, Anne-Valerie ;
Pang, Zhenyi ;
Sparrow, Duncan B. ;
Whitelaw, Nadia ;
Craig, Jeffrey M. ;
Apedaile, Anwyn ;
Hilton, Douglas J. ;
Dunwoodie, Sally L. ;
Brockdorff, Neil ;
Kay, Graham F. ;
Whitelaw, Emma .
NATURE GENETICS, 2008, 40 (05) :663-669
[4]  
Breiman L, 1996, ANN STAT, V24, P2350
[5]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[6]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[7]   A multivariate model for predicting mortality in patients with heart failure and systolic dysfunction [J].
Brophy, JM ;
Dagenais, GR ;
McSherry, F ;
Williford, W ;
Yusuf, S .
AMERICAN JOURNAL OF MEDICINE, 2004, 116 (05) :300-304
[8]   PREDICTION OF CREATININE CLEARANCE FROM SERUM CREATININE [J].
COCKCROFT, DW ;
GAULT, MH .
NEPHRON, 1976, 16 (01) :31-41
[9]  
HARRELL FE, 2001, REGRESSION MODELING, P49
[10]   Analysis of multiple SNPs in genetic association studies: Comparison of three multi-locus methods to prioritize and select SNPs [J].
Heidema, A. Geert ;
Feskens, Edith J. M. ;
Doevendans, Pieter A. F. M. ;
Ruven, Henk J. T. ;
Van Houwelingen, Hans C. ;
Mariman, Edwin C. M. ;
Boer, Jolanda M. A. .
GENETIC EPIDEMIOLOGY, 2007, 31 (08) :910-921