Investigating the prediction ability of survival models based on both clinical and omics data: two case studies

被引:36
作者
De Bin, Riccardo [1 ]
Sauerbrei, Willi [2 ]
Boulesteix, Anne-Laure [1 ]
机构
[1] Univ Munich, Dept Med Informat Biometry & Epidemiol, D-81377 Munich, Germany
[2] Univ Med Ctr Freiburg, Dept Med Biometry & Med Informat, Freiburg, Germany
关键词
clinical information; combining clinical and omics data; high-dimensional data; prediction models; survival analysis; GENE-EXPRESSION ANALYSIS; VARIABLE SELECTION; GENOMIC MODELS; TIME; CLASSIFICATION; PERFORMANCE; VALIDATION; REGRESSION; REGULARIZATION; MICROARRAYS;
D O I
10.1002/sim.6246
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In biomedical literature, numerous prediction models for clinical outcomes have been developed based either on clinical data or, more recently, on high-throughput molecular data (omics data). Prediction models based on both types of data, however, are less common, although some recent studies suggest that a suitable combination of clinical and molecular information may lead to models with better predictive abilities. This is probably due to the fact that it is not straightforward to combine data with different characteristics and dimensions (poorly characterized high-dimensional omics data, well-investigated low-dimensional clinical data). In this paper, we analyze two publicly available datasets related to breast cancer and neuroblastoma, respectively, in order to show some possible ways to combine clinical and omics data into a prediction model of time-to-event outcome. Different strategies and statistical methods are exploited. The results are compared and discussed according to different criteria, including the discriminative ability of the models, computed on a validation dataset. Copyright (c) 2014 John Wiley & Sons, Ltd.
引用
收藏
页码:5310 / 5329
页数:20
相关论文
共 51 条
[11]   An empirical assessment of validation practices for molecular classifiers [J].
Castaldi, Peter J. ;
Dahabreh, Issa J. ;
Ioannidis, John P. A. .
BRIEFINGS IN BIOINFORMATICS, 2011, 12 (03) :189-202
[12]  
Dobbin KK, 2005, CLIN CANCER RES, V11, P565
[13]   Regularization Paths for Generalized Linear Models via Coordinate Descent [J].
Friedman, Jerome ;
Hastie, Trevor ;
Tibshirani, Rob .
JOURNAL OF STATISTICAL SOFTWARE, 2010, 33 (01) :1-22
[14]   Greedy function approximation: A gradient boosting machine [J].
Friedman, JH .
ANNALS OF STATISTICS, 2001, 29 (05) :1189-1232
[15]   Estimating a time-dependentconcordance index for survival prediction models with covariate dependent censoring [J].
Gerds, Thomas A. ;
Kattan, Michael W. ;
Schumacher, Martin ;
Yu, Changhong .
STATISTICS IN MEDICINE, 2013, 32 (13) :2173-2184
[16]  
Graf E, 1999, STAT MED, V18, P2529
[17]  
Harrell FE, 1996, STAT MED, V15, P361, DOI 10.1002/(SICI)1097-0258(19960229)15:4<361::AID-SIM168>3.0.CO
[18]  
2-4
[19]   A Genomic Predictor of Response and Survival Following Taxane-Anthracycline Chemotherapy for Invasive Breast Cancer [J].
Hatzis, Christos ;
Pusztai, Lajos ;
Valero, Vicente ;
Booser, Daniel J. ;
Esserman, Laura ;
Lluch, Ana ;
Vidaurre, Tatiana ;
Holmes, Frankie ;
Souchon, Eduardo ;
Wang, Hongkun ;
Martin, Miguel ;
Cotrina, Jose ;
Gomez, Henry ;
Hubbard, Rebekah ;
Ignacio Chacon, J. ;
Ferrer-Lozano, Jaime ;
Dyer, Richard ;
Buxton, Meredith ;
Gong, Yun ;
Wu, Yun ;
Ibrahim, Nuhad ;
Andreopoulou, Eleni ;
Ueno, Naoto T. ;
Hunt, Kelly ;
Yang, Wei ;
Nazario, Arlene ;
DeMichele, Angela ;
O'Shaughnessy, Joyce ;
Hortobagyi, Gabriel N. ;
Symmans, W. Fraser .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2011, 305 (18) :1873-1881
[20]   An eight-gene expression signature for the prediction of survival and time to treatment in chronic lymphocytic leukemia [J].
Herold, T. ;
Jurinovic, V. ;
Metzeler, K. H. ;
Boulesteix, A-L ;
Bergmann, M. ;
Seiler, T. ;
Mulaw, M. ;
Thoene, S. ;
Dufour, A. ;
Pasalic, Z. ;
Schmidberger, M. ;
Schmidt, M. ;
Schneider, S. ;
Kakadia, P. M. ;
Feuring-Buske, M. ;
Braess, J. ;
Spiekermann, K. ;
Mansmann, U. ;
Hiddemann, W. ;
Buske, C. ;
Bohlander, S. K. .
LEUKEMIA, 2011, 25 (10) :1639-1645