Robust combination of multiple diagnostic tests for classifying censored event times

被引:20
作者
Cai, T. [1 ]
Cheng, S. [2 ]
机构
[1] Harvard Univ, Dept Biostat, Boston, MA 02115 USA
[2] Univ Calif San Francisco, Dept Epidemiol & Biostat, San Francisco, CA 94143 USA
关键词
censored surival time; nonparametric transformation models; resampling; robust estimation; time-specific classification accuracy;
D O I
10.1093/biostatistics/kxm037
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Recent advancement in technology promises to yield a multitude of tests for disease diagnosis and prognosis. When there are multiple sources of information available, it is often of interest to construct a composite score that can provide better classification accuracy than any individual measurement. In this paper, we consider robust procedures for optimally combining tests when test results are measured prior to disease onset and disease status evolves over time. To account for censoring of disease onset time, the most commonly used approach to combining tests to detect subsequent disease status is to fit a proportional hazards model (Cox, 1972) and use the estimated risk score. However, simulation studies suggested that such a risk score may have poor accuracy when the proportional hazards assumption fails. We propose the use of a nonparametric transformation model (Han, 1987) as a working model to derive an optimal composite score with theoretical justification. We demonstrate that the proposed score is the optimal score when the model holds and is optimal "on average" among linear scores even if the model fails. Time-dependent sensitivity, specificity, and receiver operating characteristic curve functions are used to quantify the accuracy of the resulting composite score. We provide consistent and asymptotically Gaussian estimators of these accuracy measures. A simple model-free resampling procedure is proposed to obtain all consistent variance estimators. We illustrate the new proposals with simulation studies and an analysis of a breast cancer gene expression data set.
引用
收藏
页码:216 / 233
页数:18
相关论文
共 45 条
[1]   Pairwise-difference rank estimation of the transformation model [J].
Abrevaya, J .
JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 2003, 21 (03) :437-447
[2]   Singular value decomposition for genome-wide expression data processing and modeling [J].
Alter, O ;
Brown, PO ;
Botstein, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (18) :10101-10106
[3]  
[Anonymous], APPL STAT
[4]   The sensitivity and specificity of markers for event times [J].
Cai, TX ;
Pepe, MS ;
Zheng, YY ;
Lumley, T ;
Jenny, NS .
BIOSTATISTICS, 2006, 7 (02) :182-197
[5]   Rank estimators for monotonic index models [J].
Cavanagh, C ;
Sherman, RP .
JOURNAL OF ECONOMETRICS, 1998, 84 (02) :351-381
[6]   Semiparametric analysis of transformation models with censored data [J].
Chen, KN ;
Jin, ZZ ;
Ying, ZL .
BIOMETRIKA, 2002, 89 (03) :659-668
[7]   Rank estimation of transformation models [J].
Chen, SN .
ECONOMETRICA, 2002, 70 (04) :1683-1697
[8]   Analysis of transformation models with censored data [J].
Cheng, SC ;
Wei, LJ ;
Ying, Z .
BIOMETRIKA, 1995, 82 (04) :835-845
[9]  
COX DR, 1972, J R STAT SOC B, V34, P187
[10]   Semiparametric estimation of a censored regression model with an unknown transformation of the dependent variable [J].
Gorgens, T ;
Horowitz, JL .
JOURNAL OF ECONOMETRICS, 1999, 90 (02) :155-191