Youden index and optimal cut-point estimated from observations affected by a lower limit of detection

被引:896
作者
Ruopp, Marcus D. [1 ]
Perkins, Neil J. [1 ]
Whitcomb, Brian W. [1 ]
Schisterman, Enrique F. [1 ]
机构
[1] NICHHD, Natl Inst Hlth, Div Epidemiol Stat & Prevent Res, DHHS, Bethesda, MD 20892 USA
关键词
Youden Index; ROC curve; sensitivity and specificity; optimal cut-point;
D O I
10.1002/bimj.200710415
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The receiver operating characteristic (ROC) curve is used to evaluate a biomarker's ability for classifying disease status. The Youden Index (J), the maximum potential effectiveness of a biomarker, is a common summary measure of the ROC curve. In biomarker development, levels may be unquantifiable below a limit of detection (LOD) and missing from the overall dataset. Disregarding these observations may negatively bias the ROC curve and thus J. Several correction methods have been suggested for mean estimation and testing; however, little has been written about the ROC curve or its summary measures. We adapt non-parametric (empirical) and semi-parametric (ROC-GLM [generalized linear model]) methods and propose parametric methods (maximum likelihood (ML)) to estimate J and the optimal cut-point (c*) for a biomarker affected by a LOD. We develop unbiased estimators of J and c* via ML for normally and gamma distributed biomarkers. Alpha level confidence intervals are proposed using delta and bootstrap methods for the ML, semi-parametric, and non-parametric approaches respectively. Simulation studies are conducted over a range of distributional scenarios and sample sizes evaluating estimators' bias, root-mean square error, and coverage probability; the average bias was less than one percent for ML and GLM methods across scenarios and decreases with increased sample size. An example using polychlorinated biphenyl levels to classify women with and without endometriosis illustrates the potential benefits of these methods. We address the limitations and usefulness of each method in order to give researchers guidance in constructing appropriate estimates of biomarkers' true discriminating capabilities.
引用
收藏
页码:419 / 430
页数:12
相关论文
共 21 条
  • [1] [Anonymous], 2004, SPRINGER TEXTS STAT
  • [2] ESTIMATING THE MEAN AND VARIANCE OF NORMAL POPULATIONS FROM SINGLY TRUNCATED AND DOUBLY TRUNCATED SAMPLES
    COHEN, AC
    [J]. ANNALS OF MATHEMATICAL STATISTICS, 1950, 21 (04): : 557 - 569
  • [3] Faraggi D, 2000, STAT MED, V19, P61, DOI 10.1002/(SICI)1097-0258(20000115)19:1<61::AID-SIM297>3.3.CO
  • [4] 2-1
  • [6] ASYMPTOTIC VARIANCES AND COVARIANCES OF MAXIMUM-LIKELIHOOD ESTIMATORS FROM CENSORED SAMPLES OF PARAMETERS OF WEIBULL AND GAMMA POPULATIONS
    HARTER, HL
    MOORE, AH
    [J]. ANNALS OF MATHEMATICAL STATISTICS, 1967, 38 (02): : 557 - &
  • [7] HARTER HL, 1966, BIOMETRIKA, V53, P205
  • [8] NONDETECTS, DETECTION LIMITS, AND THE PROBABILITY OF DETECTION
    LAMBERT, D
    PETERSON, B
    TERPENNING, I
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1991, 86 (414) : 266 - 277
  • [9] Environmental PCB exposure and risk of endometriosis
    Louis, GMB
    Weiner, JM
    Whitcomb, BW
    Sperrazza, R
    Schisterman, EF
    Lobdell, DT
    Crickard, K
    Greizerstein, H
    Kostyniak, PJ
    [J]. HUMAN REPRODUCTION, 2005, 20 (01) : 279 - 285
  • [10] Comparing the areas under two correlated ROC curves: Parametric and non-parametric approaches
    Molodianovitch, Katy
    Faraggi, David
    Reiser, Benjamin
    [J]. BIOMETRICAL JOURNAL, 2006, 48 (05) : 745 - 757