Overestimation of the receiver operating characteristic curve for logistic regression

Cited by: 53
Authors
Copas, JB [1]
Corbett, P [1]
Affiliations
[1] Univ Warwick, Dept Stat, Coventry CV4 7AL, W Midlands, England
Keywords
logistic regression; ROC; screening score; shrinkage
DOI
10.1093/biomet/89.2.315
Chinese Library Classification
Q [Biological Sciences]
Subject classification codes
07; 0710; 09
Abstract
Logistic regression is often used to find a linear combination of covariates which best discriminates between two groups or populations. The ROC, receiver operating characteristic, curve is a good way of assessing the performance of the resulting score, but using the same data both to fit the score and to calculate its ROC leads to an over-optimistic estimate of the performance which the score would give if it were to be validated on a sample of future cases. The paper studies the extent of this overestimation, and suggests a shrinkage correction for the ROC curve itself and for the area under the curve. The correction is consistent with Efron's formula for the bias in the error rate of a binary prediction rule. Two medical examples are discussed.
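The overestimation described in the abstract can be illustrated with a small simulation. The sketch below is not the paper's shrinkage correction; it only compares the area under the ROC curve computed on the fitting data (the "apparent" AUC) with the AUC of the same fitted score on fresh cases. The function names, sample sizes, and the true coefficient vector are all hypothetical choices made for illustration.

```python
import numpy as np

def fit_logistic(X, y, n_iter=25):
    """Maximum-likelihood logistic regression via Newton-Raphson (intercept first)."""
    Xd = np.column_stack([np.ones(len(X)), X])
    beta = np.zeros(Xd.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-(Xd @ beta)))
        w = p * (1.0 - p)
        # Tiny ridge on the Hessian only, for numerical stability; the
        # gradient is unpenalized, so the fixed point is still the MLE.
        hess = (Xd * w[:, None]).T @ Xd + 1e-6 * np.eye(Xd.shape[1])
        beta = beta + np.linalg.solve(hess, Xd.T @ (y - p))
    return beta

def auc(scores, y):
    """Mann-Whitney estimate of the AUC, counting ties as one half."""
    pos = scores[y == 1]
    neg = scores[y == 0]
    diff = pos[:, None] - neg[None, :]
    return (np.sum(diff > 0) + 0.5 * np.sum(diff == 0)) / (len(pos) * len(neg))

def simulate(n_train=100, n_test=1000, n_cov=5, n_rep=100, seed=0):
    """Average apparent and validated AUC over repeated simulated studies."""
    rng = np.random.default_rng(seed)
    true_beta = np.zeros(n_cov)
    true_beta[0] = 0.5  # assumption: only one covariate carries signal

    def draw(n):
        X = rng.standard_normal((n, n_cov))
        p = 1.0 / (1.0 + np.exp(-(X @ true_beta)))
        y = (rng.random(n) < p).astype(float)
        return X, y

    apparent, validated = [], []
    for _ in range(n_rep):
        X_tr, y_tr = draw(n_train)
        X_te, y_te = draw(n_test)
        beta = fit_logistic(X_tr, y_tr)
        # Same fitted score, evaluated on the fitting data and on new cases.
        apparent.append(auc(beta[0] + X_tr @ beta[1:], y_tr))
        validated.append(auc(beta[0] + X_te @ beta[1:], y_te))
    return float(np.mean(apparent)), float(np.mean(validated))

apparent_auc, validated_auc = simulate()
print(f"mean apparent AUC:  {apparent_auc:.3f}")
print(f"mean validated AUC: {validated_auc:.3f}")
```

Under these assumed settings the apparent AUC should on average exceed the validated AUC, and the gap would be expected to widen as the number of covariates grows relative to the sample size.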
Pages: 315-331
Page count: 17
Related papers
11 in total
[1]   STATISTICAL STUDIES OF PROGNOSIS IN ADVANCED BREAST CANCER [J].
ARMITAGE, P ;
MCPHERSO.CK ;
COPAS, JC .
JOURNAL OF CHRONIC DISEASES, 1969, 22 (05) :343-&
[2]   A new strategy for evaluating the impact of epidemiologic risk factors for cancer with application to melanoma [J].
Begg, CB ;
Satagopan, JM ;
Berwick, M .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1998, 93 (442) :415-426
[3]   Screening for cutaneous melanoma by skin self-examination [J].
Berwick, M ;
Begg, CB ;
Fine, JA ;
Roush, GC ;
Barnhill, RL .
JOURNAL OF THE NATIONAL CANCER INSTITUTE, 1996, 88 (01) :17-23
[4]   PROSPECTIVE ANALYSIS OF LOGISTIC CASE-CONTROL STUDIES [J].
CARROLL, RJ ;
WANG, SJ ;
WANG, CY .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1995, 90 (429) :157-169
[5]   The offender group reconviction scale: a statistical reconviction score for use by probation officers [J].
Copas, J ;
Marshall, P .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 1998, 47 :159-171
[6]   COPAS JB, 1983, J R STAT SOC B, V45, P311
[8]   Rank statistics expressible as integrals under P-P-plots and receiver operating characteristic curves [J].
Girling, AJ .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2000, 62 :367-382
[9]   GREEN D, 1988, SIGNAL DETECTION THE
[10]   LEHMANN EL, 1976, NONPARAMETRICS STAT