Is the area under an ROC curve a valid measure of the performance of a screening or diagnostic test?

被引:48
|
作者
Wald, N. J. [1 ]
Bestwick, J. P. [1 ]
机构
[1] Barts & London Queen Marys Sch Med & Dent, Wolfson Inst Prevent Med, London EC1M 6BQ, England
关键词
ROC curve; AUC; screening test; diagnostic test;
D O I
10.1177/0969141313517497
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Objectives: The area under a receiver operating characteristic (ROC) curve (the AUC) is used as a measure of the performance of a screening or diagnostic test. We here assess the validity of the AUC. Methods: Assuming the test results follow Gaussian distributions in affected and unaffected individuals, standard mathematical formulae were used to describe the relationship between the detection rate (DR) (or sensitivity) and the false-positive rate (FPR) of a test with the AUC. These formulae were used to calculate the screening performance (DR for a given FPR, or FPR for a given DR) for different AUC values according to different standard deviations of the test result in affected and unaffected individuals. Results: The DR for a given FPR is strongly dependent on relative differences in the standard deviation of the test variable in affected and unaffected individuals. Consequently, two tests with the same AUC can have a different DR for the same FPR. For example, an AUC of 0.75 has a DR of 24% for a 5% FPR if the standard deviations are the same in affected and unaffected individuals, but 39% for the same 5% FPR if the standard deviation in affected individuals is 1.5 times that in unaffected individuals. Conclusion: The AUC is an unreliable measure of screening performance because in practice the standard deviation of a screening or diagnostic test in affected and unaffected individuals can differ. The problem is avoided by not using AUC at all, and instead specifying DRs for given FPRs or FPRs for given DRs.
引用
收藏
页码:51 / 56
页数:6
相关论文
共 50 条
  • [21] Mutual Information as a Performance Measure for Binary Predictors Characterized by Both ROC Curve and PROC Curve Analysis
    Hughes, Gareth
    Kopetzky, Jennifer
    McRoberts, Neil
    ENTROPY, 2020, 22 (09)
  • [22] ESTIMATION OF AREA UNDER THE ROC CURVE UNDER NONIGNORABLE VERIFICATION BIAS
    Yu, Wenbao
    Kim, Jae Kwang
    Park, Taesung
    STATISTICA SINICA, 2018, 28 (04) : 2149 - 2166
  • [23] A relationship between the incremental values of area under the ROC curve and of area under the precision-recall curve
    Qian M. Zhou
    Lu Zhe
    Russell J. Brooke
    Melissa M. Hudson
    Yan Yuan
    Diagnostic and Prognostic Research, 5 (1)
  • [24] Combining biomarkers linearly and nonlinearly for classification using the area under the ROC curve
    Fong, Youyi
    Yin, Shuxin
    Huang, Ying
    STATISTICS IN MEDICINE, 2016, 35 (21) : 3792 - 3809
  • [25] Nonparametric additive model with grouped lasso and maximizing area under the ROC curve
    Choi, Sungwoo
    Park, Junyong
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 77 : 313 - 325
  • [26] A modified area under the ROC curve and its application to marker selection and classification
    WenBao Yu
    Yuan-chin Ivan Chang
    Eunsik Park
    Journal of the Korean Statistical Society, 2014, 43 : 161 - 175
  • [27] A Simple Generalisation of the Area Under the ROC Curve for Multiple Class Classification Problems
    David J. Hand
    Robert J. Till
    Machine Learning, 2001, 45 : 171 - 186
  • [28] A simple generalisation of the area under the ROC curve for multiple class classification problems
    Hand, DJ
    Till, RJ
    MACHINE LEARNING, 2001, 45 (02) : 171 - 186
  • [29] A modified area under the ROC curve and its application to marker selection and classification
    Yu, WenBao
    Chang, Yuan-chin Ivan
    Park, Eunsik
    JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2014, 43 (02) : 161 - 175
  • [30] A modified Wald interval for the area under the ROC curve (AUC) in diagnostic case-control studies
    Martina Kottas
    Oliver Kuss
    Antonia Zapf
    BMC Medical Research Methodology, 14