Exact Probability Distribution for the ROC Area under Curve

被引:13
|
作者
Ekstrom, Joakim [1 ]
Akerren Ogren, Jim [1 ]
Sjoblom, Tobias [1 ]
机构
[1] Uppsala Univ, Rudbecklab, Dept Immunol Genet Pathol, S-75257 Uppsala, Sweden
关键词
receiver operating characteristic; AUC-value distribution function; AUC p-value; exact test; OPERATING CHARACTERISTIC CURVES; CANCER BIOMARKERS;
D O I
10.3390/cancers15061788
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Simple Summary This contribution allows for the computation of exact p-values and for conducting accurate statistical hypothesis tests of ROC AUC-values. As a result, the development of diagnostic tests is facilitated. This work is illustrated via simulated data and through the development of proteomic blood biomarkers for the early detection of cancer. The Receiver Operating Characteristic (ROC) is a de facto standard for determining the accuracy of in vitro diagnostic (IVD) medical devices, and thus the exactness in its probability distribution is crucial toward accurate statistical inference. We show the exact probability distribution of the ROC AUC-value, hence exact critical values and p-values are readily obtained. Because the exact calculations are computationally intense, we demonstrate a method of geometric interpolation, which is exact in a special case but generally an approximation, vastly increasing computational speeds. The method is illustrated through open access data, demonstrating superiority of 26 composite biomarkers relative to a predicate device. Especially under correction for testing of multiple hypotheses, traditional asymptotic approximations are encumbered by considerable imprecision, adversely affecting IVD device development. The ability to obtain exact p-values will allow more efficient IVD device development.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Bounding sample size projections for the area under a ROC curve
    Blume, Jeffrey D.
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2009, 139 (03) : 711 - 721
  • [2] Robustness of Approaches to ROC Curve Modeling under Misspecification of the Underlying Probability Model
    Devlin, Sean M.
    Thomas, Elizabeth G.
    Emerson, Scott S.
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2013, 42 (20) : 3655 - 3664
  • [3] Estimating the uncertainty in the estimated mean area under the ROC curve of a classifier
    Yousef, WA
    Wagner, RF
    Loew, MH
    PATTERN RECOGNITION LETTERS, 2005, 26 (16) : 2600 - 2610
  • [4] Rank-based kernel estimation of the area under the ROC curve
    Yin, Jingjing
    Hao, Yi
    Samawi, Hani
    Rochani, Haresh
    STATISTICAL METHODOLOGY, 2016, 32 : 91 - 106
  • [5] Optimizing area under the ROC curve using semi-supervised learning
    Wang, Shijun
    Li, Diana
    Petrick, Nicholas
    Sahiner, Berkman
    Linguraru, Marius George
    Summers, Ronald M.
    PATTERN RECOGNITION, 2015, 48 (01) : 276 - 287
  • [6] Performance of tests based on the area under the ROC curve for multireader diagnostic data
    Hwang, Yi-Ting
    Hsu, Ya-Ru
    Su, Nan-Cheng
    JOURNAL OF APPLIED STATISTICS, 2025, 52 (03) : 555 - 577
  • [7] A Simple Generalisation of the Area Under the ROC Curve for Multiple Class Classification Problems
    David J. Hand
    Robert J. Till
    Machine Learning, 2001, 45 : 171 - 186
  • [8] A simple generalisation of the area under the ROC curve for multiple class classification problems
    Hand, DJ
    Till, RJ
    MACHINE LEARNING, 2001, 45 (02) : 171 - 186
  • [9] A modified area under the ROC curve and its application to marker selection and classification
    Yu, WenBao
    Chang, Yuan-chin Ivan
    Park, Eunsik
    JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2014, 43 (02) : 161 - 175
  • [10] On the analysis of glycomics mass spectrometry data via the regularized area under the ROC curve
    Jingjing Ye
    Hao Liu
    Crystal Kirmiz
    Carlito B Lebrilla
    David M Rocke
    BMC Bioinformatics, 8