Modernising Receiver Operating Characteristic (ROC) Curves

被引:9
作者
Pendrill, Leslie R. [1 ]
Melin, Jeanette [1 ,2 ]
Stavelin, Anne [3 ]
Nordin, Gunnar [4 ]
机构
[1] RISE Res Inst Sweden, Div Safety & Transport, Measurement Sci & Technol, S-41258 Gothenburg, Sweden
[2] Swedish Def Univ, Dept Leadership Demand & Control, S-65340 Karlstad, Sweden
[3] Norwegian Org Qual Improvement Lab Examinat, Noklus, Boks 6165, N-5892 Bergen, Norway
[4] Equalis, Box 977, SE-75109 Uppsala, Sweden
基金
欧盟地平线“2020”;
关键词
measurement system analysis; rating ability; ordinality; receiver operating characteristic; decision risks;
D O I
10.3390/a16050253
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The justification for making a measurement can be sought in asking what decisions are based on measurement, such as in assessing the compliance of a quality characteristic of an entity in relation to a specification limit, SL. The relative performance of testing devices and classification algorithms used in assessing compliance is often evaluated using the venerable and ever popular receiver operating characteristic (ROC). However, the ROC tool has potentially all the limitations of classic test theory (CTT) such as the non-linearity, effects of ordinality and confounding task difficulty and instrument ability. These limitations, inherent and often unacknowledged when using the ROC tool, are tackled here for the first time with a modernised approach combining measurement system analysis (MSA) and item response theory (IRT), using data from pregnancy testing as an example. The new method of assessing device ability from separate Rasch IRT regressions for each axis of ROC curves is found to perform significantly better, with correlation coefficients with traditional area-under-curve metrics of at least 0.92 which exceeds that of linearised ROC plots, such as Linacre's, and is recommended to replace other approaches for device assessment. The resulting improved measurement quality of each ROC curve achieved with this original approach should enable more reliable decision-making in conformity assessment in many scenarios, including machine learning, where its use as a metric for assessing classification algorithms has become almost indispensable.
引用
收藏
页数:22
相关论文
共 27 条
[1]   Current status of the lateral flow immunoassay for the detection of SARS-CoV-2 in nasopharyngeal swabs [J].
Bacura, Anita Somborac ;
Dorotic, Marija ;
Grosic, Leonarda ;
Dzimbeg, Monika ;
Dodig, Slavica .
BIOCHEMIA MEDICA, 2021, 31 (02)
[2]   Ability evaluation by binary tests: Problems, challenges & recent advances [J].
Bashkansky, E. ;
Turetsky, V. .
2016 JOINT IMEKO TC1-TC7-TC13 SYMPOSIUM: METROLOGY ACROSS THE SCIENCES: WISHFUL THINKING?, 2016, 772
[3]  
Birdsall T.G., 1973, The theory of signal detectability: Roc curves and their character
[4]  
Cipriani Daniel, 2005, J Appl Meas, V6, P180
[5]  
E11 Committee, 2022, GUID MEAS SYST AN MS
[6]  
Fisher W.P., 2023, SPRINGER SERIES MEAS, P311
[7]  
Fisher William P Jr, 2010, J Appl Meas, V11, P271
[8]  
Hibbert B., 2013, Chem. Int, V35, P22, DOI DOI 10.1515/CI.2013.35.2.22
[9]   Lateral flow assays [J].
Koczula, Katarzyna M. ;
Gallotta, Andrea .
BIOSENSOR TECHNOLOGIES FOR DETECTION OF BIOMOLECULES, 2016, 60 (01) :111-120
[10]  
KROUWER JS, 1987, CLIN CHEM, V33, P2305